The application of protein-protein docking to the large-scale interactome analysis, the treatment of protein flexibility or multiple protein-protein docking problem are current challenges in structural bioinformatics that require huge computing resource. In this work we present MEGADOCK 4.0, an FFT-based docking software which makes extensive use of recent GPU supercomputers and show the powerful scalable performance of over 97% strong scaling with TSUBAME 2.5 supercomputing system. In addition, a million protein-protein docking jobs can be calculated about a half day by using 420 nodes of TSUBAME 2.5.