This paper introduces a novel method for generating an intermediate view of soccer scene taken by multiple video cameras. In the proposed method, soccer scene is classified into dynamic regions, a field region, and a background region. Using epipolar geometry in the first region and homography in the second, dense correspondence is obtained to interpolate views. For the third region, partial area images are extracted from the panoramic image compounded from the background of multiple views. Finally synthesizing them completes intermediate view images of the whole object. Applying this method to actual scenes of a soccer match captured at the stadium, we succeeded in generating natural intermediate view videos.