In this paper, we present an "appearance-based" virtual view generation method for temporally varying events captured by the multiple cameras of the "3D Room" developed by our group. With this method, we can generate images from any virtual viewpoint between two selected real views. The method is based on simple interpolation between the two selected views. The correspondences between the views are generated automatically from the multiple images by means of a volumetric model shape reconstruction framework. Because the correspondences are obtained from the recovered volumetric model, even occluded regions in the views can be interpolated correctly in the virtual view images. We present virtual view image sequences to demonstrate the performance of virtual view generation in the 3D Room.

This research was supported by Robotics Institute internal funds. Partial support was also provided by Intel Corporation, Matsushita Elect...
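
To illustrate the interpolation between two selected views described in the abstract, the following is a minimal sketch, not the implementation used in the 3D Room. It assumes that correspondences are already available as paired pixel positions with colors in the two real views, and that "simple interpolation" means a linear blend of both position and color controlled by a weight `w` (0 gives the first view, 1 gives the second). The names `lerp` and `interpolate_view` are hypothetical, and the volumetric reconstruction that would supply the correspondences is not shown.

```python
from typing import List, Tuple

Point = Tuple[float, float]          # pixel position (x, y)
Color = Tuple[float, float, float]   # color as (r, g, b)


def lerp(a: float, b: float, w: float) -> float:
    """Linear blend: returns a when w == 0 and b when w == 1."""
    return (1.0 - w) * a + w * b


def interpolate_view(
    pts_a: List[Point],
    pts_b: List[Point],
    colors_a: List[Color],
    colors_b: List[Color],
    w: float,
) -> List[Tuple[Point, Color]]:
    """Blend corresponding points of two real views into a virtual view.

    Assumption: element i of every list describes the same scene point,
    i.e. the correspondences have already been computed elsewhere.
    """
    if not 0.0 <= w <= 1.0:
        raise ValueError("w must lie between 0 (view A) and 1 (view B)")
    if not (len(pts_a) == len(pts_b) == len(colors_a) == len(colors_b)):
        raise ValueError("all inputs must have one entry per correspondence")

    virtual = []
    for (xa, ya), (xb, yb), ca, cb in zip(pts_a, pts_b, colors_a, colors_b):
        # Position in the virtual view moves linearly between the two views.
        pos = (lerp(xa, xb, w), lerp(ya, yb, w))
        # Color is blended with the same weight.
        col = tuple(lerp(p, q, w) for p, q in zip(ca, cb))
        virtual.append((pos, col))
    return virtual
```

For example, `interpolate_view([(0.0, 0.0)], [(10.0, 4.0)], [(0.0, 0.0, 0.0)], [(100.0, 200.0, 50.0)], 0.5)` places the point halfway between its two observed positions. A complete renderer would also need to rasterize the blended points into an image and resolve visibility, which this sketch leaves out.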