Appearance-Based Virtual View Generation of Temporally-Varying Events from Multi-Camera Images in the 3D Room