In this paper, we discuss on an automatic and immediate metadata extraction method by heterogeneous sensors for meeting video streams. The main feature of our method is immediate and automatic extraction of metadata by giving semantics to combinations of heterogeneous sensors for meeting video streams. By this method, we can extract metadata automatically by semantics for combinations of heterogeneous sensor data immediately the target videos are captured. In this paper we describe the feasibility of our method by several experimental results.