An Automatic and Immediate Metadata Extraction Method by Heterogeneous Sensors for Meeting Video Streams