In this paper, we describe our approach and results for high-level feature extraction task at TRECVID 2007. This year, we adopted late fusion of several types of features. As a first step, we extract several types of visual features and ASR texts from the given movies, and apply SVM to them independently. As the next step, we fuse these results by linear combination with weights chosen by cross validation.
O. Liu, Z. Tang, K. Yanai