Our experiments in TRECVID 2007 include participation in the high-level feature extraction, search, and video summarization tasks, using a common system framework based on multiple parallel Self-Organizing Maps (SOMs). In the high-level feature extraction task, we applied a method of representing semantic concepts as class models on parallel SOMs, combined with external text search results. This year, we introduced a further post-processing stage in which the concepts’ temporal and inter-concept co-occurrences were analyzed. We submitted the following six runs:

• A_PicSOM_1_6: Required visual baseline
• A_PicSOM_2_5: Visual features and text search
• A_PicSOM_3_3: Visual features using variable convolution and text search
• A_PicSOM_4_4: Visual features using variable convolution
• A_PicSOM_5_2: Visual features, text search, and temporal context based on training set
• A_PicSOM_6_1: Visual features, text search, and temporal context based on validation set

The results sh...