A new, linguistically annotated, video database for automatic sign language recognition is presented. The new RWTH-BOSTON-400 corpus, which consists of 843 sentences, several spea...
Philippe Dreuw, Carol Neidle, Vassilis Athitsos, S...
This paper describes the design and application of time-enhanced, finite state models of discourse cues to the automated segmentation of broadcast news. We describe our analysis o...
The field of Content-Based Visual Information Retrieval (CBVIR) has experienced tremendous growth in recent years, and many research groups are currently working on solutions to the proble...
In this paper we present a generative model and learning procedure for unsupervised video clustering into scenes. The work addresses two important problems: realistic modeling of ...
Nemanja Petrovic, Aleksandar Ivanovic, Nebojsa Joj...
The Carnegie Mellon University Informedia group has enjoyed consistent success with TRECVID interactive search using traditional storyboard interfaces for shot-based retrieval. Fo...