Our goal is to automatically recognize and enroll new vocabulary in a multimodal interface. To accomplish this our technique aims to leverage the mutually disambiguating aspects o...
Video question answering aims to pinpoint answers in response to user's specified questions. However, most question answering technologies involve in integrating rich specifi...
Local space-time features have recently shown promising results within Bag-of-Features (BoF) approach to action recognition in video. Pure local features and descriptors, however,...
Muhammad Muneeb Ullah, Sobhan Naderi Parizi, Ivan ...
—In this paper, we propose a novel method for extracting handwritten characters from multi-language document images, which may contain various types of characters, e.g. Chinese, ...
Yonghong Song, Guilin Xiao, Yuanlin Zhang, Lei Yan...
WikiWoods is an ongoing initiative to provide rich syntacto-semantic annotations for English Wikipedia. We sketch an automated processing pipeline to extract relevant textual cont...
Dan Flickinger, Stephan Oepen, Gisle Ytrestø...