Sciweavers

165 search results - page 29 / 33
» Hybrid Hierarchical Learning from Dynamic Scenes
Sort
View
ICCV
2011
IEEE
12 years 7 months ago
Parsing Video Events with Goal inference and Intent Prediction
In this paper, we present an event parsing algorithm based on Stochastic Context Sensitive Grammar (SCSG) for understanding events, inferring the goal of agents, and predicting th...
Mingtao Pei, School of Computer Science, Yunde Jia...
MM
2004
ACM
178views Multimedia» more  MM 2004»
14 years 29 days ago
A bootstrapping framework for annotating and retrieving WWW images
Most current image retrieval systems and commercial search engines use mainly text annotations to index and retrieve WWW images. This research explores the use of machine learning...
HuaMin Feng, Rui Shi, Tat-Seng Chua
MM
2009
ACM
252views Multimedia» more  MM 2009»
14 years 2 months ago
Localizing volumetric motion for action recognition in realistic videos
This paper presents a novel motion localization approach for recognizing actions and events in real videos. Examples include StandUp and Kiss in Hollywood movies. The challenge ca...
Xiao Wu, Chong-Wah Ngo, Jintao Li, Yongdong Zhang
TVCG
2012
191views Hardware» more  TVCG 2012»
11 years 10 months ago
Live Speech Driven Head-and-Eye Motion Generators
—This paper describes a fully automated framework to generate realistic head motion, eye gaze, and eyelid motion simultaneously based on live (or recorded) speech input. Its cent...
Binh Huy Le, Xiaohan Ma, Zhigang Deng
AAAI
2008
13 years 10 months ago
Unstructured Audio Classification for Environment Recognition
My thesis aims to contribute towards building autonomous agents that are able to understand their surrounding environment through the use of both audio and visual information. To ...
Selina Chu