Sciweavers

168 search results - page 29 / 34
» Intelligent Visual Descriptor Extraction from Video Sequence...
ICIP
2007
IEEE
13 years 11 months ago
Lipreading by Locality Discriminant Graph
The major problem in building a good lipreading system is to extract effective visual features from an enormous quantity of video sequence data. For appearance-based feature analysi...
Yun Fu, Xi Zhou, Ming Liu, Mark Hasegawa-Johnson, ...
MM
2010
ACM
13 years 7 months ago
Analyzing and predicting sentiment of images on the social web
In this paper we study the connection between sentiment of images expressed in metadata and their visual content in the social photo sharing environment Flickr. To this end, we co...
Stefan Siersdorfer, Enrico Minack, Fan Deng, Jonat...
PAMI
2010
13 years 6 months ago
A Dynamic Texture-Based Approach to Recognition of Facial Actions and Their Temporal Models
In this work we propose a dynamic-texture-based approach to the recognition of facial Action Units (AUs, atomic facial gestures) and their temporal models (i.e., sequences of te...
Sander Koelstra, Maja Pantic, Ioannis Patras
CVIU
2008
13 years 7 months ago
Measuring novelty and redundancy with multiple modalities in cross-lingual broadcast news
News videos from different channels and languages are broadcast every day, providing abundant information for users. To effectively search, retrieve, browse and track news stories...
Xiao Wu, Alexander G. Hauptmann, Chong-Wah Ngo
MM
2010
ACM
13 years 7 months ago
Multimodal location estimation
In this article we define a multimedia content analysis problem, which we call multimodal location estimation: Given a video/image/audio file, the task is to determine where it wa...
Gerald Friedland, Oriol Vinyals, Trevor Darrell