Sciweavers

103 search results - page 19 / 21
» Contextual Information Improves OOV Detection in Speech
KES
2008
Springer
13 years 7 months ago
Towards Natural Head Movement of Autonomous Speaker Agent
Autonomous Speaker Agent (ASA) is a graphically embodied animated agent capable of reading plain English text and rendering it in a form of speech, accompanied by appropriate, natu...
Marko Brkic, Karlo Smid, Tomislav Pejsa, Igor S. P...
ICASSP
2010
IEEE
13 years 6 months ago
A comparison of approaches for modeling prosodic features in speaker recognition
Prosodic information has been successfully used for speaker recognition for more than a decade. The best-performing prosodic system to date has been one based on features extracte...
Luciana Ferrer, Nicolas Scheffer, Elizabeth Shribe...
ICASSP
2011
IEEE
12 years 11 months ago
A modified MAP criterion based on hidden Markov model for voice activity detection
The maximum a posteriori (MAP) criterion is broadly used in the statistical model-based voice activity detection (VAD) approaches. In the conventional MAP criterion, however, the ...
Shiwen Deng, Jiqing Han, Tieran Zheng, Guibin Zhen...
CIVR
2007
Springer
14 years 1 months ago
Near-duplicate keyframe retrieval with visual keywords and semantic context
Near-duplicate keyframes (NDK) play a unique role in large-scale video search, news topic detection and tracking. In this paper, we propose a novel NDK retrieval approach by explo...
Xiao Wu, Wanlei Zhao, Chong-Wah Ngo
CVIU
2008
13 years 7 months ago
Measuring novelty and redundancy with multiple modalities in cross-lingual broadcast news
News videos from different channels and languages are broadcast every day, providing abundant information for users. To effectively search, retrieve, browse and track news stories...
Xiao Wu, Alexander G. Hauptmann, Chong-Wah Ngo