Sciweavers

123 search results - page 17 / 25
» Improving Acoustic Models with Captioned Multimedia Speech
Sort
View
ICASSP
2011
IEEE
12 years 11 months ago
Enriching Mandarin speech recognition by incorporating a hierarchical prosody model
This paper presents a new probabilistic framework of Mandarin speech recognition by incorporating a sophisticated hierarchical prosody model into the conventional HMM-based system...
Jyh-Her Yang, Ming-Chieh Liu, Hao-Hsiang Chang, Ch...
FLAIRS
2004
13 years 9 months ago
Speaker Verification Using Speaker-Specific Prompts
Intra- and inter-speaker information, which include acoustical, speaker style, speech rate and temporal variation, despite their critical importance for the verification of claims...
Yongxin Zhang, Adel Iskander Fahmy, Michael S. Sco...
ICASSP
2008
IEEE
14 years 2 months ago
Modified polyphone decision tree specialization for porting multilingual Grapheme based ASR systems to new languages
Automatic speech recognition (ASR) systems have been developed only for a very limited number of the estimated 7,000 languages in the world. In order to avoid the evolvement of a ...
Sebastian Stüker
ISNN
2011
Springer
12 years 10 months ago
Robust Multi-stream Keyword and Non-linguistic Vocalization Detection for Computationally Intelligent Virtual Agents
Abstract. Systems for keyword and non-linguistic vocalization detection in conversational agent applications need to be robust with respect to background noise and different speak...
Martin Wöllmer, Erik Marchi, Stefano Squartin...
NAACL
2007
13 years 9 months ago
Advances in the CMU/Interact Arabic GALE Transcription System
This paper describes the CMU/InterACT effort in developing an Arabic Automatic Speech Recognition (ASR) system for broadcast news and conversations within the GALE 2006 evaluation...
Mohamed Noamany, Thomas Schaaf, Tanja Schultz