Sciweavers

146 search results - page 12 / 30
» Automatic speech recognition performance on a voicemail tran...
ERCIMDL
2009
Springer
14 years 2 months ago
A Web-Based Demo to Interactive Multimodal Transcription of Historic Text Images
Paleography experts spend many hours transcribing historic documents, and state-of-the-art handwritten text recognition systems are not suitable for performing this task automatica...
Verónica Romero, Luis A. Leiva, Vicente Ala...
ICPR
2008
IEEE
14 years 8 months ago
A phone-viseme dynamic Bayesian network for audio-visual automatic speech recognition
This work extends and improves a recently introduced (Dec. 2007) dynamic Bayesian network (DBN) based audio-visual automatic speech recognition (AVASR) system. That system models ...
Louis H. Terry, Aggelos K. Katsaggelos
ICASSP
2011
IEEE
12 years 11 months ago
MLP based phoneme detectors for Automatic Speech Recognition
Phoneme posterior probabilities estimated using Multi-Layer Perceptrons (MLPs) are extensively used both as acoustic scores and features for speech recognition. In this paper we e...
Samuel Thomas, Patrick Nguyen, Geoffrey Zweig, Hyn...
ICASSP
2009
IEEE
14 years 2 months ago
Restoring punctuation and capitalization in transcribed speech
Adding punctuation and capitalization greatly improves the readability of automatic speech transcripts. We discuss an approach for performing both tasks in a single pass using a p...
Agustín Gravano, Martin Jansche, Michiel Ba...
LREC
2008
13 years 9 months ago
The ATCOSIM Corpus of Non-Prompted Clean Air Traffic Control Speech
Air traffic control (ATC) is based on voice communication between pilots and controllers and uses a highly task and domain specific language. Due to this very reason, spoken langu...
Konrad Hofbauer, Stefan Petrik, Horst Hering