Sciweavers

775 search results - page 56 / 155
» Processing Self Corrections in a speech to speech system
ICASSP
2011
IEEE
12 years 11 months ago
A multi-stream ASR framework for BLSTM modeling of conversational speech
We propose a novel multi-stream framework for continuous conversational speech recognition which employs bidirectional Long Short-Term Memory (BLSTM) networks for phoneme predicti...
Martin Wöllmer, Florian Eyben, Björn Sch...
ICASSP
2010
IEEE
13 years 8 months ago
Information retrieval methods for automatic speech recognition
In this paper, we use information retrieval (IR) techniques to improve a speech recognition (ASR) system. The potential benefits include improved speed, accuracy, and scalability...
Xiaoqiang Xiao, Jasha Droppo, Alex Acero
IUI
2006
ACM
14 years 1 month ago
Three phase verification for spoken dialog clarification
Spoken dialog tasks incur many errors including speech recognition errors, understanding errors, and even dialog management errors. These errors create a big gap between user'...
Sangkeun Jung, Cheongjae Lee, Gary Geunbae Lee
ICASSP
2008
IEEE
14 years 2 months ago
Multilingual weighted codebooks
In this paper we present an approach for speech recognition of multiple languages with constrained resources on embedded devices. Examples of such systems are navigation systems, ...
Martin Raab, Rainer Gruhn, Elmar Nöth
ICASSP
2011
IEEE
12 years 11 months ago
Voxel-based Viterbi Active Speaker Tracking (V-VAST) with best view selection for video lecture post-production
An automated system is presented for reducing a multi-view lecture recording into a single view video containing a best view summary of active speakers. The system uses skin color...
Damien Kelly, Anil Kokaram, Frank Boland