Search Sciweavers | Sciweavers

775 search results - page 56 / 155

» Processing Self Corrections in a speech to speech system

ICASSP
2011
IEEE

119 views, Signal Processing

A multi-stream ASR framework for BLSTM modeling of conversational speech

12 years 11 months ago

Download mirlab.org

We propose a novel multi-stream framework for continuous conversational speech recognition which employs bidirectional Long Short-Term Memory (BLSTM) networks for phoneme predicti...

Martin Wöllmer, Florian Eyben, Björn Sch...

ICASSP
2010
IEEE

134 views, Signal Processing

Information retrieval methods for automatic speech recognition

13 years 8 months ago

Download research.microsoft.com

In this paper, we use information retrieval (IR) techniques to improve a speech recognition (ASR) system. The potential benefits include improved speed, accuracy, and scalability...

Xiaoqiang Xiao, Jasha Droppo, Alex Acero

IUI
2006
ACM

139 views, Software Engineering

Three phase verification for spoken dialog clarification

14 years 1 months ago

Download isoft.postech.ac.kr

Spoken dialog tasks incur many errors including speech recognition errors, understanding errors, and even dialog management errors. These errors create a big gap between user'...

Sangkeun Jung, Cheongjae Lee, Gary Geunbae Lee

ICASSP
2008
IEEE

119 views, Signal Processing

Multilingual weighted codebooks

14 years 2 months ago

Download titan.segv.de

In this paper we present an approach for speech recognition of multiple languages with constrained resources on embedded devices. Examples of such systems are navigation systems, ...

Martin Raab, Rainer Gruhn, Elmar Nöth

ICASSP
2011
IEEE

149 views, Signal Processing

Voxel-based Viterbi Active Speaker Tracking (V-VAST) with best view selection for video lecture post-production

12 years 11 months ago

Download mirlab.org

An automated system is presented for reducing a multi-view lecture recording into a single view video containing a best view summary of active speakers. The system uses skin color...

Damien Kelly, Anil Kokaram, Frank Boland
