Search Sciweavers | Sciweavers

123 search results - page 15 / 25

» Improving Acoustic Models with Captioned Multimedia Speech

click to vote

ICPR
2006
IEEE

167views computer vision» more ICPR 2006»

Switching Auxiliary Chains for Speech Recognition based on Dynamic Bayesian Networks

14 years 8 months ago

Download ssli.ee.washington.edu

This paper investigates the problem of incorporating auxiliary information (e.g. pitch) for speech recognition using dynamic Bayesian networks (DBNs). Previous works usually model...

Hui Lin 0001, Zhijian Ou

claim paper

Read More »

click to vote

COLING
2008

143views Computational Linguistics» more COLING 2008»

Towards Incremental End-of-Utterance Detection in Dialogue Systems

13 years 9 months ago

Download www.aclweb.org

We define the task of incremental or 0lag utterance segmentation, that is, the task of segmenting an ongoing speech recognition stream into utterance units, and present first resu...

Michaela Atterer, Timo Baumann, David Schlangen

claim paper

Read More »

click to vote

ICASSP
2009
IEEE

141views Signal Processing» more ICASSP 2009»

A speech fragment approach to localising multiple speakers in reverberant environments

14 years 2 months ago

Download perception.inrialpes.fr

Sound source localisation cues are severely degraded when multiple acoustic sources are active in the presence of reverberation. We present a binaural system for localising simult...

Heidi Christensen, Ning Ma, Stuart N. Wrigley, Jon...

claim paper

Read More »

click to vote

ICMCS
2006
IEEE

117views Multimedia» more ICMCS 2006»

Data Hiding for Speech Bandwidth Extension and its Hardware Implementation

14 years 1 months ago

Download www.cecs.uci.edu

Most of the current speech transmission systems are only able to deliver speech signals in a narrow frequency band. This narrowband speech is characterized by a thin and mufﬂed ...

Fan Wu, Siyue Chen, Henry Leung

claim paper

Read More »

click to vote

PAMI
2002

98views more PAMI 2002»

Extraction of Visual Features for Lipreading

13 years 7 months ago

Download www.ri.cmu.edu

The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations and other body motion, such as those of the head, convey additional information...

Iain Matthews, Timothy F. Cootes, J. Andrew Bangha...

claim paper

Read More »

« Prev « First page 15 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers