Signal Processing | Sciweavers

INTERSPEECH
2010

Measuring basic tempo across languages and some implications for speech rhythm

14 years 9 months ago

Basic language-inherent tempo cannot be isolated by the current metrics of speech rhythm. Here we propose the number of syllables per intonation unit as an appropriate measure, al...

Gertraud Fenk-Oczlon, August Fenk

INTERSPEECH
2010

Glottal-based analysis of the Lombard effect

14 years 9 months ago

Download tcts.fpms.ac.be

The Lombard effect refers to the speech changes due to the immersion of the speaker in a noisy environment. Among these changes, studies have already reported acoustic modificatio...

Thomas Drugman, Thierry Dutoit

INTERSPEECH
2010

An improved wavelet-based dereverberation for robust automatic speech recognition

14 years 9 months ago

Download www.ar.media.kyoto-u.ac.jp

This paper presents an improved wavelet-based dereverberation method for automatic speech recognition (ASR). Dereverberation is based on filtering reverberant wavelet coefficients...

Randy Gomez, Tatsuya Kawahara

INTERSPEECH
2010

Discriminative adaptation for log-linear acoustic models

14 years 9 months ago

Download www-i6.informatik.rwth-aachen.de

Log-linear models have recently been used in acoustic modeling for speech recognition systems. This has been motivated by competitive results compared to systems based on Gaussian...

Jonas Lööf, Ralf Schlüter, Hermann ...

INTERSPEECH
2010

Pitch similarity in the vicinity of backchannels

14 years 9 months ago

Download www.cs.columbia.edu

Dynamic modeling of spoken dialogue seeks to capture how interlocutors change their speech over the course of a conversation. Much work has focused on how speakers adapt or entrai...

Mattias Heldner, Jens Edlund, Julia Hirschberg

INTERSPEECH
2010

Learning from human errors: prediction of phoneme confusions based on modified ASR training

14 years 9 months ago

Download medi.uni-oldenburg.de

In an attempt to improve models of human perception, the recognition of phonemes in nonsense utterances was predicted with automatic speech recognition (ASR) in order to analyze i...

Bernd T. Meyer, Birger Kollmeier

INTERSPEECH
2010

Expectations for discourse genre identification: a prosodic study

14 years 9 months ago

Download articles.ircam.fr

Speech can be divided into discourse genres based on the contextual environment it occurs in (e.g. political speech, sport commentary speech, etc.). The present study investigated...

Nicolas Obin, Volker Dellwo, Anne Lacheret, Xavier...

INTERSPEECH
2010

Improved neural network based language modelling and adaptation

14 years 9 months ago

Download mi.eng.cam.ac.uk

Neural network language models (NNLM) have become an increasingly popular choice for large vocabulary continuous speech recognition (LVCSR) tasks, due to their inherent generalisa...

Junho Park, Xunying Liu, Mark J. F. Gales, Philip ...

INTERSPEECH
2010

Building transcribed speech corpora quickly and cheaply for many languages

14 years 9 months ago

Download static.googleusercontent.com

We present a system for quickly and cheaply building transcribed speech corpora containing utterances from many speakers in a variety of acoustic conditions. The system consists o...

Thad Hughes, Kaisuke Nakajima, Linne Ha, Atul Vasu...

INTERSPEECH
2010

Distribution and trichotomic realization of voiced velars in Japanese - an experimental study

14 years 9 months ago

Download www.geocities.jp

In this paper, we demonstrate the trichotomic realization of voiced velars in Japanese, challenging the traditional plosive/nasal dichotomy of velar allophones, and examine the di...

Shin-ichiro Sano, Tomohiko Ooigawa
