Signal Processing | Sciweavers

142

INTERSPEECH
2010

121views Signal Processing» more INTERSPEECH 2010»

HMM-based prosodic structure model using rich linguistic context

15 years 24 days ago

This paper presents a study on the use of deep syntactical features to improve prosody modeling 1 . A French linguistic processing chain based on linguistic preprocessing, morphos...

Nicolas Obin, Xavier Rodet, Anne Lacheret

claim paper

Read More »

159

click to vote

INTERSPEECH
2010

181views Signal Processing» more INTERSPEECH 2010»

Binary coding of speech spectrograms using a deep auto-encoder

15 years 24 days ago

Download research.microsoft.com

This paper reports our recent exploration of the layer-by-layer learning strategy for training a multi-layer generative model of patches of speech spectrograms. The top layer of t...

Li Deng, Michael L. Seltzer, Dong Yu, Alex Acero, ...

claim paper

Read More »

189

click to vote

INTERSPEECH
2010

189views Signal Processing» more INTERSPEECH 2010»

Efficient combined approach for named entity recognition in spoken language

15 years 24 days ago

Download azeddine.zidouni.perso.esil.univ-mrs.fr

We focus in this paper on the named entity recognition task in spoken data. The proposed approach investigates the use of various contexts of the words to improve recognition. Exp...

Azeddine Zidouni, Sophie Rosset, Hervé Glot...

claim paper

Read More »

158

click to vote

INTERSPEECH
2010

114views Signal Processing» more INTERSPEECH 2010»

Data pruning for template-based automatic speech recognition

15 years 24 days ago

Download www.esat.kuleuven.be

In this paper we describe and analyze a data pruning method in combination with template-based automatic speech recognition. We demonstrate the positive effects of polishing the t...

Dino Seppi, Dirk Van Compernolle

claim paper

Read More »

149

click to vote

INTERSPEECH
2010

124views Signal Processing» more INTERSPEECH 2010»

Efficient HMM-based estimation of missing features, with applications to packet loss concealment

15 years 24 days ago

Download www.ee.ucla.edu

In this paper, we present efficient HMM-based techniques for estimating missing features. By assuming speech features to be observations of hidden Markov processes, we derive a mi...

Bengt J. Borgström, Per Henrik Borgström...

claim paper

Read More »

134

click to vote

INTERSPEECH
2010

122views Signal Processing» more INTERSPEECH 2010»

Say what? why users choose to speak their web queries

15 years 24 days ago

Download static.googleusercontent.com

The context in which a speech-driven application is used (or conversely not used) can be an important signal for recognition engines, and for spoken interface design. Using large-...

Maryam Kamvar, Doug Beeferman

claim paper

Read More »

166

click to vote

INTERSPEECH
2010

110views Signal Processing» more INTERSPEECH 2010»

Wiktionary as a source for automatic pronunciation extraction

15 years 24 days ago

Download csl.anthropomatik.kit.edu

In this paper, we analyze whether dictionaries from the World Wide Web which contain phonetic notations, may support the rapid creation of pronunciation dictionaries within the sp...

Tim Schlippe, Sebastian Ochs, Tanja Schultz

claim paper

Read More »

137

click to vote

INTERSPEECH
2010

101views Signal Processing» more INTERSPEECH 2010»

Strategies for statistical spoken language understanding with small amount of data - an empirical study

15 years 24 days ago

Download research.microsoft.com

The semantic frame based spoken language understanding involves two decisions

Ye-Yi Wang

claim paper

Read More »

188

click to vote

INTERSPEECH
2010

181views Signal Processing» more INTERSPEECH 2010»

Active appearance models for photorealistic visual speech synthesis

15 years 24 days ago

Download www.etro.vub.ac.be

The perceived quality of a synthetic visual speech signal greatly depends on the smoothness of the presented visual articulators. This paper explains how concatenative visual spee...

Wesley Mattheyses, Lukas Latacz, Werner Verhelst

claim paper

Read More »

159

click to vote

INTERSPEECH
2010

170views Signal Processing» more INTERSPEECH 2010»

Coping imbalanced prosodic unit boundary detection with linguistically-motivated prosodic features

15 years 24 days ago

Download www.ling.sinica.edu.tw

Continuous speech input for ASR processing is usually presegmented into speech stretches by pauses. In this paper, we propose that smaller, prosodically defined units can be ident...

Yi-Fen Liu, Shu-Chuan Tseng, Jyh-Shing Roger Jang,...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers