Signal Processing | Sciweavers

INTERSPEECH
2010

The use of subvector quantization and discrete densities for fast GMM computation for speaker verification

15 years 24 days ago

Last year, we showed that the computation of a GMM-UBM-based speaker verification (SV) system may be sped up by 30 times by using a high-density discrete model (HDDM) on the NIST 2...

Guoli Ye, Brian Mak

INTERSPEECH
2010

Continuous speech recognition with a TF-IDF acoustic model

15 years 24 days ago

Download research.microsoft.com

Information retrieval methods are frequently used for indexing and retrieving spoken documents, and more recently have been proposed for voice-search amongst a pre-defined set of ...

Geoffrey Zweig, Patrick Nguyen, Jasha Droppo, Alex...

INTERSPEECH
2010

Detecting categorical perception in continuous discrimination data

15 years 24 days ago

Download www.fon.hum.uva.nl

We present a method for assessing categorical perception from continuous discrimination data. Until recently, categorical perception of speech has exclusively been measured by dis...

Paul Boersma, Katerina Chládková

INTERSPEECH
2010

Phonetic realization of second occurrence focus in Japanese

15 years 24 days ago

Download www.ling.upenn.edu

Previous studies have recently agreed that second occurrence focus is phonetically realized as prosodic prominence. What has been missing in the previous studies, however, is a co...

Satoshi Nambu, Yong-cheol Lee

INTERSPEECH
2010

Speaking style dependency of formant targets

15 years 24 days ago

Download www.cslu.ogi.edu

Previous work on formant targets has assumed that these targets are independent of the speaking style. In this paper, we estimate consonant and vowel targets in a database of "...

Akiko Amano-Kusumoto, John-Paul Hosom, Alexander K...

INTERSPEECH
2010

Phonetic subspace mixture model for speaker diarization

15 years 24 days ago

Download www.iis.sinica.edu.tw

This paper presents an improved distance measure for speaker clustering in speaker diarization systems. The proposed phonetic subspace mixture (PSM) model introduces phonetic info...

I-Fan Chen, Shih-Sian Cheng, Hsin-Min Wang

INTERSPEECH
2010

An HMM trajectory tiling (HTT) approach to high quality TTS

15 years 24 days ago

Download festvox.org

We propose an HMM Trajectory Tiling (HTT) approach to high quality TTS, which is our entry to Blizzard Challenge 2010. In HTT, first refined HMM is trained with the Minimum Genera...

Yao Qian, Zhi-Jie Yan, Yijian Wu, Frank K. Soong, ...

INTERSPEECH
2010

Laryngealization and features for Chinese tonal recognition

15 years 24 days ago

Download www.linguistics.ucla.edu

It is well known that the lowest tone in Mandarin, a language without contrastive phonation, often co-occurs with laryngealization/creaky voice quality, and we provide evidence th...

Kristine M. Yu

INTERSPEECH
2010

A comparative large scale study of MLP features for Mandarin ASR

15 years 24 days ago

Download www.speech.sri.com

MLP based front-ends have shown significant complementary properties to conventional spectral features. As part of the DARPA GALE program, different MLP features were developed fo...

Fabio Valente, Mathew Magimai-Doss, Christian Plah...

INTERSPEECH
2010

What do you mean, you're uncertain?: the interpretation of cue words and rising intonation in dialogue

15 years 24 days ago

Download www.ling.upenn.edu

This paper investigates how rising intonation affects the interpretation of cue words in dialogue. Both cue words and rising intonation express a range of speaker attitudes like u...

Catherine Lai
