Sciweavers

193 search results - page 9 / 39
» Speech Structure and Its Application to Robust Speech Proces...
Sort
View
ICASSP
2008
IEEE
14 years 1 months ago
Quality evaluation of the G.EV-VBR speech codec
ITU-T has selected the candidate submitted by Ericsson, Nokia, Motorola, VoiceAge, and Texas Instruments as the baseline for the G.EV-VBR coding standard. G.EV-VBR is an embedded ...
Anssi Rämö, Henri Toukomaa, S. Craig Gre...
ICASSP
2011
IEEE
12 years 11 months ago
A novel approach using modulation features for multiphone-based speech recognition
Recent advances in coherent and convex demodulation have proven useful for analyzing and modifying the low-frequency envelope structure of speech. This paper reports the applicati...
Pascal Clark, Gregory Sell, Les E. Atlas
ICASSP
2009
IEEE
14 years 2 months ago
Affine invariant features and their application to speech recognition
This paper proposes a set of affine invariant features (AIFs) for sequence data. The proposed AIFs can be calculated directly from the sequence data, and their invariance to af...
Yu Qiao, Masayuki Suzuki, Nobuaki Minematsu
ACL
2007
13 years 8 months ago
Making Sense of Sound: Unsupervised Topic Segmentation over Acoustic Input
We address the task of unsupervised topic segmentation of speech data operating over raw acoustic information. In contrast to existing algorithms for topic segmentation of speech,...
Igor Malioutov, Alex Park, Regina Barzilay, James ...
TASLP
2008
143views more  TASLP 2008»
13 years 7 months ago
Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarizatio
Many current state-of-the-art speaker diarization systems exploit agglomerative hierarchical clustering (AHC) as their speaker clustering strategy, due to its simple processing str...
K. J. Han, S. Kim, S. S. Narayanan