Sciweavers

» Standardization of Speech Corpus
ICASSP
2009
IEEE
14 years 3 months ago
Formant-based technique for automatic filled-pause detection in spontaneous spoken English
Detection of filled pauses is a challenging research problem which has several practical applications. It can be used to evaluate the spoken fluency skills of the speaker, to im...
Kartik Audhkhasi, Kundan Kandhway, Om Deshmukh, As...
ICASSP
2009
IEEE
14 years 3 months ago
Scalable superwideband extension for wideband coding
Recent trends in speech and audio codec standardization include scalability and extending the signal bandwidth beyond wideband (WB) to superwideband (SWB). In this paper we introd...
Mikko Tammi, Lasse Laaksonen, Anssi Rämö...
NAACL
2010
13 years 6 months ago
Softmax-Margin CRFs: Training Log-Linear Models with Cost Functions
We describe a method of incorporating task-specific cost functions into standard conditional log-likelihood (CLL) training of linear structured prediction models. Recently introduc...
Kevin Gimpel, Noah A. Smith
MM
2009
ACM
14 years 3 months ago
Visual speaker localization aided by acoustic models
The following paper presents a novel audio-visual approach for unsupervised speaker locationing. Using recordings from a single, low-resolution room overview camera and a single f...
Gerald Friedland, Chuohao Yeo, Hayley Hung
ICASSP
2008
IEEE
14 years 3 months ago
Fine-grained pitch accent and boundary tone labeling with parametric F0 features
Motivated by linguistic theories of prosodic categoricity, symbolic representations of prosody have recently attracted the attention of speech technologists. Categorical represent...
Sankaranarayanan Ananthakrishnan, Shrikanth Naraya...