Sciweavers

300 search results - page 29 / 60
» The COST-277 Speech Database
Sort
View
ICASSP
2010
IEEE
13 years 8 months ago
Analysis of phone posterior feature space exploiting class-specific sparsity and MLP-based similarity measure
Class posterior distributions have recently been used quite successfully in Automatic Speech Recognition (ASR), either for frame or phone level classification or as acoustic featu...
Afsaneh Asaei, Benjamin Picart, Hervé Bourl...
COLING
2002
13 years 8 months ago
Robust Interpretation of User Requests for Text Retrieval in a Multimodal Environment
We describe a parser for robust and flexible interpretation of user utterances in a multi-modal system for web search in newspaper databases. Users can speak or type, and they can...
Alexandra Klein, Estela Puig-Waldmüller, Hara...
INTERSPEECH
2010
13 years 3 months ago
Multi-pitch estimation by a joint 2-d representation of pitch and pitch dynamics
Multi-pitch estimation of co-channel speech is especially challenging when the underlying pitch tracks are close in pitch value (e.g., when pitch tracks cross). Building on our pr...
Tianyu T. Wang, Thomas F. Quatieri
ICASSP
2011
IEEE
13 years 12 days ago
Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions
It is well known that MFCC based speaker identification (SID) systems easily break down under mismatched training and test conditions. One such mismatch occurs when a SID system ...
Seyed Omid Sadjadi, John H. L. Hansen
ICASSP
2011
IEEE
13 years 12 days ago
Semantic data selection for vertical business voice search
Local business voice search is a popular application for mobile phones, where hands-free interaction and speed are critical to users. However, speech recognition accuracy is still...
Giuseppe Di Fabbrizio, Diamantino Caseiro, Amanda ...