Sciweavers

334 search results - page 51 / 67
» Improving speech playback using time-compression and speech ...
ICASSP
2011
IEEE
12 years 10 months ago
Multi-class Model M
Model M, a novel class-based exponential language model, has been shown to significantly outperform word n-gram models in state-of-the-art machine translation and speech recognit...
Ahmad Emami, Stanley F. Chen
TASLP
2008
13 years 6 months ago
On Acoustic Diversification Front-End for Spoken Language Identification
The parallel phone recognition followed by language model (PPRLM) architecture represents one of the state-of-the-art spoken language identification systems. A PPRLM system compris...
Khe Chai Sim, Haizhou Li
ICASSP
2010
IEEE
13 years 7 months ago
Acoustic front-end optimization for bird species recognition
The goal of this work was to explore the optimization of the feature extraction module (front-end) parameters to improve bird species recognition. We explored optimizing the spect...
Martin Graciarena, Michelle Delplanche, Elizabeth ...
ICML
2003
IEEE
14 years 7 months ago
Discriminative Gaussian Mixture Models: A Comparison with Kernel Classifiers
We show that a classifier based on Gaussian mixture models (GMM) can be trained discriminatively to improve accuracy. We describe a training procedure based on the extended Baum-W...
Aldebaro Klautau, Nikola Jevtic, Alon Orlitsky
CHI
1996
ACM
13 years 11 months ago
MailCall: Message Presentation and Navigation in a Nonvisual Environment
MailCall is a telephone-based messaging system using speech recognition and synthesis. Its nonvisual interaction approaches the usability of visual systems through a combination o...
Matthew Marx, Chris Schmandt