Sciweavers

ICASSP
2011
IEEE
13 years 3 months ago
A sampling-based environment population projection approach for rapid acoustic model adaptation
We propose an environment population projection (EPP) approach for rapid acoustic model adaptation to reduce environment mismatches with limited amounts of adaptation data. This a...
Yu Tsao, Shigeki Matsuda, Shinsuke Sakai, Ryosuke ...
TASLP
2010
13 years 6 months ago
Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments
In the presence of environmental noise, speakers tend to adjust their speech production in an effort to preserve intelligible communication. The noise-induced speech adjustments, c...
Hynek Boril, John H. L. Hansen
TSD
2010
Springer
13 years 9 months ago
Using Gradient Descent Optimization for Acoustics Training from Heterogeneous Data
In this paper, we study the use of heterogeneous data for training of acoustic models. In initial experiments, a significant drop of accuracy has been observed on in-domain test s...
Martin Karafiát, Igor Szöke, Jan Cerno...
LRE
2010
13 years 10 months ago
The Corpus DIMEx100: transcription and evaluation
In this paper the transcription and evaluation of the corpus DIMEx100 for Mexican Spanish is presented. First we describe the corpus and explain the linguistic and computational mo...
Luis Alberto Pineda, Hayde Castellanos, Javier Cu...
CSL
2002
Springer
13 years 11 months ago
Lightly supervised and unsupervised acoustic model training
The last decade has witnessed substantial progress in speech recognition technology, with today's state-of-the-art systems being able to transcribe unrestricted broadcast news audi...
Lori Lamel, Jean-Luc Gauvain, Gilles Adda
ICASSP
2010
IEEE
13 years 11 months ago
Multi-style MLP features for BN transcription
It has become common practice to adapt acoustic models to specific conditions (gender, accent, bandwidth) in order to improve the performance of speech-to-text (STT) transcriptio...
Viet-Bac Le, Lori Lamel, Jean-Luc Gauvain
NAACL
2001
14 years 24 days ago
Generating Training Data for Medical Dictations
In automatic speech recognition (ASR) enabled applications for medical dictations, corpora of literal transcriptions of speech are critical for training both speaker independent a...
Sergey V. Pakhomov, Michael Schonwetter, Joan Bach...
LREC
2008
14 years 26 days ago
Evaluation of several Maximum Likelihood Linear Regression Variants for Language Adaptation
Multilingual Automatic Speech Recognition (ASR) systems are of great interest in multilingual environments. We studied the case of the Comunitat Valenciana where the two official ...
Míriam Luján-Mares, Carlos D. Mart...
CICLING
2005
Springer
14 years 1 months ago
Toward Acoustic Models for Languages with Limited Linguistic Resources
This paper discusses preliminary results on creating acoustic models from acoustic models that already exist for another language. In this work we show, as a case study, the cr...
Luis Villaseñor Pineda, Viet Bac Le, Manuel...
ICPR
2008
IEEE
15 years 19 days ago
Application of triphone clustering in acoustic modeling for continuous speech recognition in Bengali
The performance of the acoustic models is strongly reflected in the overall performance of any continuous speech recognition system. Hence generation of an accurate and robust acou...
Anupam Basu, Gaurav Garg, Pabitra Mitra, Pratyush ...