Sciweavers

202 search results - page 40 / 41
» Comparing Humans and Automatic Speech Recognition Systems in...
CSL
2002
Springer
13 years 7 months ago
Learning visually grounded words and syntax for a scene description task
A spoken language generation system has been developed that learns to describe objects in computer-generated visual scenes. The system is trained by a 'show-and-tell' procedu...
Deb K. Roy
ICASSP
2011
IEEE
12 years 11 months ago
Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil
This paper presents our work on rapid language adaptation of acoustic models based on multilingual cross-language bootstrapping and unsupervised training. We used Automatic Speech...
Ngoc Thang Vu, Franziska Kraus, Tanja Schultz
LREC
2010
13 years 9 months ago
Modified LTSE-VAD Algorithm for Applications Requiring Reduced Silence Frame Misclassification
The LTSE-VAD is one of the best-known algorithms for voice activity detection. In this paper we present a modified version of this algorithm that makes the VAD decision not takin...
Iker Luengo, Eva Navas, Igor Odriozola, Ibon Sarat...
EJASMP
2010
13 years 2 months ago
Ageing Voices: The Effect of Changes in Voice Parameters on ASR Performance
With ageing, human voices undergo several changes which are typically characterized by increased hoarseness and changes in articulation patterns. In this study, we have examined t...
Ravichander Vipperla, Steve Renals, Joe Frankel
ICASSP
2011
IEEE
12 years 11 months ago
Unsupervised determination of efficient Korean LVCSR units using a Bayesian Dirichlet process model
Korean is an agglutinative language that does not have explicit word boundaries. It is also a highly inflective language that exhibits severe coarticulation effects. These charac...
Sakriani Sakti, Andrew M. Finch, Ryosuke Isotani, ...