Sciweavers

ICASSP
2009
IEEE
14 years 7 months ago
Data-driven lexicon expansion for Mandarin broadcast news and conversation speech recognition
We present a data-driven framework for expanding the lexicon to improve Mandarin broadcast news and conversation speech recognition. The lexicon expansion includes the generation ...
Xin Lei, Wen Wang, Stolcke Stolcke
ICASSP
2009
IEEE
14 years 7 months ago
A study on multilingual acoustic modeling for large vocabulary ASR
We study key issues related to multilingual acoustic modeling for automatic speech recognition (ASR) through a series of large-scale ASR experiments. Our study explores shared str...
Hui Lin, Li Deng, Dong Yu, Yifan Gong, Alex Acero,...
ICASSP
2009
IEEE
14 years 7 months ago
A flat direct model for speech recognition
We introduce a direct model for speech recognition that assumes an unstructured, i.e., flat text output. The flat model allows us to model arbitrary attributes and dependences o...
Georg Heigold, Geoffrey Zweig, Xiao Li, Patrick Ng...
CISIS
2009
IEEE
14 years 7 months ago
Pervasive Informatics and Persistent Actimetric Information in Health Smart Homes: From Language Model to Location Model
—This paper presents an approach of location model deriving from language models existing in speech recognition research. The purpose is to applicate existing model in speech rec...
Yannick Fouquet, Jacques Demongeot, Nicolas Vuille...
CHI
2002
ACM
15 years 24 days ago
Dolltalk: a computational toy to enhance children's creativity
This paper presents a novel approach and interface for encouraging children to tell and act out original stories. "Dolltalk" is a toy that simulates speech recognition b...
Catherine Vaucelle, Tristan Jehan
CHI
2006
ACM
15 years 25 days ago
Speech pen: predictive handwriting based on ambient multimodal recognition
It is tedious to handwrite long passages of text by hand. To make this process more efficient, we propose predictive handwriting that provides input predictions when the user writ...
Kazutaka Kurihara, Masataka Goto, Jun Ogata, Takeo...
CHI
2006
ACM
15 years 25 days ago
Error correction of voicemail transcripts in SCANMail
Despite its widespread use, voicemail presents numerous usability challenges: People must listen to messages in their entirety, they cannot search by keywords, and audio files do ...
Moira Burke, Brian Amento, Philip L. Isenhour
ICPR
2006
IEEE
15 years 1 months ago
A Hybrid HMM-Based Speech Recognizer Using Kernel-Based Discriminants as Acoustic Models
In this paper we propose a novel order-recursive training algorithm for kernel-based discriminants which is computationally efficient. We integrate this method in a hybrid HMM-bas...
Edin Andelic, Marcel Katz, Martin Schafföner,...