Sciweavers

ICASSP
2008
IEEE
14 years 6 months ago
Modeling the intonation of discourse segments for improved online dialog act tagging
Prosody is an important cue for identifying dialog acts. In this paper, we show that modeling the sequence of acoustic-prosodic values as n-gram features with a maximum entropy mod...
Vivek Kumar Rangarajan Sridhar, Shrikanth Narayana...
ICASSP
2008
IEEE
14 years 6 months ago
Is voice transformation a threat to speaker identification?
With the development of voice transformation and speech synthesis technologies, speaker identification systems are likely to face attacks from imposters who use voice transformed ...
Qin Jin, Arthur R. Toth, Alan W. Black, Tanja Schu...
ICASSP
2008
IEEE
14 years 6 months ago
Maximum conditional likelihood linear regression and maximum a posteriori for hidden conditional random fields speaker adaptation
This paper shows how to improve Hidden Conditional Random Fields (HCRFs) for phone classification by applying various speaker adaptation techniques. These include Maximum A Poste...
Yun-Hsuan Sung, Constantinos Boulis, Daniel Jurafs...
ICASSP
2008
IEEE
14 years 6 months ago
A turbo-style algorithm for lexical baseforms estimation
In this research, an iterative and unsupervised Turbo-style algorithm is presented and implemented for the task of automatic lexical acquisition. The algorithm makes use of spoken...
Ghinwa F. Choueiter, Mesrob I. Ohannessian, Stepha...
ICASSP
2008
IEEE
14 years 6 months ago
Analysis-by-synthesis features for speech recognition
We present a framework for speech recognition that accounts for hidden articulatory information. We model the articulatory space using a codebook of articulatory configurations g...
Ziad Al Bawab, Bhiksha Raj, Richard M. Stern
ICASSP
2008
IEEE
14 years 6 months ago
Unsupervised language model adaptation via topic modeling based on named entity hypotheses
Language model (LM) adaptation is often achieved by combining a generic LM with a topic-specific model that is more relevant to the target document. Unlike previous work on unsup...
Yang Liu, Feifan Liu
ICASSP
2008
IEEE
14 years 6 months ago
Interactive environmental sensing: Signal and image processing challenges
Networked embedded acoustic sensors and imagers allow scientists to observe biological and environmental phenomena at high sampling rates and multiple scales. Such sampling can cr...
Michael Allen, Eric Graham, Shaun Ahmadian, Tetsun...
ICASSP
2008
IEEE
14 years 6 months ago
A top-down auditory attention model for learning task dependent influences on prominence detection in speech
A top-down task-dependent model guides attention to likely target locations in cluttered scenes. Here, a novel biologically plausible top-down auditory attention model is presente...
Ozlem Kalinli, Shrikanth S. Narayanan
ICASSP
2008
IEEE
14 years 6 months ago
Corrected tandem features for acoustic model training
This paper describes a simple method for significantly improving Tandem features used to train acoustic models for large-vocabulary speech recognition. The linear activations at ...
Arlo Faria, Nelson Morgan
ICASSP
2008
IEEE
14 years 6 months ago
Semantic composition process in a speech understanding system
A knowledge representation formalism for SLU is introduced. It is used for incremental and partially automated annotation of the MEDIA corpus in terms of semantic structures. An a...
Frédéric Duvert, Marie-Jean Meurs, C...