Sciweavers

ICASSP
2010
IEEE
13 years 7 months ago
Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis
In the EMIME project, we are developing a mobile device that performs personalized speech-to-speech translation such that a user’s spoken input in one language is used to produc...
Keiichiro Oura, Keiichi Tokuda, Junichi Yamagishi,...
ICASSP
2010
IEEE
13 years 7 months ago
A comparison of approximate Viterbi techniques and particle filtering for data estimation in digital communications
We consider trellis-based algorithms for data estimation in digital communication systems. We present a general framework which includes approximate Viterbi algorithms like the M-...
Steffen Barembruch
ICASSP
2010
IEEE
13 years 7 months ago
Forensic estimation and reconstruction of a contrast enhancement mapping
Due to the ease with which convincing digital image forgeries can be created, a need has arisen for digital forensic techniques capable of detecting image manipulation. Once image...
Matthew C. Stamm, K. J. Ray Liu
ICASSP
2010
IEEE
13 years 7 months ago
Improving speech recognition by explicit modeling of phone deletions
In a paper published by Greenberg in 1998, it was said that in conversational speech, phone deletion rate may go as high as 12% whereas syllable deletion rate is about 1%. The fi...
Tom Ko, Brian Mak
ICASSP
2010
IEEE
13 years 7 months ago
Balancing false alarms and hits in Spoken Term Detection
This paper presents methods to improve retrieval of Out-Of-Vocabulary (OOV) terms in a Spoken Term Detection (STD) system. We demonstrate that automated tagging of OOV regions help...
Carolina Parada, Abhinav Sethy, Bhuvana Ramabhadra...
ICASSP
2010
IEEE
13 years 7 months ago
Adaptive search for sparse targets with informative priors
This work considers the problem of efficient energy allocation of resources in a continuous fashion in determining the location of targets in a sparse environment. We extend ...
Gregory Newstadt, Eran Bashan, Alfred O. Hero III
ICASSP
2010
IEEE
13 years 7 months ago
Gradient descent approach for secure localization in resource constrained wireless sensor networks
Many sensor network related applications require precise knowledge of the location of constituent nodes. In these applications, it is desirable for the wireless nodes to be able t...
Ravi Garg, Avinash L. Varna, Min Wu
ICASSP
2010
IEEE
13 years 7 months ago
Noise robust exemplar-based connected digit recognition
This paper proposes a noise robust exemplar-based speech recognition system where noisy speech is modeled as a linear combination of a set of speech and noise exemplars. The metho...
Jort F. Gemmeke, Tuomas Virtanen
ICASSP
2010
IEEE
13 years 7 months ago
Analysis of phone posterior feature space exploiting class-specific sparsity and MLP-based similarity measure
Class posterior distributions have recently been used quite successfully in Automatic Speech Recognition (ASR), either for frame or phone level classification or as acoustic featu...
Afsaneh Asaei, Benjamin Picart, Hervé Bourl...
ICASSP
2010
IEEE
13 years 7 months ago
Learning with synthesized speech for automatic emotion recognition
Data sparseness is an ever dominating problem in automatic emotion recognition. Using artificially generated speech for training or adapting models could potentially ease this: t...
Bjoern Schuller, Felix Burkhardt