Sciweavers

683 search results - page 101 / 137
» speech 2008
ACL
2008
15 years 5 months ago
Lexicalized Phonotactic Word Segmentation
This paper presents a new unsupervised algorithm (WordEnds) for inferring word boundaries from transcribed adult conversations. Phone ngrams before and after observed pauses are u...
Margaret M. Fleck
IADIS
2003
15 years 5 months ago
Multimodal Interaction and Access to Complex Data
Today’s users want to access their data everywhere and any time – in various environments and occasions. The data itself can be very complex – the problem is then in providi...
Vladislav Nemec, Pavel Zikovsky, Pavel Slaví...
COST
2008
Springer
15 years 5 months ago
Multimodal Human Machine Interactions in Virtual and Augmented Reality
Virtual worlds are developing rapidly over the internet. They are visited by avatars and staffed with Embodied Conversational Agents (ECAs). An avatar is a representation of a phys...
Gérard Chollet, Anna Esposito, Annie Gentes...
ICASSP
2011
IEEE
14 years 7 months ago
Source-normalised-and-weighted LDA for robust speaker recognition using i-vectors
The recently developed i-vector framework for speaker recognition has set a new performance standard in the research field. An i-vector is a compact representation of a speaker u...
Mitchell McLaren, David A. van Leeuwen
ICASSP
2008
IEEE
15 years 10 months ago
High-dynamic range compression using a fast multiscale optimization
To appear in Proc. IEEE Int’l Conf. on Acoustics, Speech, and Signal Processing, March 2008. High-dynamic-range medical images take intensity values which cannot be visualized o...
Matthieu Maitre, Yunqiang Chen, Tong Fang