Sciweavers

6553 search results - page 1203 / 1311
» Face modeling for recognition
Sort
View
124
Voted
ISVC
2009
Springer
15 years 11 months ago
Scene Categorization by Introducing Contextual Information to the Visual Words
In this paper, we propose a novel scene categorization method based on contextual visual words. In this method, we extend the traditional ‘bags of visual words’ model by introd...
Jianzhao Qin, Nelson H. C. Yung
KI
2009
Springer
15 years 11 months ago
Robust Processing of Situated Spoken Dialogue
Spoken dialogue is notoriously hard to process with standard language processing technologies. Dialogue systems must indeed meet two major challenges. First, natural spoken dialogu...
Pierre Lison, Geert-Jan M. Kruijff
MM
2009
ACM
125views Multimedia» more  MM 2009»
15 years 11 months ago
Unfolding speaker clustering potential: a biomimetic approach
Speaker clustering is the task of grouping a set of speech utterances into speaker-specific classes. The basic techniques for solving this task are similar to those used for spea...
Thilo Stadelmann, Bernd Freisleben
ICASSP
2008
IEEE
15 years 11 months ago
Phonetic pronunciations for arabic speech-to-text systems
In this paper two aspects of generating and using phonetic Arabic dictionaries are described. First, the use of single pronunciation acoustic models in the context of Arabic large...
Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Phi...
ICASSP
2008
IEEE
15 years 11 months ago
Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons
Phoneme segmentation is a fundamental problem in many speech recognition and synthesis studies. Unsupervised phoneme segmentation assumes no knowledge on linguistic contents and a...
Yu Qiao, Naoya Shimomura, Nobuaki Minematsu
« Prev « First page 1203 / 1311 Last » Next »