Sciweavers

1812 search results - page 183 / 363
» Signal Processing in Large Systems: a New Paradigm
Sort
View
ICASSP
2009
IEEE
13 years 6 months ago
Audiovisual celebrity recognition in unconstrained web videos
The number of video clips available online is growing at a tremendous pace. Conventionally, user-supplied metadata text, such as the title of the video and a set of keywords, has ...
Mehmet Emre Sargin, Hrishikesh Aradhye, Pedro J. M...
ICASSP
2011
IEEE
12 years 12 months ago
Improved models for Mandarin speech-to-text transcription
This paper describes recent advances at LIMSI in Mandarin Chinese speech-to-text transcription. A number of novel approaches were introduced in the different system components. Th...
Lori Lamel, Jean-Luc Gauvain, Viet-Bac Le, Ilya Op...
ICASSP
2011
IEEE
12 years 12 months ago
Machine and acoustical condition dependency analyses for fast acoustic likelihood calculation techniques
The acceleration of acoustic likelihood calculation has been an important research issue for developing practical speech recognition systems. And there are various specification ...
Atsunori Ogawa, Satoshi Takahashi, Atsushi Nakamur...
CA
2003
IEEE
14 years 1 months ago
Language-Driven Nonverbal Communication in a Bilingual Conversational Agent
This paper describes an animated conversational agent called Kare1 which integrates a talking head interface with a linguistically motivated human-machine dialogue system. The age...
Scott A. King, Alistair Knott, Brendan McCane
ISMIR
2005
Springer
179views Music» more  ISMIR 2005»
14 years 1 months ago
Databionic Visualization of Music Collections According to Perceptual Distance
We describe the MusicMiner system for organizing large collections of music with databionic mining techniques. Low level audio features are extracted from the raw audio data on sh...
Fabian Mörchen, Alfred Ultsch, Mario Nöc...