Sciweavers

584 search results - page 87 / 117
» Physical Audio Signal Processing
Sort
View
ICASSP
2009
IEEE
13 years 7 months ago
Audiovisual celebrity recognition in unconstrained web videos
The number of video clips available online is growing at a tremendous pace. Conventionally, user-supplied metadata text, such as the title of the video and a set of keywords, has ...
Mehmet Emre Sargin, Hrishikesh Aradhye, Pedro J. M...
ICASSP
2011
IEEE
13 years 1 months ago
Degenerate Unmixing Estimation Technique using the Constant Q Transform
The Degenerate Unmixing Estimation Technique (DUET) is a Blind Source Separation (BSS) algorithm for stereo audio. DUET depends on an amplitude-phase 2d histogram built from the d...
Zafar Rafii, Bryan Pardo
ICASSP
2011
IEEE
13 years 1 months ago
Improved models for Mandarin speech-to-text transcription
This paper describes recent advances at LIMSI in Mandarin Chinese speech-to-text transcription. A number of novel approaches were introduced in the different system components. Th...
Lori Lamel, Jean-Luc Gauvain, Viet-Bac Le, Ilya Op...
ICASSP
2011
IEEE
13 years 1 months ago
Using multiple visual tandem streams in audio-visual speech recognition
The method which is called the “tandem approach” in speech recognition has been shown to increase performance by using classifier posterior probabilities as observations in a...
Ibrahim Saygin Topkaya, Hakan Erdogan
ICASSP
2011
IEEE
13 years 1 months ago
Emotion classification from speech using evaluator reliability-weighted combination of ranked lists
In emotion recognition, a widely-used method to reconciliate disagreement between multiple human evaluators is to perform majority-voting on their assigned class labels. Instead, ...
Kartik Audhkhasi, Shrikanth S. Narayanan