Sciweavers

641 search results - page 120 / 129
» Spectral Audio Signal Processing
Sort
View
ICASSP
2010
IEEE
13 years 6 months ago
Speech/Non-Speech Detection in Meetings from Automatically Extracted low Resolution Visual Features
In this paper we address the problem of estimating who is speaking from automatically extracted low resolution visual cues in group meetings. Traditionally, the task of speech/non...
Hayley Hung, Sileye O. Ba
ICASSP
2009
IEEE
13 years 5 months ago
Audiovisual celebrity recognition in unconstrained web videos
The number of video clips available online is growing at a tremendous pace. Conventionally, user-supplied metadata text, such as the title of the video and a set of keywords, has ...
Mehmet Emre Sargin, Hrishikesh Aradhye, Pedro J. M...
ICASSP
2011
IEEE
12 years 11 months ago
Degenerate Unmixing Estimation Technique using the Constant Q Transform
The Degenerate Unmixing Estimation Technique (DUET) is a Blind Source Separation (BSS) algorithm for stereo audio. DUET depends on an amplitude-phase 2d histogram built from the d...
Zafar Rafii, Bryan Pardo
ICASSP
2011
IEEE
12 years 11 months ago
Improved models for Mandarin speech-to-text transcription
This paper describes recent advances at LIMSI in Mandarin Chinese speech-to-text transcription. A number of novel approaches were introduced in the different system components. Th...
Lori Lamel, Jean-Luc Gauvain, Viet-Bac Le, Ilya Op...
ICASSP
2011
IEEE
12 years 11 months ago
Using multiple visual tandem streams in audio-visual speech recognition
The method which is called the “tandem approach” in speech recognition has been shown to increase performance by using classifier posterior probabilities as observations in a...
Ibrahim Saygin Topkaya, Hakan Erdogan