Sciweavers

2509 search results - page 420 / 502
» Transactional stream processing
Sort
View
134
Voted
ICASSP
2008
IEEE
15 years 8 months ago
Audiovisual-to-articulatory speech inversion using Active Appearance Models for the face and Hidden Markov Models for the dynami
We are interested in recovering aspects of vocal tract’s geometry and dynamics from auditory and visual speech cues. We approach the problem in a statistical framework based on ...
Athanassios Katsamanis, George Papandreou, Petros ...
ICASSP
2008
IEEE
15 years 8 months ago
Quality evaluation of the G.EV-VBR speech codec
ITU-T has selected the candidate submitted by Ericsson, Nokia, Motorola, VoiceAge, and Texas Instruments as the baseline for the G.EV-VBR coding standard. G.EV-VBR is an embedded ...
Anssi Rämö, Henri Toukomaa, S. Craig Gre...
110
Voted
ICASSP
2008
IEEE
15 years 8 months ago
Multi-stream parameterization for structural speech recognition
Recently, a novel and structural representation of speech was proposed [1, 2], where the inevitable acoustic variations caused by nonlinguistic factors are effectively removed fro...
Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hiros...
115
Voted
ICASSP
2008
IEEE
15 years 8 months ago
Polyphase speech recognition
We propose a model for speech recognition that consists of multiple semi-synchronized recognizers operating on a polyphase decomposition of standard speech features. Specifically...
Hui Lin, Jeff Bilmes
102
Voted
ICIP
2008
IEEE
15 years 8 months ago
A multimodal approach to music transcription
Music transcription refers to extraction of a human readable and interpretable description from a recording of a music performance. Automatic music transcription remains, nowadays...
Marco Paleari, Benoit Huet, Antony Schutz, Dirk T....