Sciweavers

33 search results - page 4 / 7
» Speech Activity Detection on Multichannels of Meeting Record...
Sort
View
DSMML
2004
Springer
14 years 1 months ago
Multi Channel Sequence Processing
Abstract. This paper summarizes some of the current research challenges arising from multi-channel sequence processing. Indeed, multiple real life applications involve simultaneous...
Samy Bengio, Hervé Bourlard
ICASSP
2011
IEEE
13 years 8 days ago
Voxel-based Viterbi Active Speaker Tracking (V-VAST) with best view selection for video lecture post-production
An automated system is presented for reducing a multi-view lecture recording into a single view video containing a best view summary of active speakers. The system uses skin color...
Damien Kelly, Anil Kokaram, Frank Boland
CLEAR
2007
Springer
121views Biometrics» more  CLEAR 2007»
14 years 2 months ago
Progress in the AMIDA Speaker Diarization System for Meeting Data
In this paper we describe the AMIDA speaker dizarization system as it was submitted to the NIST Rich Transcription evaluation 2007 for conference room data. This is done in the con...
David A. van Leeuwen, Matej Konecný
LRE
2007
101views more  LRE 2007»
13 years 8 months ago
The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms
The analysis of lectures and meetings inside smart rooms has recently attracted much interest in the literature, being the focus of international projects and technology evaluation...
Djamel Mostefa, Nicolas Moreau, Khalid Choukri, Ge...
MLMI
2007
Springer
14 years 2 months ago
Using Prosodic Features in Language Models for Meetings
Abstract. Prosody has been actively studied as an important knowledge source for speech recognition and understanding. In this paper, we are concerned with the question of exploiti...
Songfang Huang, Steve Renals