Sciweavers

70 search results - page 10 / 14
» An Online System for Synchronized Processing of Video and Au...
Sort
View
ICASSP
2009
IEEE
14 years 2 months ago
COSINE - A corpus of multi-party COnversational Speech In Noisy Environments
We present an overview of the data collection and transcription efforts for the COnversational Speech In Noisy Environments (COSINE) corpus. The corpus is a set of multi-party con...
Alex Stupakov, Evan Hanusa, Jeff A. Bilmes, Dieter...
ICASSP
2008
IEEE
14 years 2 months ago
Audiovisual-to-articulatory speech inversion using Active Appearance Models for the face and Hidden Markov Models for the dynami
We are interested in recovering aspects of vocal tract’s geometry and dynamics from auditory and visual speech cues. We approach the problem in a statistical framework based on ...
Athanassios Katsamanis, George Papandreou, Petros ...
PCM
2007
Springer
160views Multimedia» more  PCM 2007»
14 years 1 months ago
3D Tracking of a Soccer Ball Using Two Synchronized Cameras
Abstract. We propose an adaptive method that can estimate 3D position of a soccer ball by using two viewpoint videos. The 3D position of a ball is essential to realize a 3D free vi...
Norihiro Ishii, Itaru Kitahara, Yoshinari Kameda, ...
EJASMP
2010
112views more  EJASMP 2010»
13 years 2 months ago
Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech
Spoken utterance retrieval was largely studied in the last decades, with the purpose of indexing large audio databases or of detecting keywords in continuous speech streams. While...
Mickael Rouvier, Georges Linares, Benjamin Lecoute...
TELSYS
1998
154views more  TELSYS 1998»
13 years 7 months ago
The Multimedia Internet Terminal (MInT)
The Multimedia Internet Terminal (MINT)1 is a flexible multimedia tool set that allows the establishment and control of multimedia sessions across the Internet. The system archit...
Dorgham Sisalem, Henning Schulzrinne