Sciweavers

300 search results - page 35 / 60
» The COST-277 Speech Database
ICIP
2003
IEEE
14 years 10 months ago
On automatic annotation of meeting databases
In this paper, we discuss meetings as an application domain for multimedia content analysis. Meeting databases are a rich data source suitable for a variety of audio, visual and m...
Daniel Gatica-Perez, Hervé Bourlard, Iain M...
ICIP
2000
IEEE
14 years 10 months ago
Normalized Training for HMM-Based Visual Speech Recognition
This paper presents an approach to estimating the parameters of continuous density HMMs for visual speech recognition. One of the key issues of image-based visual speech recogniti...
Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamur...
ICASSP
2009
IEEE
14 years 3 months ago
Speech emotion recognition via a max-margin framework incorporating a loss function based on the Watson and Tellegen's emotion m
This paper considers a method for speech emotion recognition by a max-margin framework incorporating a loss function based on a well-known model called the Watson and Tellegen’s...
Sungrack Yun, Chang D. Yoo
NOLISP
2007
Springer
14 years 2 months ago
A Hybrid Genetic-Neural Front-End Extension for Robust Speech Recognition over Telephone Lines
This paper presents a hybrid technique combining the Karhunen-Loève Transform (KLT), the Multilayer Perceptron (MLP) and Genetic Algorithms (GAs) to obtain less-variant Mel-freque...
Sid-Ahmed Selouani, Habib Hamam, Douglas D. O'Shau...
TSD
2007
Springer
14 years 2 months ago
Festival-si: A Sinhala Text-to-Speech System
This paper brings together the development of the first Text-to-Speech (TTS) system for Sinhala using the Festival framework and practical applications of it. Construction...
Ruvan Weerasinghe, Asanka Wasala, Viraj Welgama, K...