Sciweavers

123 search results - page 7 / 25
» Improving Acoustic Models with Captioned Multimedia Speech
Sort
View
ICASSP
2011
IEEE
12 years 11 months ago
Improving acoustic event detection using generalizable visual features and multi-modality modeling
Acoustic event detection (AED) aims to identify both timestamps and types of multiple events and has been found to be very challenging. The cues for these events often times exist...
Po-Sen Huang, Xiaodan Zhuang, Mark Hasegawa-Johnso...
ICASSP
2011
IEEE
12 years 11 months ago
HMM-based speech synthesiser using the LF-model of the glottal source
A major factor which causes a deterioration in speech quality in HMM-based speech synthesis is the use of a simple delta pulse signal to generate the excitation of voiced speech. ...
João P. Cabral, Steve Renals, Junichi Yamag...
ICASSP
2010
IEEE
13 years 7 months ago
Optimizing spectral subtraction and wiener filtering for robust speech recognition in reverberant and noisy conditions
Speech enhancement is a common approach to address the effects of degradation due to noise and channel contamination. This approach is intended to suppress unwanted signal and rec...
Randy Gomez, Tatsuya Kawahara
ICASSP
2011
IEEE
12 years 11 months ago
Investigation of acoustic units for LVCSR systems
One important issue in designing state-of-the-art LVCSR systems is the choice of acoustic units. Context dependent (CD) phones remain the dominant form of acoustic units. They can...
Xunying Liu, Mark John Francis Gales, Jim L. Hiero...
ICASSP
2011
IEEE
12 years 11 months ago
The IBM 2009 GALE Arabic speech transcription system
We describe the Arabic broadcast transcription system elded by IBM in the GALE Phase 4 machine translation evaluation. Key advances over our Phase 3.5 system include improvements ...
Brian Kingsbury, Hagen Soltau, George Saon, Stephe...