Sciweavers

112 search results - page 9 / 23
» Epoch Extraction From Speech Signals
ICPR
2006
IEEE
14 years 8 months ago
Phoneme segmentation of speech
In most approaches to speech recognition, the speech signals are segmented using constant-time segmentation, for example into 25 ms blocks. Constant segmentation risks losing info...
Bartosz Ziółko, Suresh Manandhar, Richard C...
ICASSP
2008
IEEE
14 years 1 month ago
Caption-aided speech detection in videos
This paper presents a novel audio-visual fusion method for speech detection, which is an important front-end for content-based video processing. This approach aims to extract homo...
Cong Li, Zhijian Ou, Wei Hu, Tao Wang, Yimin Zhang
TSD
1999
Springer
13 years 11 months ago
Classifying Visemes for Automatic Lipreading
Automatic lipreading is automatic speech recognition that uses only visual information. The relevant data in a video signal is isolated and features are extracted from it. From a s...
Michiel Visser, Mannes Poel, Anton Nijholt
ICASSP
2011
IEEE
12 years 11 months ago
Tonal context labeling using quantized F0 symbols for improving tone correctness in average-voice-based speech synthesis
This paper proposes a technique for improving tone correctness in Thai speech synthesis based on an average voice model trained with nonprofessional speech corpus. The proposed te...
Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
ICASSP
2010
IEEE
13 years 7 months ago
Perceptual audio features for unsupervised key-phrase detection
We propose a new type of audio feature (HFCC-ENS) as well as an unsupervised method for detecting short sequences of spoken words (key-phrases) within long speech recordings. Our ...
Dirk von Zeddelmann, Frank Kurth, Meinard Mül...