Sciweavers

ICASSP
2011
IEEE

Syllabification of conversational speech using Bidirectional Long-Short-Term Memory Neural Networks

13 years 4 months ago
Syllabification of conversational speech using Bidirectional Long-Short-Term Memory Neural Networks
Segmentation of speech signals is a crucial task in many types of speech analysis. We present a novel approach at segmentation on a syllable level, using a Bidirectional Long-Short-Term Memory Neural Network. It performs estimation of syllable nucleus positions based on regression of perceptually motivated input features to a smooth target function. Peak selection is performed to attain valid nuclei positions. Performance of the model is evaluated on the levels of both syllables and the vowel segments making up the syllable nuclei. The general applicability of the approach is illustrated by good results for two common databases—Switchboard and TIMIT—for both read and spontaneous speech, and a favourable comparison with other published results.
Christian Landsiedel, Jens Edlund, Florian Eyben,
Added 20 Aug 2011
Updated 20 Aug 2011
Type Journal
Year 2011
Where ICASSP
Authors Christian Landsiedel, Jens Edlund, Florian Eyben, Daniel Neiberg, Björn Schuller
Comments (0)