Syllabification of conversational speech using Bidirectional Long-Short-Term Memory Neural Networks

Segmentation of speech signals is a crucial task in many types of speech analysis. We present a novel approach to segmentation at the syllable level, using a Bidirectional Long-Short-Term Memory Neural Network. The network estimates syllable nucleus positions by regressing perceptually motivated input features onto a smooth target function. Peak selection on the network output then yields valid nucleus positions. Performance of the model is evaluated at two levels: syllables, and the vowel segments that make up the syllable nuclei. The general applicability of the approach is illustrated by good results on two common databases, Switchboard and TIMIT, covering both read and spontaneous speech, and by a favourable comparison with other published results.
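The two steps named in the abstract, a smooth regression target and peak selection, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the Gaussian shape of the target, the `width`, `threshold` and `min_distance` parameters, and the function names `smooth_target` and `pick_peaks`. The network itself is omitted; the ideal target stands in for its output.

```python
import math

def smooth_target(nucleus_frames, n_frames, width=3.0):
    """Build a smooth target: one Gaussian bump per nucleus frame.
    The Gaussian shape and `width` (in frames) are illustrative assumptions."""
    target = [0.0] * n_frames
    for t in range(n_frames):
        for c in nucleus_frames:
            bump = math.exp(-0.5 * ((t - c) / width) ** 2)
            target[t] = max(target[t], bump)
    return target

def pick_peaks(signal, threshold=0.5, min_distance=5):
    """Return indices of local maxima that reach `threshold` and lie at
    least `min_distance` frames apart; of two close candidates, the
    higher one is kept."""
    candidates = [
        i for i in range(1, len(signal) - 1)
        if signal[i] >= threshold
        and signal[i] > signal[i - 1]
        and signal[i] >= signal[i + 1]
    ]
    peaks = []
    # Visit candidates from highest to lowest so stronger peaks win.
    for i in sorted(candidates, key=lambda i: signal[i], reverse=True):
        if all(abs(i - p) >= min_distance for p in peaks):
            peaks.append(i)
    return sorted(peaks)

# Hypothetical reference nuclei (frame indices); the ideal target is used
# in place of a real network output.
reference = [10, 30, 42]
output = smooth_target(reference, n_frames=60)
print(pick_peaks(output))
```

On a real network output the curve would be noisy, so the threshold and minimum distance decide which local maxima count as nuclei; both would need tuning on held-out data.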

Christian Landsiedel, Jens Edlund, Florian Eyben, Daniel Neiberg, Björn Schuller

Bidirectional Long-Short-Term Memory | ICASSP 2011 | Signal Processing | Syllable Nucleus Positions | Valid Nuclei Positions

» Audiovisual classification of vocal outbursts in human conversation using Long-Short-Term Me...

» Combining monaural source separation with Long Short-Term Memory for increased robustness i...

» A multistream ASR framework for BLSTM modeling of conversational speech

» Robust Multistream Keyword and Nonlinguistic Vocalization Detection for Computationally In...

» Robust discriminative keyword spotting for emotionally colored spontaneous speech using bi...

» Audio recognition in the wild: Static and dynamic classification on a real-world database of...

Post Info

Added	20 Aug 2011
Updated	20 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Christian Landsiedel, Jens Edlund, Florian Eyben, Daniel Neiberg, Björn Schuller
