Why Is the Recognition of Spontaneous Speech so Hard?

Read speech and similar types of speech, e.g. speech from reading newspapers or from news broadcasts, can be recognized with high accuracy, but recognition accuracy decreases drastically for spontaneous speech. This is because spontaneous speech and read speech differ significantly, both acoustically and linguistically. This paper reports on the analysis and recognition of spontaneous speech using a large-scale spontaneous speech database, the “Corpus of Spontaneous Japanese (CSJ)”. Recognition results in this experiment show that recognition accuracy increases significantly as a function of the size of both the acoustic and the language model training data, and that the improvement levels off at approximately 7M words of training data. This means that the acoustic and linguistic variation of spontaneous speech is so large that a very large corpus is needed to encompass it. Spectral analysis using various styles of utterances in the CS...

Sadaoki Furui, Masanobu Nakamura, Tomohisa Ichiba, Koji Iwano

Large-scale Spontaneous Speech | Recognition Accuracy | Signal Processing | Spontaneous Speech | TSD 2005

» Comparison of Spectral Properties of Read Prepared and Casual Speech in French

» Which words are hard to recognize Prosodic lexical and disfluency factors that increase sp...

» A Speaker Clustering Algorithm for Fast Speaker Adaptation in Continuous Speech Recognitio...

» Acoustic model training for nonaudible murmur recognition using transformed normal speech ...

» TermWeighting for Summarization of Multiparty Spoken Dialogues

» Static vs dynamic modeling of human nonverbal behavior from multiple cues and modalities

» Emotion Recognition Based on Physiological Changes in Music Listening

» Detecting emotional state of a child in a conversational computer game

Post Info

Added	28 Jun 2010
Updated	28 Jun 2010
Type	Conference
Year	2005
Where	TSD
Authors	Sadaoki Furui, Masanobu Nakamura, Tomohisa Ichiba, Koji Iwano
