Predicting human perception and ASR classification of word-final [t] by its acoustic sub-segmental properties

Download pubman.mpdl.mpg.de

This paper presents a study of the acoustic sub-segmental properties of word-final /t/ in conversational standard Dutch and of how these properties contribute to whether humans and an ASR system classify the /t/ as acoustically present or absent. In general, humans and the ASR system use the same cues (presence of a constriction, a burst, and alveolar friction), but the ASR system is less sensitive than human listeners to fine cues (weak bursts, smoothly starting friction) and is misled by the presence of glottal vibration. These data inform the further development of models of human and automatic speech processing.

Barbara Schuppler, Mirjam Ernestus, Wim A. van Dommelen, Jacques C. Koreman

Acoustic Sub-segmental Properties | Automatic Speech Processing | Conversational Standard Dutch | INTERSPEECH 2010 | Signal Processing

Post Info

Added	18 May 2011
Updated	18 May 2011
Type	Journal
Year	2010
Where	INTERSPEECH
Authors	Barbara Schuppler, Mirjam Ernestus, Wim A. van Dommelen, Jacques C. Koreman

Sciweavers
