Sciweavers

LREC
2010

A Database of Age and Gender Annotated Telephone Speech

14 years 1 months ago
A Database of Age and Gender Annotated Telephone Speech
This article describes an age-annotated database of German telephone speech. All in all 47 hours of prompted and free text was recorded, uttered by 954 paid participants in a style typical for automated voice services. The participants were selected based on an equal distribution of males and females within four age cluster groups; children, youth, adults and seniors. Within the children, gender is not distinguished, because it doesn't have a strong enough effect on the voice. The textual content was designed to be typical for automated voice services and consists mainly of short commands, single words and numbers. An additional database consists of 659 speakers (368 female and 291 male) that called an automated voice portal server and answered freely on one of the two questions "What is your favourite dish?" and "What would you take to an island?" (island set, 422 speakers). This data might be used for out-of domain testing. The data will be used to tune an a...
Felix Burkhardt, Martin Eckert, Wiebke Johannsen,
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2010
Where LREC
Authors Felix Burkhardt, Martin Eckert, Wiebke Johannsen, Joachim Stegmann
Comments (0)