Design and Recording of Czech Audio-Visual Database with Impaired Conditions for Continuous Speech Recognition

15 years 8 months ago

Download www.lrec-conf.org

In this paper we discuss the design, acquisition and preprocessing of a Czech audio-visual speech corpus. The corpus is intended for training and testing of existing audio-visual speech recognition system. The name of the database is UWB-07-ICAVR, where ICAVR stands for Impaired Condition Audio Visual speech Recognition. The corpus consist of 10000 utterances of continuous speech obtained from 50 speakers. The total length of the database is 25 hours. Each utterance is stored as a separate sentence. The corpus extends existing databases by covering condition of variable illumination. We acquired 50 speakers, where half of them were men and half of them were women. Recording was done by two cameras and two microphones. Database introduced in this paper can be used for testing of visual parameterization in audio-visual speech recognition (AVSR). Corpus can be easily split into training and testing part. Each speaker pronounced 200 sentences: first 50 were the same for all, the rest of t...

Jana Trojanová, Marek Hrúz, Pavel Ca

Real-time Traffic

Audio-visual Speech | Audio-Visual Speech Recognition | Education | LREC 2008 | Speech Recognition |

claim paper

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	LREC
Authors	Jana Trojanová, Marek Hrúz, Pavel Campr, Milos Zelezný

Sciweavers

Design and Recording of Czech Audio-Visual Database with Impaired Conditions for Continuous Speech Recognition

Audio-visual Speech | Audio-Visual Speech Recognition | Education | LREC 2008 | Speech Recognition |

Explore & Download

Productivity Tools

Sciweavers