A Speech Corpus for Modeling Language Acquisition: CAREGIVER

A multi-lingual speech corpus for modeling language acquisition, called CAREGIVER, has been designed and recorded within the framework of the EU-funded Acquisition of Communication and Recognition Skills (ACORNS) project. The paper describes the motivation behind the corpus and its design, drawing on current knowledge of infant language acquisition. Instead of recording infants and children, the voices of their primary and secondary caregivers were captured as read speech in both infant-directed and adult-directed modes across four languages. The paper also covers the challenges of, and methods for, obtaining prompts of similar complexity and semantics across the languages, as well as the normalized recording procedures employed at the different locations. The corpus contains nearly 66,000 utterance-based audio files spoken over a two-year period by 17 male and 17 female native speakers of Dutch, English, Finnish, and Swedish. An orthographical transcription is...

Toomas Altosaar, Louis ten Bosch, Guillaume Aimetti, Christos Koniaris, Kris Demuynck, Henk van den Heuvel

Acquisition Called Caregiver | Education | Language Acquisition | LREC 2010 | Multi-lingual Speech Corpus |

» Automatic Acquisition of Language Model based on Head-Dependent Relation between Words

» Phoneme acquisition model based on vowel imitation using Recurrent Neural Network

» On a Computational Model for Language Acquisition: Modeling Cross-Speaker Generalisation

» Domain Adaptation of Maximum Entropy Language Models

» Corpus-Based Tools for Computer-Assisted Acquisition of Reading Abilities in Cognate Languag...

» HIFI-AV: An Audiovisual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish

» How Spoken Language Corpora Can Refine Current Speech Motor Training Methodologies

» A Case-Based Reasoning Approach for Speech Corpus Generation

Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Toomas Altosaar, Louis ten Bosch, Guillaume Aimetti, Christos Koniaris, Kris Demuynck, Henk van den Heuvel

Sciweavers
