RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus

15 years 8 months ago

Download www.lrec-conf.org

This paper describes the Norwegian broadcast news speech corpus RUNDKAST. The corpus contains recordings of approximately 77 hours of broadcast news shows from the Norwegian broadcasting company NRK. The corpus covers both read and spontaneous speech as well as spontaneous dialogues and multipart discussions, including frequent occurrences of non-speech material (e.g. music, jingles). The recordings have large variations in speaking styles, dialect use and recording/transmission quality. RUNDKAST has been annotated for research in speech technology. The entire corpus has been manually segmented and transcribed using hierarchical levels. A subset of one hour of read and spontaneous speech from 10 different speakers has been manually annotated using broad phonetic labels. We provide a description of the database content, the annotation tools and strategies, and the conventions used for the different levels of annotation. A corpus of this kind has up to this point not been available for ...

Ingunn Amdal, Ole Morten Strand, Jørn Almbe

Real-time Traffic

Corpus Contains Recordings | Education | LREC 2008 | Speech Corpus Rundkast | Spontaneous Speech |

claim paper

» Many uses many annotations for large speech corpora Switchboard and TDT as case studies

» From Speech to Trees Applying Treebank Annotation to Arabic Broadcast News

» DiSCo A German Evaluation Corpus for Challenging Problems in the Broadcast Domain

» Lightly supervised and unsupervised acoustic model training

» Multimodal Speaker Identification Based on Text and Speech

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	LREC
Authors	Ingunn Amdal, Ole Morten Strand, Jørn Almberg, Torbjørn Svendsen

Comments (0)

Sciweavers

RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus

Corpus Contains Recordings | Education | LREC 2008 | Speech Corpus Rundkast | Spontaneous Speech |

Explore & Download

Productivity Tools

Sciweavers