Building transcribed speech corpora quickly and cheaply for many languages

15 years 2 months ago

Download static.googleusercontent.com

We present a system for quickly and cheaply building transcribed speech corpora containing utterances from many speakers in a variety of acoustic conditions. The system consists of a client application running on an Android mobile device with an intermittent Internet connection to a server. The client application collects demographic information about the speaker, fetches textual prompts from the server for the speaker to read, records the speaker's voice, and uploads the audio and associated metadata to the server. The system has so far been used to collect over 3000 hours of transcribed audio in 17 languages around the world.

Thad Hughes, Kaisuke Nakajima, Linne Ha, Atul Vasu

Real-time Traffic

Android Mobile Device | Client Application | Intermittent Internet Connection | INTERSPEECH 2010 | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	18 May 2011
Updated	18 May 2011
Type	Journal
Year	2010
Where	INTERSPEECH
Authors	Thad Hughes, Kaisuke Nakajima, Linne Ha, Atul Vasu, Pedro J. Moreno, Mike LeBeau

Comments (0)

Sciweavers

Building transcribed speech corpora quickly and cheaply for many languages

Android Mobile Device | Client Application | Intermittent Internet Connection | INTERSPEECH 2010 | Signal Processing |

Explore & Download

Productivity Tools

Sciweavers