Collecting Voices from the Cloud

15 years 8 months ago

Download www.lrec-conf.org

The collection and transcription of speech data is typically an expensive and time-consuming task. Voice over IP and cloud computing are poised to greatly reduce this impediment to research on spoken language interfaces in many domains. This paper documents our efforts to deploy speech-enabled web interfaces to large audiences over the Internet via Amazon Mechanical Turk, an online marketplace for work. Using the open source WAMI Toolkit, we collected corpora in two different domains which collectively constitute over 113 hours of speech. The first corpus contains 100,000 utterances of read speech, and was collected by asking workers to record street addresses in the United States. For the second task, we collected conversations with FlightBrowser, a multimodal spoken dialogue system. The FlightBrowser corpus obtained contains 10,651 utterances composing 1,113 individual dialogue sessions from 101 distinct users. The aggregate time spent collecting the data for both corpora was just u...

Ian McGraw, Chia-ying Lee, I. Lee Hetherington, St

Real-time Traffic

Education | LREC 2010 | Multimodal Spoken Dialogue | Speech-enabled Web Interfaces | Spoken Language Interfaces |

claim paper

» Cloud Computing and the Lessons from the Past

» Context preserving dynamic word cloud visualization

» ContextPreserving Dynamic Word Cloud Visualization

» VDN Virtual machine image distribution network for cloud data centers

» From StructurefromMotion Point Clouds to Fast Location Recognition

» Voice attributes affecting likability perception

» Power of Clouds in Your Pocket An Efficient Approach for Cloud Mobile Hybrid Application D...

» Typhoon Locating and Reconstruction from the Infrared Satellite Cloud Image

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Ian McGraw, Chia-ying Lee, I. Lee Hetherington, Stephanie Seneff, Jim Glass

Comments (0)

Sciweavers

Collecting Voices from the Cloud

Education | LREC 2010 | Multimodal Spoken Dialogue | Speech-enabled Web Interfaces | Spoken Language Interfaces |

Explore & Download

Productivity Tools

Sciweavers