Unleashing the killer corpus: experiences in creating the multi-everything AMI Meeting Corpus

14 years 10 months ago

Download homepages.inf.ed.ac.uk

The AMI Meeting Corpus contains 100 hours of meetings captured using many synchronized recording devices, and is designed to support work in speech and video processing, language engineering, corpus linguistics, and organizational psychology. It has been transcribed orthographically, with annotated subsets for everything from named entities, dialogue acts, and summaries to simple gaze and head movement. In this written version of an LREC conference keynote address, I describe the data and how it was created. If this is ”killer” data, that presupposes a platform that it will ”sell”; in this case, that is the NITE XML Toolkit, which allows a distributed set of users to create, store, browse, and search annotations for the same base data that are both time-aligned against signal and related to each other structurally.

Jean Carletta

Real-time Traffic

AMI Meeting Corpus | LRE 2007 | Meetings | Organizational Psychology |

claim paper

Post Info
More Details (n/a)

Added	16 Dec 2010
Updated	16 Dec 2010
Type	Journal
Year	2007
Where	LRE
Authors	Jean Carletta

Comments (0)

Sciweavers

Unleashing the killer corpus: experiences in creating the multi-everything AMI Meeting Corpus

AMI Meeting Corpus | LRE 2007 | Meetings | Organizational Psychology |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers