The automatic processing of speech collected in conference-style meetings has attracted considerable interest, with several large-scale projects devoted to this area. This paper describes the development of a baseline automatic speech transcription system for meetings in the context of the AMI (Augmented Multi-party Interaction) project. We present several techniques important for processing this type of data and report performance in terms of word error rates (WERs). An important aspect of transcribing this data is the flexibility required in audio pre-processing: real-world systems have to deal with varying input, for example from microphone arrays or randomly placed microphones in a room. Automatic segmentation and microphone array processing techniques are described, and their effect on WERs is discussed. The system and its components presented in this paper yield competitive performance and form a baseline for future research in this domain.