Sciweavers

MLMI
2005
Springer

The Development of the AMI System for the Transcription of Speech in Meetings

14 years 5 months ago
The Development of the AMI System for the Transcription of Speech in Meetings
The automatic processing of speech collected in conference style meetings has attracted considerable interest with several large scale projects devoted to this area. This paper describes the development of a baseline automatic speech transcription system for meetings in the context of the AMI (Augmented Multiparty Interaction) project. We present several techniques important to processing of this data and show the performance in terms of word error rates (WERs). An important aspect of transcription of this data is the necessary flexibility in terms of audio pre-processing. Real world systems have to deal with flexible input, for example by using microphone arrays or randomly placed microphones in a room. Automatic segmentation and microphone array processing techniques are described and the effect on WERs is discussed. The system and its components presented in this paper yield compettive performance and form a baseline for future research in this domain.
Thomas Hain, Lukas Burget, John Dines, Iain McCowa
Added 28 Jun 2010
Updated 28 Jun 2010
Type Conference
Year 2005
Where MLMI
Authors Thomas Hain, Lukas Burget, John Dines, Iain McCowan, Giulia Garau, Martin Karafiát, Mike Lincoln, Darren Moore, Vincent Wan, Roeland Ordelman, Steve Renals
Comments (0)