Accurate transcription of broadcast news speech using multiple noisy transcribers and unsupervised reliability metrics


Professional manual transcription of speech is an expensive and time-consuming process. This paper focuses on the problem of combining noisy transcriptions from multiple non-expert transcribers, where the quality of work varies from worker to worker. Computing transcriber reliability is a difficult task in the absence of gold standard reference transcripts. Three simple metrics for quantifying this reliability without using a gold standard are proposed. We create a database of 1000 Mexican Spanish broadcast news audio clips, each transcribed by five transcribers through Amazon Mechanical Turk. Combining multiple noisy transcripts using these reliability scores improves the word error rate of the combined transcript with respect to the LDC gold standard by 8% relative, and the sentence error rate by 4.1% relative, compared with a combination without any reliability information.
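The abstract does not describe the three proposed metrics, so the following is only a minimal sketch of the general idea, under stated assumptions: it computes word error rate with a standard word-level edit distance, and it uses a hypothetical agreement score (one minus a transcriber's mean WER against the other transcribers of the same clip) as a stand-in for a reliability metric that needs no gold standard. The function names and the "pick the highest-scoring transcript" rule are illustrative assumptions, not the paper's method, which combines transcripts rather than selecting one.

```python
from typing import List, Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Word-level Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(ref: str, hyp: str) -> float:
    """Word error rate of hyp measured against ref."""
    ref_words = ref.split()
    hyp_words = hyp.split()
    return edit_distance(ref_words, hyp_words) / max(len(ref_words), 1)


def agreement_reliability(transcripts: List[str]) -> List[float]:
    """Hypothetical unsupervised score (an assumption, not from the paper):
    1 minus the mean WER of each transcript against all the others,
    clipped at 0. No gold standard transcript is used."""
    scores = []
    for i, t in enumerate(transcripts):
        others = [o for k, o in enumerate(transcripts) if k != i]
        if not others:
            scores.append(1.0)  # a lone transcript has nothing to disagree with
            continue
        mean_disagreement = sum(wer(o, t) for o in others) / len(others)
        scores.append(max(0.0, 1.0 - mean_disagreement))
    return scores


def pick_most_reliable(transcripts: List[str]) -> str:
    """Simplified stand-in for combination: return the top-scoring transcript."""
    scores = agreement_reliability(transcripts)
    best = max(range(len(transcripts)), key=scores.__getitem__)
    return transcripts[best]
```

For a clip with several noisy transcripts, `pick_most_reliable(transcripts)` returns the one that agrees most with the rest; the same `wer` function can then score that output against a gold standard reference when one is available.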

Kartik Audhkhasi, Panayiotis G. Georgiou, Shrikanth S. Narayanan


Error Rate | Gold Standard | ICASSP 2011 | Multiple Non-expert Transcribers | Signal Processing


Post Info

Added	20 Aug 2011
Updated	20 Aug 2011
Type	Conference
Year	2011
Where	ICASSP
Authors	Kartik Audhkhasi, Panayiotis G. Georgiou, Shrikanth S. Narayanan
