Generating Hypermedia Documents from Transcriptions of Television Programs Using Parallel Text Alignment

15 years 6 months ago

Download www.research.att.com

This paper presents a method of automatically creating hypermedia documents from conventional transcriptions of television programs. Using parallel text alignment techniques, the temporal information derived from the closed caption signal is exploited to convert the transcription into a synchronized text stream. Given this text stream, we can create links between the transcription and the image and audio media streams. We describe a two-pass method for aligning parallel texts that first uses dynamic programming techniques to maximize the number corresponding words (by minimizing the word edit distance). The second stage converts the word alignment into a sentence alignment, taking into account the cases of sentence split and merge. We present results of text alignment on a database of 610 programs (including three television news programs over a oneyear period) for which we have closed caption, transcript, audio and image streams. The techniques presented here can produce high quality...

David C. Gibbon

Real-time Traffic

Document Analysis | Parallel Text Alignment | RIDE 1998 | Text Alignment | Text Stream |

claim paper

Added	05 Aug 2010
Updated	05 Aug 2010
Type	Conference
Year	1998
Where	RIDE
Authors	David C. Gibbon

Sciweavers

Generating Hypermedia Documents from Transcriptions of Television Programs Using Parallel Text Alignment

Document Analysis | Parallel Text Alignment | RIDE 1998 | Text Alignment | Text Stream |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers