Speech Recognition for a Digital Video Library

14 years 1 months ago

Download www.informedia.cs.cmu.edu

The standard method for making the full content of audio and video material searchable and is to annotate it with humangenerated meta-data that describes the content in a way that the search can understand, as is done in the creation of multimedia CD-ROMs. However, for the huge amounts of data that could usefully be included in digital video and audio libraries, the cost of producing this meta-data is prohibitive. In the Informedia Digital Video Library, the production of the meta-data supporting the library interface is automated using techniques derived from artificial intelligence (AI) research. By applying speech recognition together with natural language processing, information retrieval and image analysis, an interface has been produced that helps users locate the information they want and navigate or browse the digital video library more effectively. Specific interface components include automatic titles, filmstrips, video skims, word location marking and representative frames ...

Michael J. Witbrock, Alexander G. Hauptmann

Real-time Traffic

Digital Video | Information Retrieval | JASIS 1998 | Speech Recognition |

claim paper

Post Info
More Details (n/a)

Added	22 Dec 2010
Updated	22 Dec 2010
Type	Journal
Year	1998
Where	JASIS
Authors	Michael J. Witbrock, Alexander G. Hauptmann

Comments (0)

Sciweavers

Speech Recognition for a Digital Video Library

Digital Video | Information Retrieval | JASIS 1998 | Speech Recognition |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers