Searching in audio: the utility of transcripts, dichotic presentation, and time-compression

15 years 25 days ago

Download www.dgp.toronto.edu

Searching audio data can potentially be facilitated by the use of automatic speech recognition (ASR) technology to generate text transcripts which can then be easily queried. However, since current ASR technology cannot reliably generate 100% accurate transcripts, additional techniques for fluid browsing and searching of the audio itself are required. We explore the impact of transcripts of various qualities, dichotic presentation, and time-compression on an audio search task. Results show that dichotic presentation and reasonably accurate transcripts can assist in the search process, but suggest that time-compression and low accuracy transcripts should be used carefully. Author Keywords Dichotic listening, transcripts, audio time-compression. ACM Classification Keywords H5.2 [User Interfaces]: Interaction styles, Auditory interfaces

Abhishek Ranjan, Ravin Balakrishnan, Mark H. Chign

Real-time Traffic

Accurate Transcripts | CHI 2006 | Human Computer Interaction | Keywords Dichotic Listening | Low Accuracy Transcripts |

claim paper

Post Info
More Details (n/a)

Added	30 Nov 2009
Updated	30 Nov 2009
Type	Conference
Year	2006
Where	CHI
Authors	Abhishek Ranjan, Ravin Balakrishnan, Mark H. Chignell

Comments (0)

Sciweavers

Searching in audio: the utility of transcripts, dichotic presentation, and time-compression

Accurate Transcripts | CHI 2006 | Human Computer Interaction | Keywords Dichotic Listening | Low Accuracy Transcripts |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers