Interactive visualisation techniques for dynamic speech transcription, correction and training

14 years 3 months ago

Download www.scss.tcd.ie

As performance gains in automatic speech recognition systems plateau, improvements to existing applications of speech recognition technology seem more likely to come from better user interface design than from further progress in core recognition components. Among all applications of speech recognition, the usability of systems for transcription of spontaneous speech is particularly sensitive to high word error rates. This paper presents a series of approaches to improving the usability of such applications. We propose new mechanisms for error correction, use of contextual information, and use of 3D visualisation techniques to improve user interaction with a recogniser and maximise the impact of user feedback. These proposals are illustrated through several prototypes which target tasks such as: off-line transcript editing, dynamic transcript editing, and real-time visualisation of recognition paths. An evaluation of our dynamic transcript editing system demonstrates the gains that ca...

Saturnino Luz, Masood Masoodian, Bill Rogers

Real-time Traffic

Automatic Speech | Automatic Speech Recognition | CHINZ 2008 | Human Computer Interaction | Speech Recognition |

claim paper

Post Info
More Details (n/a)

Added	12 Oct 2010
Updated	12 Oct 2010
Type	Conference
Year	2008
Where	CHINZ
Authors	Saturnino Luz, Masood Masoodian, Bill Rogers

Comments (0)

Sciweavers

Interactive visualisation techniques for dynamic speech transcription, correction and training

Automatic Speech | Automatic Speech Recognition | CHINZ 2008 | Human Computer Interaction | Speech Recognition |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers