A low-power accelerator for the SPHINX 3 speech recognition system

14 years 5 months ago

Download www.cs.utah.edu

Accurate real-time speech recognition is not currently possible in the mobile embedded space where the need for natural voice interfaces is clearly important. The continuous nature of speech recognition coupled with an inherently large working set creates signiﬁcant cache interference with other processes. Hence real-time recognition is problematic even on high-performance general-purpose platforms. This paper provides a detailed analysis of CMU’s latest speech recognizer (Sphinx 3.2), identiﬁes three distinct processing phases, and quantiﬁes the architectural requirements for each phase. Several optimizations are then described which expose parallelism and drastically reduce the bandwidth and power requirements for real-time recognition. A special-purpose accelerator for the dominant Gaussian probability phase is developed for a 0.25µ CMOS process which is then analyzed and compared with Sphinx’s measured energy and performance on a 0.13µ 2.4 GHz Pentium 4 system. The res...

Binu K. Mathew, Al Davis, Zhen Fang

Real-time Traffic

CASES 2003 | Real-time Recognition | Special-purpose | Speech Recognition |

claim paper

Post Info
More Details (n/a)

Added	05 Jul 2010
Updated	05 Jul 2010
Type	Conference
Year	2003
Where	CASES
Authors	Binu K. Mathew, Al Davis, Zhen Fang

Comments (0)

Sciweavers

A low-power accelerator for the SPHINX 3 speech recognition system

CASES 2003 | Real-time Recognition | Special-purpose | Speech Recognition |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers