Speech modeling based on committee-based active learning

13 years 11 months ago

Download www.furui.cs.titech.ac.jp

We propose a committee-based active learning method for large vocabulary continuous speech recognition. In this approach, multiple recognizers are prepared beforehand, and the recognition results obtained from them are used for selecting utterances. Here, a progressive search method is used for aligning sentences, and voting entropy is used as a measure for selecting utterances. We apply our method not only to acoustic models but also to language models and their combination. Our method was evaluated by using 190-hour speech data in the Corpus of Spontaneous Japanese. It proved to be signiﬁcantly better than random selection. It only required 63 h of data to achieve a word accuracy of 74%, while standard training (i.e., random selection) required 97 h of data. The recognition accuracy of our proposed method was also better than that of the conventional uncertainty sampling method using word posterior probabilities as the conﬁdence measure for selecting sentences.

Yuzu Hamanaka, Koichi Shinoda, Sadaoki Furui, Tada

Real-time Traffic

Active Learning Method | Continuous Speech Recognition | ICASSP 2010 | Progressive Search Method | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Yuzu Hamanaka, Koichi Shinoda, Sadaoki Furui, Tadashi Emori, Takafumi Koshinaka

Comments (0)

Sciweavers

Speech modeling based on committee-based active learning

Active Learning Method | Continuous Speech Recognition | ICASSP 2010 | Progressive Search Method | Signal Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers