Integrating Multi-level Linguistic Knowledge with a Unified Framework for Mandarin Speech Recognition

14 years 1 months ago

Download www.aclweb.org

To improve the Mandarin large vocabulary continuous speech recognition (LVCSR), a unified framework based approach is introduced to exploit multi-level linguistic knowledge. In this framework, each knowledge source is represented by a Weighted Finite State Transducer (WFST), and then they are combined to obtain a so-called analyzer for integrating multi-level knowledge sources. Due to the uniform transducer representation, any knowledge source can be easily integrated into the analyzer, as long as it can be encoded into WFSTs. Moreover, as the knowledge in each level is modeled independently and the combination is processed in the model level, the information inherently in each knowledge source has a chance to be thoroughly exploited. By simulations, the effectiveness of the analyzer is investigated, and then a LVCSR system embedding the presented analyzer is evaluated. Experimental results reveal that this unified framework is an effective approach which significantly improves the pe...

Xinhao Wang, Jiazhong Nie, Dingsheng Luo, Xihong W

Real-time Traffic

EMNLP 2008 | Knowledge Source | Natural Language Processing | Speech Recognition | Unified Framework |

claim paper

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	EMNLP
Authors	Xinhao Wang, Jiazhong Nie, Dingsheng Luo, Xihong Wu

Comments (0)

Sciweavers

Integrating Multi-level Linguistic Knowledge with a Unified Framework for Mandarin Speech Recognition

EMNLP 2008 | Knowledge Source | Natural Language Processing | Speech Recognition | Unified Framework |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers