Improving Proper Name Recognition by Adding Automatically Learned Pronunciation Variants to the Lexicon



This paper deals with the task of large vocabulary proper name recognition. In order to accommodate a wide diversity of possible name pronunciations (due to non-native name origins or speaker tongues), a multilingual acoustic model is combined with a lexicon comprising 3 grapheme-to-phoneme (G2P) transcriptions (from G2P transcribers for 3 different languages) and up to 4 so-called phoneme-to-phoneme (P2P) transcriptions. The latter are generated with (speaker tongue, name source) specific P2P converters that try to transform a set of baseline name transcriptions into a pool of transcription variants that lie closer to the 'true' name pronunciations. The experimental results show that the generated P2P variants can be employed to improve name recognition, and that the obtained accuracy is comparable to what is achieved with typical (TY) transcriptions (made by a human expert). Furthermore, it is demonstrated that the P2P conversion can best be instantiated from a baseline transcript...
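The lexicon construction described in the abstract can be illustrated with a minimal sketch. It assumes that each name receives one G2P transcription per language and that a P2P converter, selected for a given (speaker tongue, name source) pair, proposes variants from one baseline transcription, of which at most 4 are kept. The function name `build_lexicon_entry`, the language keys, the toy transcribers, and the removal of duplicate transcriptions are all hypothetical choices made for illustration; they are not taken from the paper.

```python
from typing import Callable, Dict, List

# A transcription is represented as a plain string here; the notation is illustrative.
Transcription = str


def build_lexicon_entry(
    name: str,
    g2p_transcribers: Dict[str, Callable[[str], Transcription]],
    p2p_converter: Callable[[Transcription], List[Transcription]],
    baseline_language: str,
    max_p2p_variants: int = 4,
) -> List[Transcription]:
    """Collect the G2P transcriptions and up to `max_p2p_variants` P2P variants."""
    # One G2P transcription per language-specific transcriber.
    g2p = {lang: transcribe(name) for lang, transcribe in g2p_transcribers.items()}

    # The P2P converter starts from a single baseline transcription
    # (which baseline works best is not stated in the truncated abstract).
    variants = p2p_converter(g2p[baseline_language])[:max_p2p_variants]

    # Assumption: duplicates are dropped while preserving order.
    entry: List[Transcription] = []
    for transcription in list(g2p.values()) + variants:
        if transcription not in entry:
            entry.append(transcription)
    return entry


# Toy stand-ins for real G2P transcribers and a real P2P converter.
toy_g2p = {
    "lang_a": lambda name: "a:" + name.lower(),
    "lang_b": lambda name: "b:" + name.lower(),
    "lang_c": lambda name: "c:" + name.lower(),
}


def toy_p2p(baseline: Transcription) -> List[Transcription]:
    # Proposes six candidate variants; only the first four are kept by the caller.
    return [f"{baseline}#v{i}" for i in range(1, 7)]


entry = build_lexicon_entry("Martens", toy_g2p, toy_p2p, baseline_language="lang_a")
print(entry)
```

With these toy inputs the entry holds the three G2P transcriptions followed by four P2P variants. In a real system the converter would be picked per (speaker tongue, name source) pair, which this sketch leaves to the caller.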

Bert Réveil, Jean-Pierre Martens, Henk van den Heuvel


Education | Generated P2P Variants | LREC 2010 | Speaker Tongue | Specific P2P Converters


Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Bert Réveil, Jean-Pierre Martens, Henk van den Heuvel
