Using cross-decoder phone co-occurrences in phonotactic language recognition



Phonotactic language recognizers are based on the ability of phone decoders to produce phone sequences containing acoustic, phonetic and phonological information, which is partially dependent on the language. Input utterances are decoded and then scored by means of models for the target languages. Commonly, various decoders are applied in parallel and fused at the score level. A complementarity effect is expected when fusing scores, since each decoder is assumed to extract different (and complementary) information from the input utterance. This assumption is supported by the performance improvements attained when fusing systems. However, decodings are processed in a fully uncoupled way, so their time alignment (and the information that may be extracted from it) is completely lost. In this paper, a simple approach is proposed which takes time alignment information into account by considering cross-decoder phone co-occurrences at the frame level. To evaluate the approach, a choi...
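
To make the idea of frame-level cross-decoder co-occurrences concrete, the following is a minimal sketch, not the authors' implementation. It assumes each decoder outputs a list of (phone, start_frame, end_frame) segments with an exclusive end frame; the function names and the toy phone labels are hypothetical. It counts how often a phone from decoder A and a phone from decoder B are active in the same frame.

```python
from collections import Counter


def expand_to_frames(segments):
    """Expand (phone, start_frame, end_frame) segments into one label per frame.

    Assumption: end_frame is exclusive and segments are contiguous and ordered.
    """
    frames = []
    for phone, start, end in segments:
        frames.extend([phone] * (end - start))
    return frames


def cooccurrence_counts(decoding_a, decoding_b):
    """Count frame-level co-occurrences of phone pairs across two decodings."""
    frames_a = expand_to_frames(decoding_a)
    frames_b = expand_to_frames(decoding_b)
    # Truncate to the shorter decoding in case the two do not cover
    # exactly the same number of frames.
    n = min(len(frames_a), len(frames_b))
    return Counter(zip(frames_a[:n], frames_b[:n]))


def relative_frequencies(counts):
    """Normalize pair counts so they sum to 1 (empty input gives an empty dict)."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: count / total for pair, count in counts.items()}


# Toy example with made-up phone labels from two hypothetical decoders.
decoding_a = [("a", 0, 3), ("b", 3, 5)]
decoding_b = [("x", 0, 2), ("y", 2, 5)]

counts = cooccurrence_counts(decoding_a, decoding_b)
freqs = relative_frequencies(counts)
print(counts)
print(freqs)
```

Pair statistics of this kind could then serve as features for the target-language models. How the paper actually models and scores them is not stated in the truncated abstract above.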



Decoders | ICASSP 2010 | Input Utterance | Phone Decoders | Signal Processing |


» Active Learning for Classifying Phone Sequences from Unsupervised Phonotactic Models

» Tuning phone decoders for language identification

» On Acoustic Diversification FrontEnd for Spoken Language Identification

Post Info

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Mikel Peñagarikano, Amparo Varona, Luis Javier Rodríguez-Fuentes, Germán Bordel

