A dynamic approach to the selection of high order n-grams in phonotactic language recognition

14 years 11 months ago

Download mirlab.org

Due to computational bounds, most SVM-based phonotactic language recognition systems consider only low-order n-grams (up to n = 3), thus limiting the potential performance of this approach. The huge amount of n-grams for n ≥ 4 makes it computationally unfeasible even selecting the most frequent n-grams. In this paper, we demonstrate the feasibility and usefulness of using high-order n-grams for n = 4, 5, 6, 7 in SVM-based phonotactic language recognition, thanks to a dynamic n-gram selection algorithm. The most frequent n-grams are selected, but computational issues (those regarding memory requirements) are prevented, since counts are periodically updated and only those units with the highest counts are retained for subsequent processing. Systems were built by means of open software (Brno University of Technology phone decoders, HTK, LIBLINEAR and FoCal) and experiments were carried out on the NIST LRE2007

Mikel Peñagarikano, Amparo Varona, Luis Jav

Real-time Traffic

Frequent N-grams | ICASSP 2011 | Phonotactic Language Recognition | Signal Processing | SVM-based Phonotactic Language |

claim paper

» TimeFrequency Cepstral Features and Heteroscedastic Linear Discriminant Analysis for Langu...

» Submotions for Hidden Markov Model Based Dynamic Facial Action Recognition

» Bayesian Models for Keyhole Plan Recognition in an Adventure Game

» Exploring HighD Spaces with Multiform Matrices and Small Multiples

» Action recognition using exemplarbased embedding

» Dynamic Software Architecture Slicing

» Tool Support for Design Pattern Recognition at Model Level

Post Info
More Details (n/a)

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Mikel Peñagarikano, Amparo Varona, Luis Javier Rodríguez-Fuentes, Germán Bordel

Comments (0)

Sciweavers

A dynamic approach to the selection of high order n-grams in phonotactic language recognition

Frequent N-grams | ICASSP 2011 | Phonotactic Language Recognition | Signal Processing | SVM-based Phonotactic Language |

Explore & Download

Productivity Tools

Sciweavers