Score fusion and calibration in multiple language detectors with large performance variation

13 years 3 months ago

Download mirlab.org

In a large-scale language detection task, performance variation found between different component systems and different target languages has an adverse effect to the pooled error statistics. Special care has to be taken in score fusion and calibration. In this paper, we use a prosodic LID system to fuse with a phonotactic LID system using NIST Language Recognition Evaluation 2009 experimental data. Among four logistic regression models, the one which gives the lowest Cavg is chosen. We further explore our previously proposed calibration algorithm based on the minimum erroneous deviation criterion. The algorithm is made more robust by removing the predetermined list of target languages to be calibrated, as well as by adding an optimization constraint which enforces calibration in the data portion with a large performance variation. The fusion and calibration operations together bring a 33.9% relative Cavg reduction compared with the original result from a phonotactic LID system.

Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin M

Real-time Traffic

ICASSP 2011 | Logistic Regression Models | Performance Variation | Signal Processing | Target Languages |

claim paper

Post Info
More Details (n/a)

Added	29 Aug 2011
Updated	29 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li

Comments (0)

Sciweavers

Score fusion and calibration in multiple language detectors with large performance variation

ICASSP 2011 | Logistic Regression Models | Performance Variation | Signal Processing | Target Languages |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers