Sciweavers

ICASSP
2010
IEEE

Unsupervised knowledge acquisition for Extracting Named Entities from speech

13 years 7 months ago
Unsupervised knowledge acquisition for Extracting Named Entities from speech
This paper presents a Named Entity Recognition (NER) method dedicated to process speech transcriptions. The main principle behind this method is to collect in an unsupervised way lexical knowledge for all entries in the ASR lexicon. This knowledge is gathered with two methods: by automatically extracting NEs on a very large set of textual corpora and by exploiting directly the structure contained in the Wikipedia resource. This lexical knowledge is used to update the statistical models of our NER module based on a mixed approach with generative models (Hidden Markov Models - HMM) and discriminative models (Conditional Random Field - CRF). This approach has been evaluated within the French ESTER 2 evaluation program and obtained the best results at the NER task on ASR transcripts.
Frédéric Béchet, Eric Charton
Added 17 May 2011
Updated 17 May 2011
Type Journal
Year 2010
Where ICASSP
Authors Frédéric Béchet, Eric Charton
Comments (0)