CLIN
2000

Proper Name Extraction from Non-Journalistic Texts

This paper discusses the influence of the corpus on the automatic identification of proper names in texts. Techniques developed for the newswire genre are generally not sufficient to deal with larger corpora containing texts that do not follow strict writing constraints (for example, e-mail messages or transcriptions of oral conversations). After a brief review of the research performed on news texts, we present some of the problems involved in the analysis of two different corpora: e-mails and hand-transcribed telephone conversations. Having presented the sources of errors, we describe an approach for adapting a proper name extraction system developed for newspaper texts to the analysis of e-mail messages.

Keywords: Proper Name Extraction, Corpus, Information Extraction

Thierry Poibeau, Leila Kosseim

Related Content

» Finite-State Transducer Cascade to Extract Proper Names in Texts

» Proper Names Extraction from Fax Images Combining Textual and Image Features

» Neural Network-Based Proper Names Extraction in Fax Images

» Transliteration of proper names in cross-language applications

» Identification and Classification of Proper Nouns in Chinese Texts

» A Method of Geographical Name Extraction from Japanese Text for Thematic Geographical Sear...

» Text Extraction from Name Cards with Complex Design

» Cultural Heritage Knowledge Extraction from Web Documents

» Extracting person names from diverse and noisy OCR text

Post Info

Added	01 Nov 2010
Updated	01 Nov 2010
Type	Conference
Year	2000
Where	CLIN
Authors	Thierry Poibeau, Leila Kosseim
