A Person-Name Filter for Automatic Compilation of Bilingual Person-Name Lexicons

14 years 2 months ago

Download www.lrec-conf.org

This paper proposes a simple and fast person-name filter, which plays an important role in automatic compilation of a large bilingual person-name lexicon. This filter is based on pn score, which is the sum of two component scores, the score of the first name and that of the last name. Each score is calculated from two term sets: one is a dense set in which most of the members are person names; another is a baseline set that contains less person names. The pn score takes one of five values, {+2, +1, 0, -1, -2 }, which correspond to strong positive, positive, undecidable, negative, and strong negative, respectively. This pn score can be easily extended to bilingual pn score that takes one of nine values, by summing scores of two languages. Experimental results show that our method works well for monolingual person names in English and Japanese; the F-score of each language is 0.929 and 0.939, respectively. The performance of the bilingual person-name filter is better; the F-score is 0.9...

Satoshi Sato, Sayoko Kaide

Real-time Traffic

Education | LREC 2010 | Person Names | Person-name Filter | Pn Score |

claim paper

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Satoshi Sato, Sayoko Kaide

Comments (0)

Sciweavers

A Person-Name Filter for Automatic Compilation of Bilingual Person-Name Lexicons

Education | LREC 2010 | Person Names | Person-name Filter | Pn Score |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers