How Many Different "John Smiths", and Who Are They?

14 years 1 months ago

Download www.d.umn.edu

In this work we propose three unsupervised measures to automatically identify the number of distinct entities a given ambiguous name refers to in a corpus. We experiment with 22 artificially created name conflations and observe that the measure (PK2) formulated as the ratio of two successive clustering criterion function values outperforms the other two measures. We also describe a method to assign a unique label to each discovered cluster so as to identify the underlying entity that it refers to.

Anagha Kulkarni, Ted Pedersen

Real-time Traffic

AAAI 2006 | Distinct Entities | Intelligent Agents | Successive Clustering Criterion | Unsupervised Measures |

claim paper

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2006
Where	AAAI
Authors	Anagha Kulkarni, Ted Pedersen

Comments (0)

Sciweavers

How Many Different "John Smiths", and Who Are They?

AAAI 2006 | Distinct Entities | Intelligent Agents | Successive Clustering Criterion | Unsupervised Measures |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers