Sciweavers

JCDL
2006
ACM

Search engine driven author disambiguation

14 years 6 months ago
Search engine driven author disambiguation
In scholarly digital libraries, author disambiguation is an important task that attributes a scholarly work with specific authors. This is critical when individuals share the same name. We present an approach to this task that analyzes the results of automatically-crafted web searches. A key observation is that pages from rare web sites are stronger source of evidence than pages from common web sites, which we model as Inverse Host Frequency (IHF). Our system is able to achieve an average accuracy of 0.836. Categories and Subject Descriptors: H.3.3 Information Systems – Information Search and Retrieval General Terms: Algorithms
Yee Fan Tan, Min-Yen Kan, Dongwon Lee
Added 14 Jun 2010
Updated 14 Jun 2010
Type Conference
Year 2006
Where JCDL
Authors Yee Fan Tan, Min-Yen Kan, Dongwon Lee
Comments (0)