Proximity-based document representation for named entity retrieval

14 years 4 months ago

Download maroo.cs.umass.edu

One aspect in which retrieving named entities is diﬀerent from retrieving documents is that the items to be retrieved – persons, locations, organizations – are only indirectly described by documents throughout the collection. Much work has been dedicated to ﬁnding references to named entities, in particular to the problems of named entity extraction and disambiguation. However, just as important for retrieval performance is how these snippets of text are combined to build named entity representations. We focus on the TREC expert search task where the goal is to identify people who are knowledgeable on a speciﬁc topic. Existing language modeling techniques for expert ﬁnding assume that terms and person entities are conditionally independent given a document. We present theoretical and experimental evidence that this simplifying assumption ignores information on how named entities relate to document content. To address this issue, we propose a new document representation whi...

Desislava Petkova, W. Bruce Croft

Real-time Traffic

CIKM 2007 | Entities | Information Management | Person Entities | Retrieval Performance |

claim paper

Post Info
More Details (n/a)

Added	18 Oct 2010
Updated	18 Oct 2010
Type	Conference
Year	2007
Where	CIKM
Authors	Desislava Petkova, W. Bruce Croft

Comments (0)

Sciweavers

Proximity-based document representation for named entity retrieval

CIKM 2007 | Entities | Information Management | Person Entities | Retrieval Performance |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers