An Architecture for Finding Entities on the Web

16 years 1 months ago

Download www.l3s.de

Abstract—Recent progress in research ﬁelds such as Information Extraction and Information Retrieval enables the creation of systems providing better search experiences to web users. For example, systems that retrieve entities instead of just documents have been built. In this paper we present an approach for large-scale Entity Retrieval using web collections as underlying corpus. We propose an architecture for entity extraction and entity ranking starting from web documents. This is obtained (1) using an existing web document index and (2) creating an entity centric index. We describe advantages and feasibility of our approach using state-of-the-art tools. Keywords-entity retrieval; web search; natural language processing;

Gianluca Demartini, Claudiu S. Firan, Mihai George

Real-time Traffic

Entity Centric Index | Human Computer Interaction | Internet Technology | Large-scale Entity Retrieval | LAWEB 2009 | Web Document |

claim paper

» Using Propagation of Distrust to Find Untrustworthy Web Neighborhoods

» Ranking Entities Using Web Search Query Logs

» From Web Data to Entities and Back

» Finding authoritative people from the web

» A General Architecture for Finding Structural Regularities on the Web

» OKKAM Enabling a Web of Entities

» Answering relationship queries on the web

» Who is Who and What is What Experiments in CrossDocument CoReference

Post Info
More Details (n/a)

Added	24 May 2010
Updated	24 May 2010
Type	Conference
Year	2009
Where	LAWEB
Authors	Gianluca Demartini, Claudiu S. Firan, Mihai Georgescu, Tereza Iofciu, Ralf Krestel, Wolfgang Nejdl

Comments (0)

Sciweavers

An Architecture for Finding Entities on the Web

Entity Centric Index | Human Computer Interaction | Internet Technology | Large-scale Entity Retrieval | LAWEB 2009 | Web Document |

Explore & Download

Productivity Tools

Sciweavers