Popularity-Guided Top-k Extraction of Entity Attributes

16 years 3 days ago

Download webdb2010.org

Recent progress in information extraction technology has enabled a vast array of applications that rely on structured data that is embedded in natural-language text. In particular, the extraction of concepts from the Web—with their desired attributes—is important to provide applications with rich, structured access to information. In this paper, we focus on an important family of concepts, namely, entities (e.g., people or organizations) and their attributes, and study how to eﬃciently and eﬀectively extract them from Web-accessible text documents. Unfortunately, information extraction over the Web is challenging for both quality and eﬃciency reasons. Regarding quality, many sources on the Web contain misleading or invalid information; furthermore, extraction systems often return incorrect data. Regarding eﬃciency, information extraction is a time-consuming process, often involving expensive text-processing steps. We present a top-k extraction processing approach that addr...

Matthew Solomon, Cong Yu, Luis Gravano

Real-time Traffic

Extraction Processing Approach | Information Extraction | Internet Technology | Top-k Extraction Processing | WEBDB 2010 |

claim paper

Post Info
More Details (n/a)

Added	11 Jul 2010
Updated	11 Jul 2010
Type	Conference
Year	2010
Where	WEBDB
Authors	Matthew Solomon, Cong Yu, Luis Gravano

Comments (0)

Sciweavers

Popularity-Guided Top-k Extraction of Entity Attributes

Extraction Processing Approach | Information Extraction | Internet Technology | Top-k Extraction Processing | WEBDB 2010 |

Explore & Download

Productivity Tools

Sciweavers