Sciweavers

WEBDB
1998
Springer

Extracting Patterns and Relations from the World Wide Web

The World Wide Web is a vast resource for information. At the same time, it is extremely distributed. A particular type of data, such as restaurant lists, may be scattered across thousands of independent information sources in many different formats. In this paper, we consider the problem of extracting a relation for such a data type from all of these sources automatically. We present a technique which exploits the duality between sets of patterns and relations to grow the target relation starting from a small sample. To test our technique, we use it to extract a relation of (author, title) pairs from the World Wide Web.
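
The pattern/relation duality described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes a toy corpus of one-record-per-line strings, treats a pattern as a hypothetical (prefix, middle, suffix) triple of literal text around an author and a title, and uses invented names (`induce_patterns`, `apply_patterns`, `grow_relation`, `TOY_LINES`, `SEED`). Known pairs yield patterns, patterns yield new pairs, and the loop repeats until the relation stops growing.

```python
# Illustrative sketch only: literal (prefix, middle, suffix) patterns over a toy corpus.
# All names and data here are assumptions made for the example.

def induce_patterns(lines, pairs):
    """From known (author, title) pairs, record the text surrounding each occurrence."""
    patterns = set()
    for line in lines:
        for author, title in pairs:
            a = line.find(author)
            if a < 0:
                continue
            t = line.find(title, a + len(author))
            if t < 0:
                continue
            middle = line[a + len(author):t]
            if middle:  # require some separator between author and title
                patterns.add((line[:a], middle, line[t + len(title):]))
    return patterns


def apply_patterns(lines, patterns):
    """Use the patterns to pull (author, title) pairs out of matching lines."""
    found = set()
    for line in lines:
        for prefix, middle, suffix in patterns:
            if not (line.startswith(prefix) and line.endswith(suffix)):
                continue
            core = line[len(prefix):len(line) - len(suffix)]
            author, sep, title = core.partition(middle)
            if sep and author and title:
                found.add((author, title))
    return found


def grow_relation(lines, seed, max_rounds=5):
    """Alternate between inducing patterns and extracting pairs until nothing new appears."""
    relation = set(seed)
    for _ in range(max_rounds):
        patterns = induce_patterns(lines, relation)
        grown = relation | apply_patterns(lines, patterns)
        if grown == relation:
            break
        relation = grown
    return relation


# Toy corpus: two formats, linked by one pair that appears in both.
TOY_LINES = [
    "<li>Isaac Asimov, The Robots of Dawn</li>",
    "<li>Charles Dickens, Great Expectations</li>",
    "<b>Charles Dickens</b> wrote <i>Great Expectations</i>",
    "<b>William Shakespeare</b> wrote <i>The Comedy of Errors</i>",
    "Unrelated line of text",
]
SEED = {("Isaac Asimov", "The Robots of Dawn")}

if __name__ == "__main__":
    for pair in sorted(grow_relation(TOY_LINES, SEED)):
        print(pair)
```

In this toy run, the single seed pair yields a pattern for the first format, which finds a second pair; that pair also occurs in the second format, so a new pattern is induced there and a third pair is found. A realistic system would need generalized patterns and some guard against overly permissive ones, which this sketch omits.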
Sergey Brin
Added 06 Aug 2010
Updated 06 Aug 2010
Type Conference
Year 1998
Where WEBDB
Authors Sergey Brin