Autonomously semantifying wikipedia

16 years 25 days ago

Download turing.cs.washington.edu

Berners-Lee’s compelling vision of a Semantic Web is hindered by a chicken-and-egg problem, which can be best solved by a bootstrapping method — creating enough structured data to motivate the development of applications. This paper argues that autonomously “Semantifying Wikipedia” is the best way to solve the problem. We choose Wikipedia as an initial data source, because it is comprehensive, not too large, high-quality, and contains enough manuallyderived structure to bootstrap an autonomous, self-supervised process. We identify several types of structures which can be automatically enhanced in Wikipedia (e.g., link structure, taxonomic data, infoboxes, etc.), and we describe a prototype implementation of a self-supervised, machine learning system which realizes our vision. Preliminary experiments demonstrate the high precision of our system’s extracted data — in one case equaling that of humans. Categories and Subject Descriptors H.4 [Information Systems Applications]: ...

Fei Wu, Daniel S. Weld

Real-time Traffic

Berners-Lee’s Compelling Vision | CIKM 2007 | Initial Data Source | Semantic Web |

claim paper

Added	07 Jun 2010
Updated	07 Jun 2010
Type	Conference
Year	2007
Where	CIKM
Authors	Fei Wu, Daniel S. Weld

Sciweavers

Autonomously semantifying wikipedia

Berners-Lee’s Compelling Vision | CIKM 2007 | Initial Data Source | Semantic Web |

Explore & Download

Productivity Tools

Sciweavers