Automatically refining the wikipedia infobox ontology

16 years 8 months ago

Download www2008.org

The combined efforts of human volunteers have recently extracted numerous facts from Wikipedia, storing them as machine-harvestable object-attribute-value triples in Wikipedia infoboxes. Machine learning systems, such as Kylin, use these infoboxes as training data, accurately extracting even more semantic knowledge from natural language text. But in order to realize the full power of this information, it must be situated in a cleanly-structured ontology. This paper introduces KOG, an autonomous system for refining Wikipedia's infobox-class ontology towards this end. We cast the problem of ontology refinement as a machine learning problem and solve it using both SVMs and a more powerful joint-inference approach expressed in Markov Logic Networks. We present experiments demonstrating the superiority of the joint-inference approach and evaluating other aspects of our system. Using these techniques, we build a rich ontology, integrating Wikipedia's infobox-class schemata with Wo...

Fei Wu, Daniel S. Weld

Real-time Traffic

Cleanly-structured Ontology | Internet Technology | Ontology Refinement | Rich Ontology | WWW 2008 |

claim paper

Added	21 Nov 2009
Updated	21 Nov 2009
Type	Conference
Year	2008
Where	WWW
Authors	Fei Wu, Daniel S. Weld

Sciweavers

Automatically refining the wikipedia infobox ontology

Cleanly-structured Ontology | Internet Technology | Ontology Refinement | Rich Ontology | WWW 2008 |

Explore & Download

Productivity Tools

Sciweavers