Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

137

BTW
2007
Springer

122views Database» more BTW 2007»

YAWN: A Semantically Annotated Wikipedia XML Corpus

16 years 1 months ago

YAWN: A Semantically Annotated Wikipedia XML Corpus

Download www.btw2007.de

: The paper presents YAWN, a system to convert the well-known and widely used Wikipedia collection into an XML corpus with semantically rich, self-explaining tags. We introduce algorithms to annotate pages and links with concepts from the WordNet thesaurus. This annotation process exploits categorical information in Wikipedia, which is a high-quality, manually assigned source of information, extracts additional information from lists, and utilizes the invocations of templates with named parameters. We give examples how such annotations can be exploited for high-precision queries.

Ralf Schenkel, Fabian M. Suchanek, Gjergji Kasneci

Real-time Traffic

BTW 2007 | Extracts Additional Information | Paper Presents Yawn | WordNet Thesaurus |

claim paper

Related Content

» Anaphoric Annotation of Wikipedia and Blogs in the Live Memories Corpus

» Semantically Annotated Snapshot of the English Wikipedia

» WikiWoods SyntactoSemantic Annotation for English Wikipedia

» Wikicorpus A WordSense Disambiguated Multilingual Wikipedia Corpus

» Annotating wikipedia articles with semantic tags for structured retrieval

» Coarse Lexical Semantic Annotation with Supersenses An Arabic Case Study

» Learning to Tag and Tagging to Learn A Case Study on Wikipedia

» A Corpus Representation Format for Linguistic Web Services The DSPIN Text Corpus Format an...

» ANAWIKI Creating Anaphorically Annotated Resources through Web Cooperation

Post Info
More Details (n/a)

Added	07 Jun 2010
Updated	07 Jun 2010
Type	Conference
Year	2007
Where	BTW
Authors	Ralf Schenkel, Fabian M. Suchanek, Gjergji Kasneci

Comments (0)