

TAGME: on-the-fly annotation of short text fragments (by Wikipedia entities)

We designed and implemented Tagme, a system that efficiently and judiciously augments plain text with pertinent hyperlinks to Wikipedia pages. What distinguishes Tagme from known systems [5, 8] is that it can annotate texts that are short and poorly composed, such as snippets of search-engine results, tweets, and news items. This annotation is extremely informative, so any task currently addressed with the bag-of-words paradigm could benefit from using it to draw upon (the millions of) Wikipedia pages and their inter-relations.
Categories and Subject Descriptors: I.2.7 [Artificial Intelligence]: Natural Language Processing - text analysis; H.3.1 [Information Storage and Retrieval]: Content Analysis and Indexing
General Terms: Algorithms, Experimentation, Performance
Paolo Ferragina, Ugo Scaiella
Added 09 Dec 2010
Updated 09 Dec 2010
Type Journal
Year 2010
Where CoRR