Sciweavers

64 search results - page 10 / 13
» Entity summarization of news articles
Sort
View
ACL
2003
13 years 10 months ago
tRuEcasIng
Truecasing is the process of restoring case information to badly-cased or noncased text. This paper explores truecasing issues and proposes a statistical, language modeling based ...
Lucian Vlad Lita, Abraham Ittycheriah, Salim Rouko...
ICML
2010
IEEE
13 years 9 months ago
A Language-based Approach to Measuring Scholarly Impact
Identifying the most influential documents in a corpus is an important problem in many fields, from information science and historiography to text summarization and news aggregati...
Sean Gerrish, David M. Blei
LREC
2008
106views Education» more  LREC 2008»
13 years 10 months ago
A Corpus for Cross-Document Co-reference
This paper describes a newly created text corpus of news articles that has been annotated for cross-document co-reference. Being able to robustly resolve references to entities ac...
David Day, Janet Hitzeman, Michael L. Wick, Keith ...
COLING
2010
13 years 3 months ago
Resolving Surface Forms to Wikipedia Topics
Ambiguity of entity mentions and concept references is a challenge to mining text beyond surface-level keywords. We describe an effective method of disambiguating surface forms an...
Yiping Zhou, Lan Nie, Omid Rouhani-Kalleh, Flavian...
DEXAW
2010
IEEE
202views Database» more  DEXAW 2010»
13 years 9 months ago
Identifying Sentence-Level Semantic Content Units with Topic Models
Abstract--Statistical approaches to document content modeling typically focus either on broad topics or on discourselevel subtopics of a text. We present an analysis of the perform...
Leonhard Hennig, Thomas Strecker, Sascha Narr, Ern...