Sciweavers

898 search results - page 93 / 180
» Making Documents Work: Challenges for Document Understanding
Sort
View
PRICAI
2004
Springer
15 years 9 months ago
Coherent Arrangement of Sentences Extracted from Multiple Newspaper Articles
Multi-document summarization is a challenge to information overload problem to provide a condensed text for a number of documents. Most multi-document summarization systems make u...
Naoaki Okazaki, Yutaka Matsuo, Mitsuru Ishizuka
CORR
2002
Springer
96views Education» more  CORR 2002»
15 years 3 months ago
Thumbs up? Sentiment Classification using Machine Learning Techniques
We consider the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative. Using movie reviews as data, w...
Bo Pang, Lillian Lee, Shivakumar Vaithyanathan
SIGMOD
2009
ACM
269views Database» more  SIGMOD 2009»
16 years 4 months ago
Efficient approximate entity extraction with edit distance constraints
Named entity recognition aims at extracting named entities from unstructured text. A recent trend of named entity recognition is finding approximate matches in the text with respe...
Wei Wang 0011, Chuan Xiao, Xuemin Lin, Chengqi Zha...
EUROSYS
2009
ACM
16 years 1 months ago
Isolating web programs in modern browser architectures
Many of today’s web sites contain substantial amounts of client-side code, and consequently, they act more like programs than simple documents. This creates robustness and perfo...
Charles Reis, Steven D. Gribble
SOUPS
2009
ACM
15 years 10 months ago
Sanitization's slippery slope: the design and study of a text revision assistant
For privacy reasons, sensitive content may be revised before it is released. The revision often consists of redaction, that is, the “blacking out” of sensitive words and phras...
Richard Chow, Ian Oberst, Jessica Staddon