Sciweavers

8316 search results - page 136 / 1664
» Web Document Modeling
Sort
View
106
Voted
ICPR
2010
IEEE
15 years 5 months ago
Topic-Sensitive Tag Ranking
Social tagging is an increasingly popular way to describe and classify documents on the web. However, the quality of the tags varies considerably since the tags are authored freel...
Yan'An Jin, Ruixuan Li, Zhengding Lu, Kunmei Wen, ...
COLING
2010
14 years 9 months ago
Disambiguating Dynamic Sentiment Ambiguous Adjectives
Dynamic sentiment ambiguous adjectives (DSAAs) like "large, small, high, low"pose a challenging task on sentiment analysis. This paper proposes a knowledge-based method ...
Yunfang Wu, Miaomiao Wen
126
Voted
ADCS
2004
15 years 4 months ago
Co-Training on Textual Documents with a Single Natural Feature Set
Co-training is a semi-supervised technique that allows classifiers to learn with fewer labelled documents by taking advantage of the more abundant unclassified documents. However, ...
Jason Chan, Irena Koprinska, Josiah Poon
121
Voted
ICDAR
2009
IEEE
15 years 8 days ago
Learning on the Fly: Font-Free Approaches to Difficult OCR Problems
Despite ubiquitous claims that optical character recognition (OCR) is a "solved problem," many categories of documents continue to break modern OCR software such as docu...
Andrew Kae, Erik G. Learned-Miller
MAICS
2004
15 years 3 months ago
Intelligent Content Based Title and Author Name Extraction from Formatted Documents
This paper describes the development of algorithms for extracting the title and the names of the authors from documents available on the World Wide Web. In this paper we describe ...
Eric G. Berkowitz, Mohamed Reda Elkhadiri, Tim Sah...