Sciweavers

2827 search results - page 92 / 566
» Marking Text Documents
AIRS
2006
Springer
13 years 12 months ago
Learning to Separate Text Content and Style for Classification
Many text documents naturally have two kinds of labels. For example, we may label web pages from universities according to their categories, such as "student" or "fa...
Dell Zhang, Wee Sun Lee
CIKM
2006
ACM
13 years 12 months ago
A document-centric approach to static index pruning in text retrieval systems
We present a static index pruning method, to be used in ad-hoc document retrieval tasks, that follows a document-centric approach to decide whether a posting for a given term shoul...
Stefan Büttcher, Charles L. A. Clarke
CIKM
2006
ACM
13 years 12 months ago
Text classification improved through multigram models
Classification algorithms and document representation approaches are two key elements for a successful document classification system. In the past, much work has been conducted to...
Dou Shen, Jian-Tao Sun, Qiang Yang, Zheng Chen
ICDAR
2003
IEEE
14 years 1 month ago
Rectifying the Bound Document Image Captured by the Camera: A Model Based Approach
A model based approach for rectifying the camera image of the bound document has been developed, i.e., the surface of the document is represented by a general cylindrical surface....
Huaigu Cao, Xiaoqing Ding, Changsong Liu
SIGIR
2004
ACM
14 years 1 month ago
Constructing a text corpus for inexact duplicate detection
As online document collections continue to expand, both on the Web and in proprietary environments, the need for duplicate detection becomes more critical. The goal of this work i...
Jack G. Conrad, Cindy P. Schriber