Search Sciweavers | Sciweavers

140

AIRS
2006
Springer

96views Information Technology» more AIRS 2006»

Learning to Separate Text Content and Style for Classification

15 years 6 months ago

Many text documents naturally have two kinds of labels. For example, we may label web pages from universities according to their categories, such as "student" or "fa...

Dell Zhang, Wee Sun Lee

claim paper

Read More »

130

click to vote

CIKM
2006
Springer

138views Information Technology» more CIKM 2006»

A document-centric approach to static index pruning in text retrieval systems

15 years 6 months ago

Download stefan.buettcher.org

We present a static index pruning method, to be used in ad-hoc document retrieval tasks, that follows a documentcentric approach to decide whether a posting for a given term shoul...

Stefan Büttcher, Charles L. A. Clarke

claim paper

Read More »

133

Voted

CIKM
2006
Springer

132views Information Technology» more CIKM 2006»

Text classification improved through multigram models

15 years 6 months ago

Download research.microsoft.com

Classification algorithms and document representation approaches are two key elements for a successful document classification system. In the past, much work has been conducted to...

Dou Shen, Jian-Tao Sun, Qiang Yang, Zheng Chen

claim paper

Read More »

120

click to vote

ICDAR
2003
IEEE

149views Document Analysis» more ICDAR 2003»

Rectifying the Bound Document Image Captured by the Camera: A Model Based Approach

15 years 8 months ago

Download www.cse.salford.ac.uk

A model based approach for rectifying the camera image of the bound document has been developed, i.e., the surface of the document is represented by a general cylindrical surface....

Huaigu Cao, Xiaoqing Ding, Changsong Liu

claim paper

Read More »

129

click to vote

SIGIR
2004
ACM

136views Information Technology» more SIGIR 2004»

Constructing a text corpus for inexact duplicate detection

15 years 8 months ago

Download www.conradweb.org

As online document collections continue to expand, both on the Web and in proprietary environments, the need for duplicate detection becomes more critical. The goal of this work i...

Jack G. Conrad, Cindy P. Schriber

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers