Search Sciweavers | Sciweavers

103 search results - page 4 / 21

» Models and Algorithms for Duplicate Document Detection

SIGIR 2006 (ACM)

84 views, Information Technology

Near-duplicate detection by instance-level constrained clustering

16 years 19 days ago

Download www.cs.cmu.edu

For the task of near-duplicated document detection, both traditional fingerprinting techniques used in database community and bag-of-word comparison approaches used in information...

Hui Yang, James P. Callan

ICDAR 2005 (IEEE)

112 views, Document Analysis

A Model for Detecting and Merging Vertically Spanned Table Cells in Plain Text Documents

16 years 9 days ago

Download web.science.mq.edu.au

A spanned cell in a table is a single, complete unit that physically occupies multiple columns and/or multiple rows. Spanned cells are common in tables, and they are a significan...

Vanessa Long, Robert Dale, Steve Cassidy

ICDAR 2003 (IEEE)

124 views, Document Analysis

A Line Drawings Degradation Model for Performance Characterization

15 years 12 months ago

Download www.cse.salford.ac.uk

Line detection algorithms constitute the basis for technical document analysis and recognition. The performance of these algorithms decreases as the quality of the documents degra...

Jian Zhai, Liu Wenyin, Dov Dori, Qing Li

CIVR 2007 (Springer)

273 views, Image Analysis

Scalable near identical image and shot detection

16 years 26 days ago

Download cmp.felk.cvut.cz

This paper proposes and compares two novel schemes for near duplicate image and video-shot detection. The first approach is based on global hierarchical colour histograms, using ...

Ondrej Chum, James Philbin, Michael Isard, Andrew ...

ICDAR 2003 (IEEE)

107 views, Document Analysis

A Model-based Line Detection Algorithm in Documents

15 years 12 months ago

Download www.cse.salford.ac.uk

In this paper we present a novel model-based approach to detect severely broken parallel lines in noisy textual documents. It is important to detect and remove these lines so the ...

Yefeng Zheng, Huiping Li, David S. Doermann
