Search Sciweavers | Sciweavers

843 search results - page 109 / 169

» Segmentation of Compressed Documents

199

click to vote

ICDAR
1997
IEEE

143views Document Analysis» more ICDAR 1997»

Representing OCRed documents in HTML

15 years 11 months ago

Download www.cedar.buffalo.edu

ABSTRACT: OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...

Tao Hong, Sargur N. Srihari

claim paper

Read More »

234

click to vote

ECCV
2008
Springer

185views Computer Vision» more ECCV 2008»

Learning Visual Shape Lexicon for Document Image Content Recognition

16 years 9 months ago

Download lampsrv02.umiacs.umd.edu

Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content catego...

Guangyu Zhu, Xiaodong Yu, Yi Li, David S. Doermann

claim paper

Read More »

206

click to vote

HT
2005
ACM

133views Internet Technology» more HT 2005»

As we may perceive: inferring logical documents from hypertext

16 years 19 days ago

Download www.cs.cornell.edu

In recent years, many algorithms for the Web have been developed that work with information units distinct from individual web pages. These include segments of web pages or aggreg...

Pavel Dmitriev, Carl Lagoze, Boris Suchkov

claim paper

Read More »

228

click to vote

SIGIR
2003
ACM

130views Information Technology» more SIGIR 2003»

Domain-independent text segmentation using anisotropic diffusion and dynamic programming

16 years 10 days ago

Download www-connex.lip6.fr

This paper presents a novel domain-independent text segmentation method, which identiﬁes the boundaries of topic changes in long text documents and/or text streams. The method c...

Xiang Ji, Hongyuan Zha

claim paper

Read More »

202

click to vote

CVPR
2010
IEEE

327views Computer Vision» more CVPR 2010»

Improving State-of-the-Art OCR through High-Precision Document-Specific Modeling

16 years 3 months ago

Download vis-www.cs.umass.edu

Optical character recognition (OCR) remains a difficult problem for noisy documents or documents not scanned at high resolution. Many current approaches rely on stored font models...

Andrew Kae, Gary Huang, Erik Learned-miller, Carl ...

claim paper

Read More »

« Prev « First page 109 / 169 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers