Sciweavers

76 search results - page 8 / 16
» Locating Charts from Scanned Document Pages
Sort
View
ICDAR
2003
IEEE
14 years 1 months ago
Correcting Document Image Warping Based on Regression of Curved Text Lines
Image warping is a common problem when one scans or photocopies a document page from a thick bound volume, resulting in shading and curved text lines in the spine area of the boun...
Zheng Zhang 0003, Chew Lim Tan
DGO
2010
173views Education» more  DGO 2010»
13 years 9 months ago
Digital sustainable publication of legacy parliamentary proceedings
We address the problem of publishing parliamentary proceedings in a digital sustainable manner. We give an extensive requirements analysis, and based on that propose a uniform XML...
Maarten Marx, Nelleke Aders, Anne Schuth
DOCENG
2009
ACM
14 years 2 months ago
Object-level document analysis of PDF files
The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...
Tamir Hassan
ICDM
2002
IEEE
138views Data Mining» more  ICDM 2002»
14 years 20 days ago
Extraction Techniques for Mining Services from Web Sources
The Web has established itself as the dominant medium for doing electronic commerce. Consequently the number of service providers, both large and small, advertising their services...
Hasan Davulcu, Saikat Mukherjee, I. V. Ramakrishna...
BMCBI
2011
12 years 11 months ago
Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library
Background: The Biodiversity Heritage Library (BHL) is a large digital archive of legacy biological literature, comprising over 31 million pages scanned from books, monographs, an...
Roderic D. M. Page