PDF documents | Sciweavers

167

ICDAR
2009
IEEE

191views Document Analysis» more ICDAR 2009»

OCD: An Optimized and Canonical Document Format

15 years 4 months ago

Revealing and being able to manipulate the structured content of PDF documents is a difficult task, requiring pre-processing and reverse engineering techniques. In this paper, we ...

Jean-Luc Bloechle, Denis Lalanne, Rolf Ingold

claim paper

Read More »

162

click to vote

ICMLA
2008

116views Machine Learning» more ICMLA 2008»

Text, Image and Vector Graphics Based Appraisal of Contemporary Documents

15 years 8 months ago

Download isda.ncsa.uiuc.edu

We have designed a framework for content based appraisal of documents. Our motivation is to provide computer assisted support for answering several appraisal criteria according to...

Sang-Chul Lee, William McFadden, Peter Bajcsy

claim paper

Read More »

174

click to vote

DIAL
2006
IEEE

130views Image Analysis» more DIAL 2006»

Refinement of digitized documents through recognition of mathematical formulae

15 years 10 months ago

Download www.inftyproject.org

We are developing a recognition system, named `Infty', for scientific documents including those with mathematical formulae. In this paper, we propose a new system that can re...

Toshihiro Kanahori, Masakazu Suzuki

claim paper

Read More »

155

click to vote

DIAL
2004
IEEE

156views Image Analysis» more DIAL 2004»

Xed: A New Tool for eXtracting Hidden Structures from Electronic Documents

15 years 10 months ago

Download diuf.unifr.ch

PDF became a very common format for exchanging printable documents. Further, it can be easily generated from the major documents formats, which make a huge number of PDF documents...

Karim Hadjar, Maurizio Rigamonti, Denis Lalanne, R...

claim paper

Read More »

168

click to vote

DOCENG
2004
ACM

169views Document Analysis» more DOCENG 2004»

Creating structured PDF files using XML templates

16 years 3 hour ago

Download eprints.nottingham.ac.uk

This paper describes a tool for recombining the logical structure from an XML document with the typeset appearance of the corresponding PDF document. The tool uses the XML represe...

Matthew R. B. Hardy, David F. Brailsford, Peter L....

claim paper

Read More »

171

click to vote

DOCENG
2009
ACM

166views Document Analysis» more DOCENG 2009»

Object-level document analysis of PDF files

16 years 1 months ago

Download www.dbai.tuwien.ac.at

The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...

Tamir Hassan

claim paper

Read More »

166

Voted

MKM
2009
Springer

137views Information Technology» more MKM 2009»

A Linear Grammar Approach to Mathematical Formula Recognition from PDF

16 years 1 months ago

Download www.cs.bham.ac.uk

Many approaches have been proposed over the years for the recognition of mathematical formulae from scanned documents. More recently a need has arisen to recognise formulae from PD...

Josef B. Baker, Alan P. Sexton, Volker Sorge

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers