Search Sciweavers | Sciweavers

82 search results - page 2 / 17

» A search engine for imaged documents in PDF files


DAS
2006
Springer

129 views, Document Analysis

A System for Converting PDF Documents into Structured XML Format

14 years 1 month ago

Download www.xrce.xerox.com

We present in this paper a system for converting PDF legacy documents into structured XML format. This conversion system first extracts the different streams contained in PDF files...

Hervé Déjean, Jean-Luc Meunier


DAS
2006
Springer

202 views, Document Analysis

XCDF: A Canonical and Structured Document Format

13 years 11 months ago

Download www.bloechle.ch

Accessing the structured content of PDF documents is a difficult task, requiring pre-processing and reverse engineering techniques. In this paper, we first present different methods...

Jean-Luc Bloechle, Maurizio Rigamonti, Karim Hadja...


ICDIM
2006
IEEE

97 views, Information Technology

A Framework for the Encoding of Multilayered Documents

14 years 3 months ago

Download www.bibalex.org

Electronic publishing of material digitized using imaging and OCR calls for a special delivery format capable of reconstructing original documents in a well-usable electronic form...

Youssef Eldakar, Noha Adly, Magdy Nagi


ERCIMDL
2010
Springer

180 views, Education

SciPlore Xtract: Extracting Titles from Scientific PDF Documents by Analyzing Style Information (Font Size)

13 years 7 months ago

Download www.sciplore.org

Extracting titles from a PDF's full text is an important task in information retrieval to identify PDFs. Existing approaches apply complicated and expensive (in terms of calculating...

Jöran Beel, Bela Gipp, Ammar Shaker, Nick Fri...


LREC
2008

113 views, Education

Integration of a Multilingual Keyword Extractor in a Document Management System

13 years 11 months ago

Download www.lrec-conf.org

In this paper we present a new Document Management System called DrStorage. This DMS is multi-platform, JCR-170 compliant, supports WebDav, versioning, user authentication and aut...

Andrea Agili, Marco Fabbri, Alessandro Panunzi, Ma...
