Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

175

ICDAR
2009
IEEE

214views Document Analysis» more ICDAR 2009»

Metadata Extraction from PDF Papers for Digital Library Ingest

16 years 1 months ago

Metadata Extraction from PDF Papers for Digital Library Ingest

Download www.cvc.uab.es

In this paper we analyze our recent research on the use of document analysis techniques for metadata extraction from PDF papers. We describe a package that is designed to extract basic metadata from these documents. The package is used in combination with a digital library software suite to easily build personal digital libraries. The proposed software is based on a suitable combination of several techniques that include PDF parsing, low level document image processing, and layout analysis. In addition, we use the information gathered from a widely known citation database (DBLP) to assist the tool in the difﬁcult task of author identiﬁcation. The system is tested on some paper collections selected from recent conference proceedings.

Simone Marinai

Real-time Traffic

Document Analysis | Document Analysis Techniques | ICDAR 2009 | Metadata Extraction | Personal Digital Libraries |

claim paper

Related Content

» Retrieving Metadata for Your Local Scholarly Papers

» Automatic extraction of table metadata from digital documents

» A Blueprint for Representation Information in the OAIS Model

» ChemXSeer a digital library and data repository for chemical kinetics

» Extracting scientific articles from a large digital archive BioStor and the Biodiversity H...

» Automatic Document Metadata Extraction Using Support Vector Machines

» Hebbian Algorithms for a Digital Library Recommendation System

» A Dynamic Feature Generation System for Automated Metadata Extraction in Preservation of D...

» Extracting Author MetaData from Web Using Visual Features

Post Info
More Details (n/a)

Added	21 May 2010
Updated	21 May 2010
Type	Conference
Year	2009
Where	ICDAR
Authors	Simone Marinai

Comments (0)