A New Approach towards Bibliographic Reference Identification, Parsing and Inline Citation Matching


A number of algorithms and approaches have been proposed for the problem of scanning and digitizing research papers. Past work falls into three major approaches: regular-expression-based heuristics, learning-based algorithms, and knowledge-based systems. Our findings point to the inadequacy of existing open-source solutions such as Paracite for papers with "micro-citations" in various European languages. This paper describes work done as part of the Google Summer of Code 2008, which combines regular-expression-based heuristics and knowledge-based systems to build a system that matches inline citations to their corresponding bibliographic references and identifies and extracts metadata from those references. We present the description, implementation, and results of our approach, which improves accuracy and recognition rates.
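To make the idea of regular-expression-based citation matching concrete, here is a minimal sketch in Python. It is not the authors' implementation: the patterns `INLINE` and `REFERENCE`, the helper names `parse_reference` and `match_citations`, and the assumed "Authors Year. Title." reference layout are all hypothetical choices made for illustration, and the sample strings are invented rather than taken from the paper's data.

```python
import re

# Hypothetical surname pattern: Unicode letters, with optional hyphen or
# apostrophe parts, so names such as "Müller" or "Saint-Hilaire" are accepted.
SURNAME = r"[^\W\d_]+(?:[-'][^\W\d_]+)*"
YEAR = r"(?:1[6-9]|20)\d{2}"

# Assumed inline form: "Author (1999)", "Author, 1999", "Author & Other 1999",
# or "Author et al. 1999". Real papers use many more variants.
INLINE = re.compile(
    rf"(?P<author>{SURNAME})"
    rf"(?:\s+(?:&|and)\s+{SURNAME}|\s+et\s+al\.?)?"
    rf",?\s+\(?(?P<year>{YEAR})[a-z]?\)?"
)

# Assumed reference form: "Authors Year. Title. Rest", with the year
# optionally in parentheses.
REFERENCE = re.compile(
    rf"^(?P<authors>.+?)\s*\(?(?P<year>{YEAR})[a-z]?\)?[.:,]?\s*(?P<title>[^.]+)\."
)


def parse_reference(ref):
    """Extract author, year, and title metadata, or None if the layout differs."""
    m = REFERENCE.match(ref)
    if m is None:
        return None
    authors = m["authors"].strip()
    # Treat the text before the first comma or ampersand as the first surname.
    first_author = re.split(r"[,&]", authors)[0].strip()
    return {
        "first_author": first_author,
        "authors": authors,
        "year": m["year"],
        "title": m["title"].strip(),
    }


def match_citations(text, references):
    """Return (author, year, metadata-or-None) for each inline citation found."""
    index = {}
    for ref in references:
        meta = parse_reference(ref)
        if meta is not None:
            index[(meta["first_author"].casefold(), meta["year"])] = meta
    results = []
    for m in INLINE.finditer(text):
        # Skip lowercase words followed by a year, e.g. "in 1758".
        if not m["author"][0].isupper():
            continue
        key = (m["author"].casefold(), m["year"])
        results.append((m["author"], m["year"], index.get(key)))
    return results
```

Calling `match_citations(body_text, reference_list)` pairs each detected author-year citation with the parsed reference that shares its first-author surname and year, or with `None` when no reference matches. Keying on surname and year alone is a deliberate simplification; a knowledge-based component, as the abstract describes, would be needed to handle abbreviations, micro-citations, and language-specific formats.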

Deepank Gupta, Bob Morris, Terry Catapano, Guido Sautter


Applied Computing | Corresponding Bibliographic References | IC3 2009 | Regular-expression Based Heuristics | Various European Languages


Post Info

Added	18 Feb 2011
Updated	18 Feb 2011
Type	Journal
Year	2009
Where	IC3
Authors	Deepank Gupta, Bob Morris, Terry Catapano, Guido Sautter
