document analysis | Sciweavers

231

DAS
2008
Springer

102views Document Analysis» more DAS 2008»

The Convergence of Iterated Classification

15 years 9 months ago

We report an improved methodology for training a sequence of classifiers for document image content extraction, that is, the location and segmentation of regions containing handwr...

Chang An, Henry S. Baird

claim paper

Read More »

199

click to vote

DAS
2008
Springer

129views Document Analysis» more DAS 2008»

A Graphics Image Processing System

15 years 9 months ago

Download www.comp.nus.edu.sg

Patent document images maintained by the U.S. patent database have a specific format, in which figures and text descriptions are separated into different sections. This makes it d...

Linlin Li, Chew Lim Tan

claim paper

Read More »

197

click to vote

DOCENG
2007
ACM

105views Document Analysis» more DOCENG 2007»

Editing with style

15 years 9 months ago

Download hal.archives-ouvertes.fr

HTML has popularized the use of style sheets, and the advent of XML has stressed the importance of style as a key area complementing document structure and content. A number of to...

Vincent Quint, Irène Vatton

claim paper

Read More »

209

click to vote

DAS
2010
Springer

251views Document Analysis» more DAS 2010»

Overlapped text segmentation using Markov random field and aggregation

15 years 9 months ago

Download www.visionopen.com

Separating machine printed text and handwriting from overlapping text is a challenging problem in the document analysis field and no reliable algorithms have been developed thus f...

Xujun Peng, Srirangaraj Setlur, Venu Govindaraju, ...

claim paper

Read More »

204

click to vote

DAS
2010
Springer

168views Document Analysis» more DAS 2010»

Investigator name recognition from medical journal articles: a comparative study of SVM and structural SVM

15 years 9 months ago

Download lhncbc.nlm.nih.gov

Automated extraction of bibliographic information from journal articles is key to the affordable creation and maintenance of citation databases, such as MEDLINE

Xiaoli Zhang, Jie Zou, Daniel X. Le, George R. Tho...

claim paper

Read More »

190

click to vote

DOCENG
2005
ACM

129views Document Analysis» more DOCENG 2005»

Managing syntactic variation in text retrieval

15 years 9 months ago

Download coleweb.dc.fi.udc.es

Information Retrieval systems are limited by the linguistic variation of language. The use of Natural Language Processing techniques to manage this problem has been studied for a ...

Jesús Vilares, Carlos Gómez-Rodr&iac...

claim paper

Read More »

217

Voted

DOCENG
2005
ACM

121views Document Analysis» more DOCENG 2005»

Generative semantic clustering in spatial hypertext

15 years 9 months ago

Download ecologylab.cs.tamu.edu

This paper presents an iterative method for generative semantic clustering of related information elements in spatial hypertext documents. The goal is to automatically organize th...

Andruid Kerne, Eunyee Koh, Vikram Sundaram, J. Mic...

claim paper

Read More »

191

click to vote

DOCENG
2005
ACM

100views Document Analysis» more DOCENG 2005»

Enhancing composite digital documents using XML-based standoff markup

15 years 9 months ago

Download eprints.nottingham.ac.uk

Document representations can rapidly become unwieldy if they try to encapsulate all possible document properties, ranging tract structure to detailed rendering and layout. We pres...

Peter L. Thomas, David F. Brailsford

claim paper

Read More »

157

click to vote

DOCENG
2005
ACM

99views Document Analysis» more DOCENG 2005»

A web-based document harmonization and annotation chain: from PDF to RDF

15 years 9 months ago