DRR 2003 | Sciweavers

176

DRR
2003

98views Document Analysis» more DRR 2003»

Information retrieval for OCR documents: a content-based probabilistic correction model

15 years 8 months ago

The difficulty with information retrieval for OCR documents lies in the fact that OCR documents comprise of a significant amount of erroneous words and unfortunately most informat...

Rong Jin, ChengXiang Zhai, Alexander G. Hauptmann

claim paper

Read More »

171

click to vote

DRR
2003

152views Document Analysis» more DRR 2003»

Document structure analysis algorithms: a literature survey

15 years 8 months ago

Download archive.nlm.nih.gov

Document structure analysis can be regarded as a syntactic analysis problem. The order and containment relations among the physical or logical components of a document page can be...

Song Mao, Azriel Rosenfeld, Tapas Kanungo

claim paper

Read More »

144

click to vote

DRR
2003

101views Document Analysis» more DRR 2003»

Header and footer extraction by page association

15 years 8 months ago

Download www.hpl.hp.com

Xiaofan Lin

claim paper

Read More »

194

click to vote

DRR
2003

102views Document Analysis» more DRR 2003»

Automated labeling of bibliographic data extracted from biomedical online journals

15 years 8 months ago

Download lhncbc.nlm.nih.gov

A prototype system has been designed to automate the extraction of bibliographic data (e.g., article title, authors, , affiliation and others) from online biomedical journals to p...

Jongwoo Kim, Daniel X. Le, George R. Thoma

claim paper

Read More »

191

click to vote

DRR
2003

117views Document Analysis» more DRR 2003»

Correcting OCR text by association with historical datasets

15 years 8 months ago

Download lhncbc.nlm.nih.gov

The Medical Article Records System (MARS) developed by the Lister Hill National Center for Biomedical Communications uses scanning, OCR and automated recognition and reformatting ...

Susan E. Hauser, Jonathan Schlaifer, Tehseen F. Sa...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers