Search Sciweavers | Sciweavers

ECIR
2009
Springer

Revisiting N-Gram Based Models for Retrieval in Degraded Large Collections

16 years 3 months ago

The traditional retrieval models based on term matching are not effective in collections of degraded documents (output of OCR or ASR systems for instance). This paper presents a n...

Javier Parapar, Ana Freire, Alvaro Barreiro

ICDAR
2003
IEEE

A Case Restoration Approach to Named Entity Tagging in Degraded Documents

15 years 11 months ago

Download www.cse.salford.ac.uk

This paper describes a novel approach to named entity (NE) tagging on degraded documents. NE tagging is the process of identifying salient text strings in unstructured text, corre...

Rohini K. Srihari, Cheng Niu, Wei Li, Jihong Ding

ERCIMDL
2009
Springer

Improving OCR Accuracy for Classical Critical Editions

16 years 21 days ago

Download www.perseus.tufts.edu

This paper describes a workflow designed to populate a digital library of ancient Greek critical editions with highly accurate OCR scanned text. While the most recently available...

Federico Boschetti, Matteo Romanello, Alison Babeu...

ICDAR
2009
IEEE

Learning on the Fly: Font-Free Approaches to Difficult OCR Problems

15 years 3 months ago

Download www.cs.umass.edu

Despite ubiquitous claims that optical character recognition (OCR) is a "solved problem," many categories of documents continue to break modern OCR software such as docu...

Andrew Kae, Erik G. Learned-Miller

ICPR
2006
IEEE

CAPTCHA Challenge Tradeoffs: Familiarity of Strings versus Degradation of Images

16 years 7 months ago

Download www.cse.lehigh.edu

It is a well-documented fact that, for human readers, familiar text is more legible than unfamiliar text. Current-generation computer vision systems also are able to exploit some ...

Jon Louis Bentley, Sui-Yu Wang
