Sciweavers

23 search results - page 4 / 5
EMNLP
2010
13 years 5 months ago
Evaluating Models of Latent Document Semantics in the Presence of OCR Errors
Models of latent document semantics such as the mixture of multinomials model and Latent Dirichlet Allocation have received substantial attention for their ability to discover top...
Daniel David Walker, William B. Lund, Eric K. Ring...
ICDAR
2005
IEEE
14 years 28 days ago
OCR Based Slide Retrieval
This work addresses the problem of acquiring, indexing and retrieving slides in the context of automatic oral presentation processing. Since the most suitable acquisition techniqu...
N. Daddaoua, Jean-Marc Odobez, Alessandro Vinciare...
ICDAR
2007
IEEE
14 years 1 month ago
Toponym Recognition in Scanned Color Topographic Maps
Topographic paper maps are a common support for geographical information. In the field of document analysis of this kind of support, this paper proposes an automatic approach to ...
Joachim Pouderoux, Jean-Christophe Gonzato, A. Per...
AND
2010
13 years 5 months ago
Document: a useful level for facing noisy data
In this paper we will present a set of experiments using large digitalized collections of books to show that logical structures can be extracted with good quality when working at ...
Hervé Déjean, Jean-Luc Meunier
CIKM
2001
Springer
13 years 12 months ago
Improved String Matching Under Noisy Channel Conditions
Many document-based applications, including popular Web browsers, email viewers, and word processors, have a ‘Find on this Page’ feature that allows a user to find every occur...
Kevyn Collins-Thompson, Charles Schweizer, Susan T...