Sciweavers

76 search results - page 12 / 16
» Locating Charts from Scanned Document Pages
Sort
View
ICDAR
1999
IEEE
14 years 4 hour ago
Preattentive Reading and Selective Attention for Document Image Analysis
PixED (from Pixel to Electronic Document) is aimed at converting document images into structured electronic documents which can be read by a machine for information retrieval. The...
Claudie Faure
IADIS
2004
13 years 9 months ago
Web Document Classification: Managing Context Change
This paper focuses on the information management systems of the dynamic World Wide Web. Many individual web pages, such as news portals, provide periodic information and public an...
Sung Sik Park, Yang Sok Kim, Byeong Ho Kang
ICDAR
2003
IEEE
14 years 1 months ago
Lexical Postcorrection of OCR-Results: The Web as a Dynamic Secondary Dictionary?
Postcorrection of OCR-results for text documents is usually based on electronic dictionaries. When scanning texts from a specific thematic area, conventional dictionaries often m...
Christian M. Strohmaier, Christoph Ringlstetter, K...
DEXA
2005
Springer
109views Database» more  DEXA 2005»
14 years 1 months ago
An XML Approach to Semantically Extract Data from HTML Tables
Abstract. Data intensive information is often published on the internet in the format of HTML tables. Extracting some of the information that is of users’ interest from the inter...
Jixue Liu, Zhuoyun Ao, Ho-Hyun Park, Yongfeng Chen
ICDAR
2003
IEEE
14 years 1 months ago
Progress in Camera-Based Document Image Analysis
The increasing availability of high performance, low priced, portable digital imaging devices has created a tremendous opportunity for supplementing traditional scanning for docum...
David S. Doermann, Jian Liang, Huiping Li