Sciweavers

ICDAR
2003
IEEE
14 years 5 months ago
Character Recognition by Adaptive Statistical Similarity
Handwriting recognition and OCR systems need to cope with a wide variety of writing styles and fonts, many of them possibly not previously encountered during training. This paper d...
Thomas M. Breuel
ICDAR
2003
IEEE
14 years 5 months ago
A Novel Approach to Separate Handwritten Connected Digits
This paper presents a novel approach to separate connected digits in handwritten numerals by employing two agents in the process. The first agent decides on candidate cut-point a...
Reda Alhajj, Ashraf Elnagar
ICDAR
2003
IEEE
14 years 5 months ago
Structured and Unstructured Document Summarization: Design of a Commercial Summarizer using Lexical Chains
The process of summarizing documents is becoming increasingly important in the light of recent advances in document creation/distribution technology, and the resulting influx of l...
Hassan Alam, Aman Kumar, Mikako Nakamura, Ahmad Fu...
ICDAR
2003
IEEE
14 years 5 months ago
Web Page Summarization for Handheld Devices: A Natural Language Approach
Summarization of web pages is a very interesting topic from both academic and commercial point of view. Academically, it is challenging to create a summary of a document (e.g. a w...
Hassan Alam, Rachmat Hartono, Aman Kumar, Ahmad Fu...
ICDAR
2003
IEEE
14 years 5 months ago
Evaluating SEE - A Benchmarking System for Document Page Segmentation
The decomposition of a document into segments such as text regions and graphics is a significant part of the document analysis process. The basic requirement for rating and impro...
Stefan Agne, Andreas Dengel, Bertin Klein
ICDAR
2003
IEEE
14 years 5 months ago
Automatic Feature Selection with Applications to Script Identification of Degraded Documents
Current approaches to script identification rely on hand-selected features and often require processing a significant part of the document to achieve reliable identification. We p...
Vitaly Ablavsky, Mark R. Stevens
ICDAR
2003
IEEE
14 years 5 months ago
Using tree-grammars for training set expansion in page classification
Stefano Baldi, Simone Marinai, Giovanni Soda
NLDB
2004
Springer
14 years 5 months ago
A Flexible Workbench for Document Analysis and Text Mining
Abstract: Document analysis and text mining techniques are used to preprocess documents in information retrieval systems, to extract concepts in ontology construction processes, an...
Jon Atle Gulla, Terje Brasethvik, Harald Kaada
DAS
2004
Springer
14 years 5 months ago
Multi-component Document Image Coding Using Regions-of-Interest
Xiao Wei Yin, Andy C. Downton, Martin Fleury, Jing...
DAS
2004
Springer
14 years 5 months ago
Rule-Based Structural Analysis of Web Pages
Structural analysis of web pages has been proposed several times and for a number of reasons and purposes, such as the re-flowing of standard web pages to fit a smaller PDA screen....
Fabio Vitali, Angelo Di Iorio, Elisa Ventura Campo...