Sciweavers

502 search results - page 21 / 101
» Extracting Partial Structures from HTML Documents
Sort
View
KES
2004
Springer
14 years 1 months ago
Knowledge Extraction from Semi-structured Data Based on Fuzzy Techniques
Abstract. In this work we propose a fuzzy technique to compare XML documents belonging to a semi-structured flow and sharing a common vocabulary of tags. Our approach is based on t...
Paolo Ceravolo, Maria Cristina Nocerino, Marco Viv...
ICDAR
2007
IEEE
13 years 10 months ago
Example-Based Logical Labeling of Document Title Page Images
This paper presents a flexible and effective examplebased approach for labeling title pages which can be used for automated extraction of bibliographic data. The labels of intere...
Joost van Beusekom, Daniel Keysers, Faisal Shafait...
MVA
1990
13 years 9 months ago
Recognition of Document Structure on the Basis of Spatial and Geometric Relationships between Document Items
This paper introduces a new method to extract and classify the meaningful information from documents automatically. The basic idea in our method is to utilize the spatial and geom...
Qin Luo, Toyohide Watanabe, Yuuji Yoshida, Yasuyos...
IAJIT
2008
123views more  IAJIT 2008»
13 years 8 months ago
Vectorial Information Structuring for Documents Filtering and Diffusion
: Information retrieval tries to identify relevant documents for an information need. The problems that an IR system should deal with include document indexing (which tries to extr...
Omar Nouali, Abdelghani Krinah
DIAL
2004
IEEE
181views Image Analysis» more  DIAL 2004»
14 years 5 days ago
Forensic Handwritten Document Retrieval System
Document storage and retrieval capabilities of the CEDAR-FOX forensic handwritten document examination system are described. The system is designed for automated and semi-automate...
Sargur N. Srihari, Zhixin Shi