Sciweavers

249 search results - page 6 / 50
» Classification of Documents Based on the Structure of Their ...
Sort
View
IJCAI
2003
13 years 8 months ago
Expressive Power of Tree and String Based Wrappers
There exist two types of wrappers: the string based wrapper such as the LR wrapper, and the tree based wrapper. A tree based wrapper designates extraction regions by nodes on the ...
Daisuke Ikeda, Yasuhiro Yamada, Sachio Hirokawa
ICPR
2006
IEEE
14 years 8 months ago
Camera Text Recognition based on Perspective Invariants
As camera resolution increases, high-speed non-contact text capture through a digital camera is opening up a new channel for document capture and understanding. Unfortunately, per...
Chew Lim Tan, Shijian Lu
ICDAR
1999
IEEE
13 years 11 months ago
Structured Document Segmentation and Representation by the Modified X-Y tree
In this paper we describe a top-down approach to the segmentation and representation of documents containing tabular structures. Examples of these documents are invoices and techn...
Francesca Cesarini, Marco Gori, Simone Marinai, Gi...
WWW
2004
ACM
14 years 8 months ago
Using urls and table layout for web classification tasks
We propose new features and algorithms for automating Web-page classification tasks such as content recommendation and ad blocking. We show that the automated classification of We...
L. K. Shih, David R. Karger
GBRPR
2007
Springer
13 years 11 months ago
An Efficient Ontology-Based Expert Peering System
Abstract. This paper proposes an expert peering system for information exchange. Our objective is to develop a real-time search engine for an online community where users can ask e...
Tansu Alpcan, Christian Bauckhage, Sachin Agarwal