Sciweavers

330 search results - page 8 / 66
» Document structure analysis algorithms: a literature survey
Sort
View
DOCENG
2009
ACM
14 years 2 months ago
Object-level document analysis of PDF files
The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...
Tamir Hassan
GRC
2005
IEEE
14 years 1 months ago
Semantic based clustering of Web documents
Abstract. A new methodology that structures the semantics of a collection of documents into the geometry of a simplicial complex is developed. A simplicial complex is topologically...
Tsau Young Lin, I-Jen Chiang
DEBU
2000
125views more  DEBU 2000»
13 years 7 months ago
Link Analysis in Web Information Retrieval
The analysis of the hyperlink structure of the web has led to significant improvements in web information retrieval. This survey describes two successful link analysis algorithms ...
Monika Rauch Henzinger
ICDAR
2011
IEEE
12 years 7 months ago
A Table Detection Method for Multipage PDF Documents via Visual Seperators and Tabular Structures
—Table detection is always an important task of document analysis and recognition. In this paper, we propose a novel and effective table detection method via visual separators an...
Jing Fang, Liangcai Gao, Kun Bai, Ruiheng Qiu, Xin...
DAS
2006
Springer
13 years 9 months ago
Extraction and Analysis of Document Examiner Features from Vector Skeletons of Grapheme 'th'
Abstract. This paper presents a study of 25 structural features extracted from samples of grapheme `th' that correspond to features commonly used by forensic document examiner...
Vladimir Pervouchine, Graham Leedham