Sciweavers

296 search results - page 30 / 60
» Classifying XML Documents by Using Genre Features
Sort
View
ICDAR
2003
IEEE
15 years 8 months ago
Document page similarity based on layout visual saliency: Application to query by example and document classification
In this paper we propose to define a measure of visual similarity to compare different pages in a corpus. This measure is based on the analysis of the visual layout saliency of th...
Véronique Eglin, Stéphane Bres
121
Voted
PRICAI
2000
Springer
15 years 6 months ago
Text Retrieval from Document Images based on N-Gram Algorithm
In this paper, we propose a method of text retrieval from document images using a similarity measure based on an N-Gram algorithm. We directly extract image features instead of us...
Chew Lim Tan, Sam Yuan Sung, Zhaohui Yu, Yi Xu
126
Voted
IJCNLP
2005
Springer
15 years 8 months ago
Classifying Chinese Texts in Two Steps
Abstract. This paper proposes a two-step method for Chinese text categorization (TC). In the first step, a Naïve Bayesian classifier is used to fix the fuzzy area between two cate...
Xinghua Fan, Maosong Sun, Key-Sun Choi, Qin Zhang
CIKM
2001
Springer
15 years 7 months ago
X007: Applying 007 Benchmark to XML Query Processing Tool
If XML is to play the critical role of the lingua franca for Internet data interchange that many predict, it is necessary to start designing and adopting benchmarks allowing the c...
Stéphane Bressan, Gillian Dobbie, Zoé...
120
Voted
ICPR
2004
IEEE
16 years 3 months ago
Serialized Unsupervised Classifier for Adaptative Color Image Segmentation: Application to Digitized Ancient Manuscripts
This paper presents an adaptative algorithm for the segmentation of color images suited for document image analysis. The algorithm is based on a serialization of the k-means algor...
Frank Le Bourgeois, Hubert Emptoz, Yann Leydier