Sciweavers

249 search results - page 21 / 50
» Classification of Documents Based on the Structure of Their ...
Sort
View
DOCENG
2009
ACM
14 years 2 months ago
Automated extensible XML tree diagrams
: © Automated Extensible XML Tree Diagrams John Lumley HP Laboratories HPL-2009-137 XSLT, SVG, XML trees, Functional programming XML is a tree-oriented meta-language and visual de...
John Lumley
DAS
2006
Springer
13 years 11 months ago
On Benchmarking of Invoice Analysis Systems
Abstract. An approach is presented to guide the benchmarking of invoice analysis systems, a specific, applied subclass of document analysis systems. The state of the art of benchma...
Bertin Klein, Stefan Agne, Andreas Dengel
IJIS
2008
123views more  IJIS 2008»
13 years 7 months ago
Algorithms of nonlinear document clustering based on fuzzy multiset model
Abstract: Fuzzy multiset is applicable as a model of information retrieval because it has the mathematical structure which expresses the number and the degree of attribution of an ...
Kiyotaka Mizutani, Ryo Inokuchi, Sadaaki Miyamoto
DASFAA
2004
IEEE
135views Database» more  DASFAA 2004»
13 years 11 months ago
Semi-supervised Text Classification Using Partitioned EM
Text classification using a small labeled set and a large unlabeled data is seen as a promising technique to reduce the labor-intensive and time consuming effort of labeling traini...
Gao Cong, Wee Sun Lee, Haoran Wu, Bing Liu
PKDD
2007
Springer
141views Data Mining» more  PKDD 2007»
14 years 1 months ago
Automatic Hidden Web Database Classification
In this paper, a method for automatic classification of Hidden-Web databases is addressed. In our approach, the classification tree for Hidden Web databases is constructed by tailo...
Zhiguo Gong, Jingbai Zhang, Qian Liu