Sciweavers

466 search results - page 23 / 94
» Scalable Feature Extraction from Noisy Documents
DMIN
2006
150 views, Data Mining
13 years 11 months ago
Effect of Document Representation on the Performance of Medical Document Classification
Text classification in the medical domain is a real-world problem with wide applicability. This paper investigates extensively the effect of text representation approaches on the p...
Fathi H. Saad, Beatriz de la Iglesia, Duncan G. Be...
ICDE
2005
IEEE
106 views, Database
14 years 3 months ago
Reconstructing XML Subtrees from Relational Storage of XML documents
Numerous researchers have proposed to use relational databases to store and query XML documents. One important component of such systems is the XML subtree reconstruction, which r...
Artem Chebotko, Dapeng Liu, Mustafa Atay, Shiyong ...
ICIP
2001
IEEE
14 years 11 months ago
Similarity measure for CCITT Group 4 compressed document images
Similarity measurement of document images plays a crucial role in the area of document image retrieval. A method of measuring the similarity of CCITT Group 4 compressed document images...
Yue Lu, Chew Lim Tan, Liying Fan, Weihua Huang
EDBT
2009
ACM
123 views, Database
14 years 4 months ago
High-performance information extraction with AliBaba
A wealth of information is available only in web pages, patents, publications etc. Extracting information from such sources is challenging, both due to the typically complex langu...
Peter Palaga, Long Nguyen, Ulf Leser, Jörg Ha...
CIKM
2000
Springer
14 years 2 months ago
Scalable association-based text classification
The Naïve Bayes (NB) classifier has long been considered a core methodology in text classification, mainly due to its simplicity and computational efficiency. There is an increasing n...
Dimitris Meretakis, Dimitris Fragoudis, Hongjun Lu...