Sciweavers

160 search results - page 9 / 32
» Exploiting structural information for semi-structured docume...
Sort
View
CIKM
2010
Springer
13 years 4 months ago
Crawling the web for structured documents
Structured Information Retrieval is gaining a lot of interest in recent years, as this kind of information is becoming an invaluable asset for professional communities such as Sof...
Julián Urbano, Juan Loréns, Yorgos A...
WIDM
2003
ACM
14 years 22 days ago
Clustering documents in a web directory
Hierarchical categorization of documents is a task receiving growing interest due to the widespread proliferation of topic hierarchies for text documents. The worst problem of hie...
Giordano Adami, Paolo Avesani, Diego Sona
IPM
2000
76views more  IPM 2000»
13 years 7 months ago
Structured storage and retrieval of SGML documents using Grove
SGML standardized in ISO 8879 [International Organization for Standardization (1986)] has been proliferated because it can provide various styles and transform documents on dieren...
Hak-Gyoon Kim, Sung-Bae Cho
WWW
2005
ACM
14 years 8 months ago
Hubble: an advanced dynamic folder system for XML
Organizing large document collections for finding information easily and quickly has always been an important user requirement. This paper describes a flexible and powerful dynami...
Ning Li, Joshua Hui, Hui-I Hsiao, Kevin S. Beyer
SAC
2004
ACM
14 years 28 days ago
An optimized approach for KNN text categorization using P-trees
The importance of text mining stems from the availability of huge volumes of text databases holding a wealth of valuable information that needs to be mined. Text categorization is...
Imad Rahal, William Perrizo