Sciweavers

66 search results - page 2 / 14
» Measuring the Structural Similarity of Semistructured Docume...
EP
1998
Springer
13 years 11 months ago
Measuring Structural Similarity Among Web Documents: Preliminary Results
When we describe a Web page informally, we often use phrases like "it looks like a newspaper site", "there are several unordered lists" or "it's just a collection of li...
Isabel F. Cruz, Slava Borisov, Michael A. Marks, T...
WEBI
2004
Springer
14 years 23 days ago
Semi-Structured Complex List Extraction
The semi-structured information available in HTML and similar documents provides valuable information that can be used for information extraction applications. This information tog...
Anders Arpteg
IPM
2007
13 years 7 months ago
Using structural contexts to compress semistructured text collections
We describe a compression model for semistructured documents, called Structural Contexts Model (SCM), which takes advantage of the context information usually implicit in the stru...
Joaquín Adiego, Gonzalo Navarro, Pablo de l...
ENC
2005
IEEE
14 years 1 month ago
Combining Structural and Textual Contexts for Compressing Semistructured Databases
We describe a compression technique for semistructured documents, called SCMPPM, which combines the Prediction by Partial Matching technique with Structural Contexts Model (SCM) t...
Joaquín Adiego, Pablo de la Fuente, Gonzalo...
AUSDM
2008
Springer
13 years 9 months ago
Combining Structure and Content Similarities for XML Document Clustering
This paper proposes a clustering approach that explores both the content and the structure of XML documents for determining similarity among them. Assuming that the content and th...
Tien Tran, Richi Nayak, Peter Bruza