Content and Structure in Indexing and Ranking XML

Rooted in electronic publishing, XML is now widely used for modelling and storing structured text documents. Especially on the WWW, retrieval of XML documents is most useful in combination with a relevance-based ranking of the query result. Index structures with ranking support are therefore needed for fast access to relevant parts of large document collections. This paper proposes a classification scheme for both XML ranking models and index structures, making it possible to determine which index suits which ranking model. An analysis reveals that ranking parameters related to both the content and structure of the data are poorly supported by most known XML indices. The IR-CADG index, owing to its tight integration of content and structure, supports various XML ranking models in a very efficient retrieval process. Experiments show that it outperforms separate content/structure indexing by more than two orders of magnitude for large corpora of several hundred MB.

Felix Weigel, Holger Meuss, Klaus U. Schulz, François Bry

Index Structures | Internet Technology | Ranking | WEBDB 2004 | XML Ranking Models

» Aggregated Feature Retrieval for MPEG7

» CoXML A Cooperative XML Query Answering System

» Phil A Lazy Implementation of a Language for Approximate Filtering of XML Documents

» ViST A Dynamic Index Method for Querying XML Data by Tree Structures

» An Extended Preorder Index for Optimising XPath Expressions

» FLUX fuzzy content and structure matching of XML range queries

» Hierarchical Indexing and Flexible Element Retrieval for Structured Document

» Indexing and Searching XML Documents Based on Content and Structure Synopses

Post Info

Added	02 Jul 2010
Updated	02 Jul 2010
Type	Conference
Year	2004
Where	WEBDB
Authors	Felix Weigel, Holger Meuss, Klaus U. Schulz, François Bry
