Fast In-Memory XPath Search using Compressed Indexes

A large fraction of an XML document typically consists of text data. The XPath query language allows text search via the equal, contains, and starts-with predicates. Such predicates can be efficiently implemented using a compressed self-index of the document's text nodes. Most queries, however, contain some parts querying the text of the document, plus some parts querying the tree structure. It is therefore a challenge to choose an appropriate evaluation order for a given query, which optimally leverages the execution speeds of the text and tree indexes. Here the SXSI system is introduced. It stores the tree structure of an XML document using a bit array of opening and closing brackets plus a sequence of labels, and stores the text nodes of the document using a global compressed self-index. On top of these indexes sits an XPath query engine that is based on tree automata. The engine uses fast counting queries of the text index in order to dynamically determine whether to evaluate ...
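The storage layout described in the abstract can be illustrated with a small sketch. It is not the SXSI implementation: the function names (encode, find_close, count_contains) are hypothetical, the bracket sequence is a plain Python list rather than a succinct bit array, and the counting query is a linear scan standing in for a compressed self-index. It only shows how a tree can be flattened into opening and closing brackets plus a label sequence, with text nodes kept separately.

```python
import xml.etree.ElementTree as ET


def encode(xml_string):
    """Return (bits, labels, texts) for the document, all in preorder.

    bits   : 1 for an opening bracket, 0 for a closing bracket
    labels : element tag of each node, in the order the brackets open
    texts  : non-empty leading text of each element (tail text is ignored
             here to keep the sketch short)
    """
    bits, labels, texts = [], [], []

    def visit(node):
        bits.append(1)                      # opening bracket
        labels.append(node.tag)
        if node.text and node.text.strip():
            texts.append(node.text.strip())
        for child in node:
            visit(child)
        bits.append(0)                      # closing bracket

    visit(ET.fromstring(xml_string))
    return bits, labels, texts


def find_close(bits, i):
    """Index of the closing bracket matching the opening bracket at i.

    Linear scan; a succinct tree representation would answer this much faster.
    """
    depth = 0
    for j in range(i, len(bits)):
        depth += 1 if bits[j] else -1
        if depth == 0:
            return j
    raise ValueError("unbalanced bracket sequence")


def count_contains(texts, pattern):
    """Naive stand-in for a counting query: text nodes containing pattern."""
    return sum(pattern in t for t in texts)


doc = "<a><b>foo</b><c>foobar</c></a>"
bits, labels, texts = encode(doc)
```

For this document the bracket sequence reads ( ( ) ( ) ), the subtree of a node spans from its opening bracket to the position returned by find_close, and a count such as count_contains(texts, "bar") gives a cheap estimate of how selective a text predicate is before any tree navigation happens.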

Diego Arroyuelo, Francisco Claude, Sebastian Maneth

Database | Fast Counting Queries | ICDE 2010 | Pure Tree Queries | Text Nodes |

» XPath Whole Query Optimization

» Optimizing positional index structures for versioned document collections

» Compression Indexing and Retrieval for Massive String Data

» Type less find more fast autocompletion search with a succinct index

» Inverted indexes for phrases and strings

» XLeaf Twig Evaluation with Skipping Loop Joins and Virtual Nodes

» Improving Web search efficiency via a locality based static pruning method

» Intersection in Integer Inverted Indices

Post Info

Added	20 Dec 2009
Updated	03 Jan 2010
Type	Conference
Year	2010
Where	ICDE
Authors	Diego Arroyuelo, Francisco Claude, Sebastian Maneth, Veli Mäkinen, Gonzalo Navarro, Kim Nguyen, Jouni Sirén, Niko Välimäki
