Sciweavers

DEXA
2007
Springer

Beyond Lazy XML Parsing

14 years 6 months ago
Beyond Lazy XML Parsing
XML has become the standard format for data representation and exchange in domains ranging from Web to desktop applications. However, wide adoption of XML is hindered by inefficient document-parsing methods. Recent work on lazy parsing is a major step towards alleviating this problem. However, lazy parsers must still read the entire XML document in order to extract the overall document structure, due to the lack of internal navigation pointers inside XML documents. Further, these parsers must load and parse the entire virtual document tree into memory during XML query processing. These overheads significantly degrade the performance of navigation operations. We have developed a framework for efficient XML parsing based on the idea of placing internal physical pointers within the document, which allows skipping large portions of the document during parsing. The internal pointers are generated in a way that optimizes parsing for common navigation patterns. A double-Lazy Parser (2LP) is t...
Fernando Farfán, Vagelis Hristidis, Raju Ra
Added 07 Jun 2010
Updated 07 Jun 2010
Type Conference
Year 2007
Where DEXA
Authors Fernando Farfán, Vagelis Hristidis, Raju Rangaswami
Comments (0)