Soon, much of the data exchanged over the Internet will be encoded in XML, allowing for sophisticated filtering and content-based routing. We have built a filtering engine called YFilter, which filters streaming XML documents according to XQuery or XPath queries that involve both path expressions and predicates. Unlike previous work, YFilter uses a novel NFA-based execution model. In this demonstration, we present the structures and algorithms underlying YFilter, and show its efficiency and scalability under various workloads. 1 Overview Recently, there has been growing interest in the filtering and routing of data based on user preferences. In an XML filtering system, continuously arriving XML documents are routed to users according to subscriptions specified as queries. XML allows the encoding of semantic and structural information that can improve delivery accuracy. For large systems, filtering efficiency and scalability are of paramount concern. Filtering systems have traditionall...
Yanlei Diao, Peter M. Fischer, Michael J. Franklin