Sciweavers

1700 search results - page 221 / 340
» Combinatorial Pattern Matching
Sort
View
WWW
2005
ACM
14 years 11 months ago
Thresher: automating the unwrapping of semantic content from the World Wide Web
We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...
Andrew Hogue, David R. Karger
PODS
2007
ACM
104views Database» more  PODS 2007»
14 years 11 months ago
XML transformation by tree-walking transducers with invisible pebbles
The pebble tree automaton and the pebble tree transducer are enhanced by additionally allowing an unbounded number of `invisible' pebbles (as opposed to the usual `visible�...
Joost Engelfriet, Hendrik Jan Hoogeboom, Bart Samw...
EDBT
2008
ACM
159views Database» more  EDBT 2008»
14 years 11 months ago
Automaton in or out: run-time plan optimization for XML stream processing
Many systems such as Tukwila and YFilter combine automaton and algebra techniques to process queries over tokenized XML streams. Typically in this architecture, an automaton is fi...
Hong Su, Elke A. Rundensteiner, Murali Mani
EDBT
2009
ACM
123views Database» more  EDBT 2009»
14 years 5 months ago
High-performance information extraction with AliBaba
A wealth of information is available only in web pages, patents, publications etc. Extracting information from such sources is challenging, both due to the typically complex langu...
Peter Palaga, Long Nguyen, Ulf Leser, Jörg Ha...
ICPR
2008
IEEE
14 years 5 months ago
Generative models for fingerprint individuality using ridge models
Generative models of pattern individuality attempt to learn the distribution of observed quantitative features to determine the probability of two random patterns being the same. ...
Chang Su, Sargur N. Srihari