Sciweavers

» On the Automatic Extraction of Data from the Hidden Web
WIDM
2003
ACM
14 years 1 month ago
Datarover: a taxonomy based crawler for automated data extraction from data-intensive websites
The advent of e-commerce has created a trend that brought thousands of catalogs online. Most of these websites are “taxonomy-directed”. A Web site is said to be “taxonomydire...
Hasan Davulcu, S. Koduri, Saravanakumar Nagarajan
NIPS
2008
13 years 10 months ago
Extracting State Transition Dynamics from Multiple Spike Trains with Correlated Poisson HMM
Neural activity is non-stationary and varies across time. Hidden Markov Models (HMMs) have been used to track the state transition among quasi-stationary discrete neural states. W...
Kentaro Katahira, Jun Nishikawa, Kazuo Okanoya, Ma...
SIGMOD
2003
ACM
14 years 1 month ago
Extracting Structured Data from Web Pages
Many web sites contain large sets of pages generated using a common template or layout. For example, Amazon lays out the author, title, comments, etc. in the same way in all its b...
Arvind Arasu, Hector Garcia-Molina
SIGMOD
2008
ACM
14 years 8 months ago
Web-scale extraction of structured data
A long-standing goal of Web research has been to construct a unified Web knowledge base. Information extraction techniques have shown good results on Web inputs, but even most dom...
Michael J. Cafarella, Jayant Madhavan, Alon Y. Hal...
BNCOD
2006
13 years 10 months ago
The Lixto Project: Exploring New Frontiers of Web Data Extraction
The Lixto project is an ongoing research effort in the area of Web data extraction. Whereas the project originally started out with the idea to develop a logic-based extraction lan...
Julien Carme, Michal Ceresna, Oliver Frölich,...