Sciweavers

ADC 2003, Springer
Automated Discovery of Search Interfaces on the Web
Web search engines work well for finding crawlable pages, but not for finding datasets hidden behind Web search forms. We describe a novel technique for detecting search forms, ...
Jared Cope, Nick Craswell, David Hawking
WWW 2001, ACM
Crawling the Hidden Web
Current-day crawlers retrieve content only from the publicly indexable Web, i.e., the set of Web pages reachable purely by following hypertext links, ignoring search forms and pag...
Sriram Raghavan, Hector Garcia-Molina