Sciweavers

472 search results - page 74 / 95
» Crawling the Hidden Web
Sort
View
WWW
2001
ACM
14 years 9 months ago
Seeing the whole in parts: text summarization for web browsing on handheld devices
We introduce five methods for summarizing parts of Web pages on handheld devices, such as personal digital assistants (PDAs), or cellular phones. Each Web page is broken into text...
Orkut Buyukkokten, Hector Garcia-Molina, Andreas P...
WEBDB
2009
Springer
124views Database» more  WEBDB 2009»
14 years 3 months ago
Bridging the Terminology Gap in Web Archive Search
Web archives play an important role in preserving our cultural heritage for future generations. When searching them, a serious problem arises from the fact that terminology evolve...
Klaus Berberich, Srikanta J. Bedathur, Mauro Sozio...
EDBTW
2004
Springer
14 years 2 months ago
Clustering Structured Web Sources: A Schema-Based, Model-Differentiation Approach
Abstract. The Web has been rapidly “deepened” with the prevalence of databases online. On this “deep Web,” numerous sources are structured, providing schema-rich data– Th...
Bin He, Tao Tao, Kevin Chen-Chuan Chang
SIGMOD
2010
ACM
165views Database» more  SIGMOD 2010»
13 years 9 months ago
Creating and exploring web form repositories
We present DeepPeep (http://www.deeppeep.org), a new system for discovering, organizing and analyzing Web forms. DeepPeep allows users to explore the entry points to hidden-Web si...
Luciano Barbosa, Hoa Nguyen, Thanh Hoang Nguyen, R...
ICIP
1999
IEEE
14 years 10 months ago
Color Documents on the Web with DJVU
We present a new image compression technique called DjVu" that is speci cally geared towards the compression of scanned documents in color at high resolution. With DjVu, a ma...
Bill Riemers, Léon Bottou, Pascal Vincent, ...