Web count statistics gathered from search engines have been widely used as a resource in a variety of NLP tasks. For some tasks, however, the information they exploit is not fine-...
The problem of information integration is discussed in the context of answering a query over the web. Querying the web requires that information from different web and other sourc...
This paper describes the development and evaluation of Synote, a freely available, accessible web-based application that makes multimedia web resources (e.g. podcasts) easier to ac...
The amount of information available online has grown enormously over the past decade. Fortunately, computing power, disk capacity, and network bandwidth have also increased dramat...
Sergey Brin, Rajeev Motwani, Lawrence Page, Terry ...
The book covers the following topics: examining the structure of HTTP requests, monitoring the packets being transferred between a web server and web browser, executing simple HTTP...