The ability to find tables and extract information from them is a necessary component of many information retrieval tasks. Documents often contain tables in order to communicate d...
Online service providers are engaged in constant conflict with miscreants who try to siphon a portion of legitimate traffic to make illicit profits. We study the abuse of “tr...
Tyler Moore, Nektarios Leontiadis, Nicolas Christi...
The attention economy motivates participation in peerproduced sites on the Web like YouTube and Wikipedia. However, this economy appears to break down at work. We studied a large ...
Sarita Yardi, Scott A. Golder, Michael J. Brzozows...
Caching Web objects has become a common practice towards improving content delivery and users’ servicing. A Web caching framework is characterized by its cache replacement polic...
George Pallis, Athena Vakali, Eythimis Sidiropoulo...
The Deep Web, i.e., content hidden behind HTML forms, has long been acknowledged as a significant gap in search engine coverage. Since it represents a large portion of the structu...
Jayant Madhavan, David Ko, Lucja Kot, Vignesh Gana...