Sciweavers

129 search results - page 5 / 26
» Combining content extraction heuristics: the CombinE system
WWW
2011
ACM
13 years 2 months ago
Track globally, deliver locally: improving content delivery networks by tracking geographic social cascades
Providers such as YouTube offer easy access to multimedia content to millions, generating high bandwidth and storage demand on the Content Delivery Networks they rely upon. More ...
Salvatore Scellato, Cecilia Mascolo, Mirco Musoles...
WIDM
2003
ACM
14 years 21 days ago
Datarover: a taxonomy based crawler for automated data extraction from data-intensive websites
The advent of e-commerce has created a trend that brought thousands of catalogs online. Most of these websites are “taxonomy-directed”. A Web site is said to be “taxonomy-dire...
Hasan Davulcu, S. Koduri, Saravanakumar Nagarajan
EMNLP
2007
13 years 9 months ago
Bootstrapping Information Extraction from Field Books
We present two machine learning approaches to information extraction from semi-structured documents that can be used if no annotated training data are available, but there does ex...
Sander Canisius, Caroline Sporleder
WWW
2006
ACM
14 years 8 months ago
Detecting spam web pages through content analysis
In this paper, we continue our investigations of "web spam": the injection of artificially-created pages into the web in order to influence the results from search engin...
Alexandros Ntoulas, Marc Najork, Mark Manasse, Den...
IV
2000
IEEE
13 years 12 months ago
Content-Based Image Visualization
The proliferation of content-based image retrieval techniques has highlighted the need to understand the relationship between image clustering based on low-level image features and...
Chaomei Chen, George Gagaudakis, Paul L. Rosin