Sciweavers

1914 search results - page 86 / 383
» Predicting Web Information Content
WWW
2005
ACM
14 years 9 months ago
Extracting context to improve accuracy for HTML content extraction
Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...
Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo
PERCOM
2008
ACM
14 years 8 months ago
SeeNSearch: A context directed search facilitator for home entertainment devices
The Internet has become an extremely popular source of entertainment and information. But, despite the growing amount of media content, most Web sites today are designed for acces...
Alan Messer, Anugeetha Kunjithapatham, Phuong Nguy...
WWW
2005
ACM
14 years 9 months ago
WEESA: Web engineering for semantic Web applications
The success of the Semantic Web crucially depends on the existence of Web pages that provide machine-understandable meta-data. This meta-data is typically added in the semantic an...
Gerald Reif, Harald Gall, Mehdi Jazayeri
WWW
2011
ACM
13 years 3 months ago
HyLiEn: a hybrid approach to general list extraction on the web
We consider the problem of automatically extracting general lists from the web. Existing approaches are mostly dependent upon either the underlying HTML markup or the visual struc...
Fabio Fumarola, Tim Weninger, Rick Barber, Donato ...
WWW
2007
ACM
14 years 9 months ago
The discoverability of the web
Previous studies have highlighted the high arrival rate of new content on the web. We study the extent to which this new content can be efficiently discovered by a crawler. Our st...
Anirban Dasgupta, Arpita Ghosh, Ravi Kumar, Christ...