Sciweavers

» Finding and Extracting Data Records from Web Pages
ECTEL
2006
Springer
14 years 4 days ago
Finding Communities of Practice from User Profiles Based on Folksonomies
User profiles can be used to identify persons inside a community with similar interests. Folksonomy systems allow users to individually tag the objects of a common set (e.g., web p...
Jörg Diederich, Tereza Iofciu
WWW
2003
ACM
14 years 9 months ago
DOM-based content extraction of HTML documents
Web pages often contain clutter (such as pop-up ads, unnecessary images and extraneous links) around the body of an article that distracts a user from actual content. Extraction o...
Suhit Gupta, Gail E. Kaiser, David Neistadt, Peter...
DILS
2009
Springer
14 years 3 months ago
Site-Wide Wrapper Induction for Life Science Deep Web Databases
We present a novel approach to automatic information extraction from Deep Web Life Science databases using wrapper induction. Traditional wrapper induction techniques focus on lear...
Saqib Mir, Steffen Staab, Isabel Rojas
WWW
2005
ACM
14 years 9 months ago
Thresher: automating the unwrapping of semantic content from the World Wide Web
We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...
Andrew Hogue, David R. Karger
IC
2003
13 years 9 months ago
An Analysis of Web Documents Retrieved and Viewed
The placement of Web sites in ranked retrieval and the viewing patterns of Web search engine users are crucial issues for Web site owners and Web search engines. However, little la...
Bernard J. Jansen, Amanda Spink