Sciweavers

2677 search results - page 73 / 536
» Extracting Structured Data from Web Pages
Sort
View
BMCBI
2008
91views more  BMCBI 2008»
13 years 8 months ago
PageRank without hyperlinks: Reranking with PubMed related article networks for biomedical text retrieval
Background: Graph analysis algorithms such as PageRank and HITS have been successful in Web environments because they are able to extract important inter-document relationships fr...
Jimmy J. Lin
WWW
2005
ACM
14 years 9 months ago
The volume and evolution of web page templates
Web pages contain a combination of unique content and template material, which is present across multiple pages and used primarily for formatting, navigation, and branding. We stu...
David Gibson, Kunal Punera, Andrew Tomkins
CIKM
2008
Springer
13 years 10 months ago
A densitometric approach to web page segmentation
Web Page segmentation is a crucial step for many applications in Information Retrieval, such as text classification, de-duplication and full-text search. In this paper we describe...
Christian Kohlschütter, Wolfgang Nejdl
PAKDD
2001
ACM
157views Data Mining» more  PAKDD 2001»
14 years 1 months ago
Applying Pattern Mining to Web Information Extraction
Information extraction (IE) from semi-structured Web documents is a critical issue for information integration systems on the Internet. Previous work in wrapper induction aim to so...
Chia-Hui Chang, Shao-Chen Lui, Yen-Chin Wu
IDEAS
2000
IEEE
98views Database» more  IDEAS 2000»
14 years 1 months ago
Keeping Web Pages Up-to-Date with SQL: 1999
From the beginnings of the World Wide Web (WWW or Web) and the definition of the Common Gateway Interface (CGI), Web site administrators have used dynamically generated HTML page...
Henrik Loeser