Search Sciweavers | Sciweavers

2677 search results - page 36 / 536

» Extracting Structured Data from Web Pages

WWW
2003
ACM

Internet Technology, 130 views

DOM-based content extraction of HTML documents

16 years 7 months ago

Download www.psl.cs.columbia.edu

Web pages often contain clutter (such as pop-up ads, unnecessary images and extraneous links) around the body of an article that distracts a user from actual content. Extraction o...

Suhit Gupta, Gail E. Kaiser, David Neistadt, Peter...

CIKM
2005
Springer

Information Technology, 134 views

Versatile structural disambiguation for semantic-aware applications

16 years 13 days ago

Download www.isgroup.unimo.it

In this paper, we propose a versatile disambiguation approach which can be used to make explicit the meaning of structure based information such as XML schemas, XML document struc...

Federica Mandreoli, Riccardo Martoglia, Enrico Ron...

WWW
2006
ACM

Internet Technology, 96 views

What's really new on the web?: identifying new pages from a series of unstable web snapshots

16 years 7 months ago

Download www.tkl.iis.u-tokyo.ac.jp

Identifying and tracking new information on the Web is important in sociology, marketing, and survey research, since new trends might be apparent in the new information. Such chan...

Masashi Toyoda, Masaru Kitsuregawa

WWW
2005
ACM

Internet Technology, 154 views

Thresher: automating the unwrapping of semantic content from the World Wide Web

16 years 7 months ago

Download www2005.org

We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...

Andrew Hogue, David R. Karger

WISE
2000
Springer

Internet Technology, 99 views

Structured Web Pages Management for Efficient Data Retrieval

15 years 11 months ago
