Search Sciweavers | Sciweavers

684 search results - page 68 / 137

» Elimination of Redundant Information for Web Data Mining

ACMICEC
2006
ACM

141 views, ECommerce

From HTML documents to web tables and rules

14 years 1 month ago

Download www.informatik.uni-freiburg.de

We present a browser-extending Semantic Web extraction system that maps HTML documents to tables and, where possible, to rules. First, the basic data extractor ViPER distills and ...

Kai Simon, Georg Lausen, Harold Boley

PKDD
2004
Springer

91 views, Data Mining

Summarization of Dynamic Content in Web Collections

14 years 1 month ago

Download www.miv.t.u-tokyo.ac.jp

This paper describes a new research proposal of multi-document summarization of dynamic content in web pages. Much information is lost in the Web due to the temporal character of w...

Adam Jatowt, Mitsuru Ishizuka

ICDE
2009
IEEE

392 views, Database

FF-Anonymity: When Quasi-Identifiers Are Missing

15 years 7 months ago

Download www.cs.sfu.ca

Existing approaches on privacy-preserving data publishing rely on the assumption that data can be divided into quasi-identifier attributes (QI) and sensitive attribute (SA). This ...

Ada Wai-Chee Fu, Ke Wang, Raymond Chi-Wing Wong, Y...

WWW
2007
ACM

137 views, Internet Technology

Classifying web sites

14 years 8 months ago

Download www2007.org

In this paper, we present a novel method for the classification of Web sites. This method exploits both structure and content of Web sites in order to discern their functionality....

Christoph Lindemann, Lars Littig

WWW
2005
ACM

99 views, Internet Technology

The volume and evolution of web page templates

14 years 8 months ago

Download research.yahoo.com

Web pages contain a combination of unique content and template material, which is present across multiple pages and used primarily for formatting, navigation, and branding. We stu...

David Gibson, Kunal Punera, Andrew Tomkins
