Search Sciweavers | Sciweavers

511 search results - page 67 / 103

» Discovering data dependencies in Web content mining

126

click to vote

ACMICEC
2006
ACM

141views ECommerce» more ACMICEC 2006»

From HTML documents to web tables and rules

15 years 10 months ago

Download www.informatik.uni-freiburg.de

We present a browser-extending Semantic Web extraction system that maps HTML documents to tables and, where possible, to rules. First, the basic data extractor ViPER distills and ...

Kai Simon, Georg Lausen, Harold Boley

claim paper

Read More »

221

click to vote

IDA
2011
Springer

312views Information Technology» more IDA 2011»

A parallel, distributed algorithm for relational frequent pattern discovery from very large data sets

14 years 11 months ago

Download www.di.uniba.it

The amount of data produced by ubiquitous computing applications is quickly growing, due to the pervasive presence of small devices endowed with sensing, computing and communicatio...

Annalisa Appice, Michelangelo Ceci, Antonio Turi, ...

claim paper

Read More »

119

click to vote

CIKM
2009
Springer

127views Information Technology» more CIKM 2009»

Vetting the links of the web

15 years 11 months ago

Download www.cse.lehigh.edu

Many web links mislead human surfers and automated crawlers because they point to changed content, out-of-date information, or invalid URLs. It is a particular problem for large, ...

Na Dai, Brian D. Davison

claim paper

Read More »

161

click to vote

WWW
2007
ACM

144views Internet Technology» more WWW 2007»

Towards domain-independent information extraction from web tables

16 years 5 months ago

Download www2007.org

Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. A mul...

Bernhard Krüpl, Bernhard Pollak, Marcus Herzo...

claim paper

Read More »

123

click to vote

PKDD
2007
Springer

120views Data Mining» more PKDD 2007»

Site-Independent Template-Block Detection

15 years 10 months ago

Download research.microsoft.com

Detection of template and noise blocks in web pages is an important step in improving the performance of information retrieval and content extraction. Of the many approaches propos...

Aleksander Kolcz, Wen-tau Yih

claim paper

Read More »

« Prev « First page 67 / 103 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers