Search Sciweavers | Sciweavers

178 search results - page 16 / 36

» Scheduling Algorithms for Web Crawling

197

click to vote

KDD
2007
ACM

189views Data Mining» more KDD 2007»

Corroborate and learn facts from the web

16 years 7 months ago

Download delivery.acm.org

The web contains lots of interesting factual information about entities, such as celebrities, movies or products. This paper describes a robust bootstrapping approach to corrobora...

Shubin Zhao, Jonathan Betz

claim paper

Read More »

160

click to vote

WWW
2010
ACM

201views Internet Technology» more WWW 2010»

Highlighting disputed claims on the web

16 years 1 months ago

Download berkeley.intel-research.net

We describe Dispute Finder, a browser extension that alerts a user when information they read online is disputed by a source that they might trust. Dispute Finder examines the tex...

Rob Ennals, Beth Trushkowsky, John Mark Agosta

claim paper

Read More »

181

click to vote

WIDM
2006
ACM

148views Internet Technology» more WIDM 2006»

Coarse-grained classification of web sites by their structural properties

16 years 22 days ago

Download rvs.informatik.uni-leipzig.de

In this paper, we identify and analyze structural properties which reflect the functionality of a Web site. These structural properties consider the size, the organization, the co...

Christoph Lindemann, Lars Littig

claim paper

Read More »

175

click to vote

WAW
2004
Springer

150views Algorithms» more WAW 2004»

Do Your Worst to Make the Best: Paradoxical Effects in PageRank Incremental Computations

16 years 4 days ago

Download vigna.dsi.unimi.it

d Abstract) Paolo Boldi† Massimo Santini‡ Sebastiano Vigna∗ Deciding which kind of visit accumulates high-quality pages more quickly is one of the most often debated issue i...

Paolo Boldi, Massimo Santini, Sebastiano Vigna

claim paper

Read More »

142

click to vote

WWW
2005
ACM

103views Internet Technology» more WWW 2005»

An information extraction engine for web discussion forums

16 years 10 days ago

Download www.www2005.org

In this poster, we present an information extraction engine for web-based forums. The engine analyzes the HTML files crawled from web forums, deduces the wrapper (template) of the...

Hanny Yulius Limanto, Nguyen Ngoc Giang, Vo Tan Tr...

claim paper

Read More »

« Prev « First page 16 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers