Search Sciweavers | Sciweavers

178 search results - page 19 / 36

» Scheduling Algorithms for Web Crawling

192

click to vote

WSDM
2010
ACM

204views Data Mining» more WSDM 2010»

Learning URL patterns for webpage de-duplication

16 years 1 months ago

Download www.wsdm-conference.org

Presence of duplicate documents in the World Wide Web adversely aﬀects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...

Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...

claim paper

Read More »

213

click to vote

WWW
2007
ACM

189views Internet Technology» more WWW 2007»

Extraction and classification of dense communities in the web

16 years 7 months ago

Download www2007.org

The World Wide Web (WWW) is rapidly becoming important for society as a medium for sharing data, information and services, and there is a growing interest in tools for understandi...

Yon Dourisboure, Filippo Geraci, Marco Pellegrini

claim paper

Read More »

201

click to vote

WWW
2009
ACM

125views Internet Technology» more WWW 2009»

Triplify: light-weight linked data publication from relational databases

16 years 7 months ago

Download www.informatik.uni-leipzig.de

In this paper we present Triplify ? a simplistic but effective approach to publish Linked Data from relational databases. Triplify is based on mapping HTTP-URI requests onto relat...

Sören Auer, Sebastian Dietzold, Jens Lehmann,...

claim paper

Read More »

187

click to vote

IPPS
2005
IEEE

144views Distributed And Parallel Com...» more IPPS 2005»

QoS Aware Job Scheduling in a Cluster-Based Web Server for Multimedia Applications

16 years 13 days ago

Download www.cse.ohio-state.edu

We propose a cluster-based web server where a few computing nodes are separately reserved for high-performance computing applications, such as multimedia, SSL, and CGI. As an exam...

Jiani Guo, Laxmi N. Bhuyan, Raj Kumar, Sujoy Basu

claim paper

Read More »

174

click to vote

WWW
2007
ACM

98views Internet Technology» more WWW 2007»

The discoverability of the web

16 years 7 months ago

Download www2007.org

Previous studies have highlighted the high arrival rate of new content on the web. We study the extent to which this new content can be efficiently discovered by a crawler. Our st...

Anirban Dasgupta, Arpita Ghosh, Ravi Kumar, Christ...

claim paper

Read More »

« Prev « First page 19 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers