Search Sciweavers | Sciweavers

178 search results - page 12 / 36

» Scheduling Algorithms for Web Crawling

198

click to vote

ESWS
2008
Springer

102views Internet Technology» more ESWS 2008»

Instance Based Clustering of Semantic Web Resources

15 years 8 months ago

Download www.dfki.uni-kl.de

Abstract. The original Semantic Web vision was explicit in the need for intelligent autonomous agents that would represent users and help them navigate the Semantic Web. We argue t...

Gunnar Aastrand Grimnes, Peter Edwards, Alun D. Pr...

claim paper

Read More »

213

click to vote

WWW
2011
ACM

219views Internet Technology» more WWW 2011»

Inverted index compression via online document routing

15 years 1 months ago

Download www.cs.yale.edu

Modern search engines are expected to make documents searchable shortly after they appear on the ever changing Web. To satisfy this requirement, the Web is frequently crawled. Due...

Gal Lavee, Ronny Lempel, Edo Liberty, Oren Somekh

claim paper

Read More »

158

click to vote

WWW
2003
ACM

131views Internet Technology» more WWW 2003»

Dynamic maintenance of web indexes using landmarks

16 years 7 months ago

Download www.research.ibm.com

Recent work on incremental crawling has enabled the indexed document collection of a search engine to be more synchronized with the changing World Wide Web. However, this synchron...

Lipyeow Lim, Min Wang, Sriram Padmanabhan, Jeffrey...

claim paper

Read More »

173

click to vote

WWW
2005
ACM

95views Internet Technology» more WWW 2005»

Predictive ranking: a novel page ranking approach by estimating the web structure

16 years 7 months ago

Download www.cse.cuhk.edu.hk

PageRank (PR) is one of the most popular ways to rank web pages. However, as the Web continues to grow in volume, it is becoming more and more difficult to crawl all the available...

Haixuan Yang, Irwin King, Michael R. Lyu

claim paper

Read More »

200

click to vote

PAKDD
2009
ACM

116views Data Mining» more PAKDD 2009»

Scalable Web Mining with Newistic

16 years 1 months ago

Download www.horatiumocian.com

Abstract. Newistic is a web mining platform that collects and analyses documents crawled from the Internet. Although it currently processes news articles, it can be easily adapted ...

Ovidiu Dan, Horatiu Mocian

claim paper

Read More »

« Prev « First page 12 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers