Search Sciweavers | Sciweavers

19 search results - page 1 / 4

» Effective web-scale crawling through website analysis

210

click to vote

WWW
2006
ACM

237views Internet Technology» more WWW 2006»

Effective web-scale crawling through website analysis

16 years 7 months ago

Download people.csail.mit.edu

The web crawler space is often delimited into two general areas: full-web crawling and focused crawling. We present netSifter, a crawler system which integrates features from thes...

Iván Gonzlez, Adam Marcus 0002, Daniel N. M...

claim paper

Read More »

167

click to vote

WWW
2007
ACM

98views Internet Technology» more WWW 2007»

A large-scale study of robots.txt

16 years 7 months ago

Download www2007.org

Search engines largely rely on Web robots to collect information from the Web. Due to the unregulated open-access nature of the Web, robot activities are extremely diverse. Such c...

Yang Sun, Ziming Zhuang, C. Lee Giles

claim paper

Read More »

203

click to vote

EMNLP
2009

121views Natural Language Processing» more EMNLP 2009»

Web-Scale Distributional Similarity and Entity Set Expansion

15 years 4 months ago

Download www.aclweb.org

Computing the pairwise semantic similarity between all words on the Web is a computationally challenging task. Parallelization and optimizations are necessary. We propose a highly...

Patrick Pantel, Eric Crestan, Arkady Borkovsky, An...

claim paper

Read More »

186

click to vote

SIGIR
2006
ACM

178views Information Technology» more SIGIR 2006»

AggregateRank: bringing order to web sites

16 years 24 days ago

Download research.microsoft.com

Since the website is one of the most important organizational structures of the Web, how to effectively rank websites has been essential to many Web applications, such as Web sear...

Guang Feng, Tie-Yan Liu, Ying Wang, Ying Bao, Zhim...

claim paper

Read More »

145

click to vote

ASSETS
2004
ACM

109views Emerging Technology» more ASSETS 2004»

Accessibility of Internet websites through time

16 years 9 days ago

Download www.pitt.edu

Using Internet Archive’s Wayback Machine, a random sample of websites from 1997-2002 were retrospectively analyzed for effects that technology has on accessibility for persons w...

Stephanie Hackett, Bambang Parmanto, Xiaoming Zeng

claim paper

Read More »

« Prev « First page 1 / 4 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers