Search Sciweavers | Sciweavers

267 search results - page 37 / 54

» First-Order Learning for Web Mining

125

Voted

KDD
2008
ACM

183views Data Mining» more KDD 2008»

De-duping URLs via rewrite rules

16 years 2 months ago

Download research.yahoo.com

A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...

Anirban Dasgupta, Ravi Kumar, Amit Sasturkar

claim paper

Read More »

130

click to vote

KDD
2010
ACM

277views Data Mining» more KDD 2010»

Growing a tree in the forest: constructing folksonomies by integrating structured metadata

15 years 6 months ago

Download linqs.cs.umd.edu

Many social Web sites allow users to annotate the content with descriptive metadata, such as tags, and more recently to organize content hierarchically. These types of structured ...

Anon Plangprasopchok, Kristina Lerman, Lise Getoor

claim paper

Read More »

115

Voted

WWW
2008
ACM

124views Internet Technology» more WWW 2008»

iRobot: an intelligent crawler for web forums

16 years 3 months ago

Download www2008.org

We study in this paper the Web forum crawling problem, which is a very fundamental step in many Web applications, such as search engine and Web data mining. As a typical user-crea...

Rui Cai, Jiang-Ming Yang, Wei Lai, Yida Wang, Lei ...

claim paper

Read More »

176

Voted

WSDM
2012
ACM

207views Data Mining» more WSDM 2012»

Domain bias in web search

13 years 10 months ago

Download ilpubs.stanford.edu

This paper uncovers a new phenomenon in web search that we call domain bias — a user’s propensity to believe that a page is more relevant just because it comes from a particul...

Samuel Ieong, Nina Mishra, Eldar Sadikov, Li Zhang

claim paper

Read More »

113

Voted

AWIC
2003
Springer

140views Internet Technology» more AWIC 2003»

Web Page Classification: A Soft Computing Approach

15 years 7 months ago

Download gavab.escet.urjc.es

The Internet makes it possible to share and manipulate a vast quantity of information efficiently and effectively, but the rapid and chaotic growth experienced by the Net has gener...

Angela Ribeiro, Víctor Fresno, Maria C. Gar...

claim paper

Read More »

« Prev « First page 37 / 54 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers