Search Sciweavers | Sciweavers


WWW
2003
ACM

Internet Technology

Efficient URL caching for world wide web crawling

14 years 8 months ago

Crawling the web is deceptively simple: the basic algorithm is (a) Fetch a page (b) Parse it to extract all linked URLs (c) For all the URLs not seen before, repeat (a)–(c). Howev...

Andrei Z. Broder, Marc Najork, Janet L. Wiener
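
The three-step loop quoted in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the `crawl` function, the `LinkExtractor` class, the in-memory `PAGES` table, and the `fake_fetch` helper are all hypothetical names introduced here, and a plain Python set stands in for the "seen before" check rather than the URL caching schemes the paper studies.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed, fetch):
    """Breadth-first crawl from `seed`; `fetch` maps a URL to its HTML text."""
    seen = {seed}               # URLs already discovered (assumed to fit in memory)
    frontier = deque([seed])    # URLs discovered but not yet fetched
    order = []                  # URLs in the order they were fetched
    while frontier:
        url = frontier.popleft()
        html = fetch(url)                   # (a) fetch a page
        order.append(url)
        parser = LinkExtractor()
        parser.feed(html)                   # (b) parse it to extract linked URLs
        for href in parser.links:
            link = urljoin(url, href)       # resolve relative links
            if link not in seen:            # (c) only URLs not seen before
                seen.add(link)
                frontier.append(link)
    return order


# Hypothetical three-page "web" so the sketch runs without network access.
PAGES = {
    "http://example.test/": '<a href="/a">A</a> <a href="/b">B</a>',
    "http://example.test/a": '<a href="/b">B</a> <a href="/">home</a>',
    "http://example.test/b": '<a href="/a">A</a>',
}


def fake_fetch(url):
    """Stand-in for an HTTP fetch: returns stored HTML, or empty text if unknown."""
    return PAGES.get(url, "")


if __name__ == "__main__":
    for fetched in crawl("http://example.test/", fake_fetch):
        print(fetched)
```

Each page is fetched once even though the pages link to one another in a cycle, because the membership test on `seen` filters out URLs that were already discovered.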


ATAL
2004
Springer

Intelligent Agents

QueryTracker: An Agent for Tracking Persistent Information Needs

14 years 23 days ago

Download www.cs.colostate.edu

Most people have long term information interests. Current Web search engines satisfy immediate information needs. Specific sites support tracking of long term interests. We prese...

Gabriel Somlo, Adele E. Howe


CN
2006


A framework for mining evolving trends in Web data streams using dynamic learning and retrospective validation

13 years 7 months ago

Download webmining.spd.louisville.edu

The expanding and dynamic nature of the Web poses enormous challenges to most data mining techniques that try to extract patterns from Web data, such as Web usage and Web content....

Olfa Nasraoui, Carlos Rojas, Cesar Cardona


HCI
2009

Human Computer Interaction

User Reputation Evaluation Using Co-occurrence Feature and Collective Intelligence

13 years 5 months ago

Download cs.yonsei.ac.kr

It becomes more difficult to find valuable contents in the Web 2.0 environment since lots of inexperienced users provide many unorganized contents. In the previous researches, peop...

Jeong-Won Cha, Hyun-woo Lee, Yo-Sub Han, Laehyun K...


WEBI
2010
Springer

Internet Technology

A Scalable Indexing Mechanism for Ontology-Based Information Integration

13 years 5 months ago

Download www3.lehigh.edu

In recent years, there has been an explosion of publicly available RDF and OWL web pages. Some of these pages are static text files, while others are dynamically generated from la...

Yingjie Li, Abir Qasem, Jeff Heflin

