Search Sciweavers | Sciweavers

21

ICAPR
2005
Springer

130views Pattern Recognition» more ICAPR 2005»

Combining Text and Link Analysis for Focused Crawling

14 years 1 months ago

The number of vertical search engines and portals has rapidly increased over the last years, making the importance of a topic-driven (focused) crawler evident. In this paper, we de...

George Almpanidis, Constantine Kotropoulos

claim paper

Read More »

26

click to vote

NIPS
2000

155views Information Technology» more NIPS 2000»

The Missing Link - A Probabilistic Model of Document Content and Hypertext Connectivity

13 years 9 months ago

Download www.cs.cmu.edu

We describe a joint probabilistic model for modeling the contents and inter-connectivity of document collections such as sets of web pages or research paper archives. The model is...

David A. Cohn, Thomas Hofmann

claim paper

Read More »

23

click to vote

ERCIMDL
2003
Springer

106views Education» more ERCIMDL 2003»

Topical Crawling for Business Intelligence

14 years 1 months ago

Download dollar.biz.uiowa.edu

Abstract. The Web provides us with a vast resource for business intelligence. However, the large size of the Web and its dynamic nature make the task of foraging appropriate inform...

Gautam Pant, Filippo Menczer

claim paper

Read More »

23

click to vote

WWW
2007
ACM

131views Internet Technology» more WWW 2007»

Efficient Update of Indexes for Dynamically Changing Web Documents

14 years 8 months ago

Download www.cs.duke.edu

Recent work on incremental crawling has enabled the indexed document collection of a search engine to be more synchronized with the changing World Wide Web. However, this synchron...

Lipyeow Lim, Min Wang, Sriram Padmanabhan, Jeffrey...

claim paper

Read More »

27

click to vote

DEXAW
2010
IEEE

181views Database» more DEXAW 2010»

Towards a Search System for the Web Exploiting Spatial Data of a Web Document

13 years 9 months ago

Download laclavik.net

In this paper, we describe our work in progress in the scope of information retrieval exploiting the spatial data extracted from web documents. We discuss problems of a search for ...

Stefan Dlugolinsky, Michal Laclavik, Ladislav Hluc...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers