Search Sciweavers | Sciweavers

101 search results - page 9 / 21

» First-order focused crawling

121

click to vote

SIGIR
2008
ACM

104views Information Technology» more SIGIR 2008»

Compressed collections for simulated crawling

15 years 3 months ago

Download www.sigir.org

Collections are a fundamental tool for reproducible evaluation of information retrieval techniques. We describe a new method for distributing the document lengths and term counts ...

Alessio Orlandi, Sebastiano Vigna

claim paper

Read More »

219

Voted

ICDE
2006
IEEE

146views Database» more ICDE 2006»

Query Selection Techniques for Efficient Crawling of Structured Web Sources

16 years 4 months ago

Download research.microsoft.com

The high quality, structured data from Web structured sources is invaluable for many applications. Hidden Web databases are not directly crawlable by Web search engines and are on...

Ping Wu, Ji-Rong Wen, Huan Liu, Wei-Ying Ma

claim paper

Read More »

107

click to vote

IAT
2009
IEEE

95views Intelligent Agents» more IAT 2009»

Intelligent Crawling in Virtual Worlds

15 years 10 months ago

Download vw.ddns.uark.edu

—We present an intelligent agent crawler designed to collect user-generated content in Second Life and related virtual worlds. The agents navigate autonomously through the world ...

Josh Eno, Susan Gauch, Craig W. Thompson

claim paper

Read More »

228

click to vote

SIGMOD
2006
ACM

232views Database» more SIGMOD 2006»

To search or to crawl?: towards a query optimizer for text-centric tasks

16 years 3 months ago

Download pages.stern.nyu.edu

Text is ubiquitous and, not surprisingly, many important applications rely on textual data for a variety of tasks. As a notable example, information extraction applications derive...

Panagiotis G. Ipeirotis, Eugene Agichtein, Pranay ...

claim paper

Read More »

140

click to vote

WWW
2005
ACM

151views Internet Technology» more WWW 2005»

User-centric Web crawling

16 years 3 months ago

Download www2005.org

Search engines are the primary gateways of information access on the Web today. Behind the scenes, search engines crawl the Web to populate a local indexed repository of Web pages...

Sandeep Pandey, Christopher Olston

claim paper

Read More »

« Prev « First page 9 / 21 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers