Search Sciweavers | Sciweavers

472 search results - page 10 / 95

» Crawling the Hidden Web

ICDE
2006
IEEE

144 views · Database

Finding Thai Web Pages in Foreign Web Spaces

16 years 1 month ago

Download www.ieice.org

While the Web has been increasingly recognized as a culturally valuable social artifact, many nations endeavor to create national Web archives for long term preservation. However, ...

Kulwadee Somboonviwat, Takayuki Tamura, Masaru Kit...

ICDM
2006
IEEE

164 views · Data Mining

Unsupervised Learning of Tree Alignment Models for Information Extraction

16 years 1 month ago

Download users.soe.ucsc.edu

We propose an algorithm for extracting fields from HTML search results. The output of the algorithm is a database table, a data structure that better lends itself to high-level...

Philip Zigoris, Damian Eads, Yi Zhang

SIGIR
2008
ACM

104 views · Information Technology

Compressed collections for simulated crawling

15 years 7 months ago

Download www.sigir.org

Collections are a fundamental tool for reproducible evaluation of information retrieval techniques. We describe a new method for distributing the document lengths and term counts ...

Alessio Orlandi, Sebastiano Vigna

WWW
2005
ACM

138 views · Internet Technology

Crawling a country: better strategies than breadth-first for web page ordering

16 years 7 months ago

Download www.tejedoresdelweb.com

Ricardo A. Baeza-Yates, Carlos Castillo, Mauricio ...

WWW
2006
ACM

96 views · Internet Technology

What's really new on the web?: identifying new pages from a series of unstable web snapshots

16 years 7 months ago

Download www.tkl.iis.u-tokyo.ac.jp

Identifying and tracking new information on the Web is important in sociology, marketing, and survey research, since new trends might be apparent in the new information. Such chan...

Masashi Toyoda, Masaru Kitsuregawa
