Search Sciweavers | Sciweavers

AAAI
2007

166 views, Intelligent Agents

Mining Web Query Hierarchies from Clickthrough Data

15 years 9 months ago

In this paper, we propose to mine query hierarchies from clickthrough data, which is within the larger area of automatic acquisition of knowledge from the Web. When a user submits...

Dou Shen, Min Qin, Weizhu Chen, Qiang Yang, Zheng ...

CIDR
2009

129 views, Algorithms

Extracting and Querying a Comprehensive Web Database

15 years 8 months ago

Download turing.cs.washington.edu

Recent research in domain-independent information extraction holds the promise of an automatically-constructed structured database derived from the Web. A query system based on th...

Michael J. Cafarella

SIGIR
2008
ACM

176 views, Information Technology

SpotSigs: robust and efficient near duplicate detection in large web collections

15 years 7 months ago

Download ilpubs.stanford.edu

Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...

Martin Theobald, Jonathan Siddharth, Andreas Paepc...

WWW
2007
ACM

144 views, Internet Technology

Combining classifiers to identify online databases

16 years 8 months ago

Download www2007.org

We address the problem of identifying the domain of online databases. More precisely, given a set F of Web forms automatically gathered by a focused crawler and an online database...

Luciano Barbosa, Juliana Freire

ICIP
2000
IEEE

141 views, Image Processing

Efficient Video Similarity Measurement and Search

16 years 9 months ago

Download www.vis.uky.edu

We consider the use of meta-data and/or video-domain methods to detect similar videos on the web. Meta-data is extracted from the textual and hyperlink information associated with...

Sen-Ching S. Cheung, Avideh Zakhor
