Sciweavers

WWW 2007, ACM

First-order focused crawling

This paper reports a new general framework for focused web crawling based on "relational subgroup discovery". Predicates explicitly represent the relevance clues of unvisited pages in the crawl frontier, and first-order classification rules are then induced using a subgroup discovery technique. Learned relational rules with sufficient support and confidence subsequently guide the crawling process. We present the many interesting features of our proposed first-order focused crawler, together with promising preliminary experimental results.

Categories and Subject Descriptors: H.5.4 [Information interfaces and presentation]: Hypertext/hypermedia; I.2.6 [Artificial intelligence]: Learning

General Terms: Algorithms, performance, measurements
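
The abstract's idea of keeping only rules with sufficient support and confidence, then using them to guide the crawl, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a propositionalized simplification in which each frontier page is described by a set of ground predicate names, and it uses the association-rule definitions of support and confidence. The predicate names (`parent_relevant`, `anchor_on_topic`), the thresholds, and the example URLs are all hypothetical.

```python
from typing import Dict, FrozenSet, List, Tuple

# Assumption: a page is described by the set of ground predicates that hold for it.
Facts = FrozenSet[str]
# A labeled training example: (facts about the page, was the page relevant?)
Example = Tuple[Facts, bool]
# A kept rule: (rule body as a conjunction of predicates, its confidence)
ScoredRule = Tuple[Facts, float]


def support_confidence(body: Facts, examples: List[Example]) -> Tuple[float, float]:
    """Support = P(body holds and page relevant); confidence = P(relevant | body holds)."""
    covered = sum(1 for facts, _ in examples if body <= facts)
    true_pos = sum(1 for facts, relevant in examples if relevant and body <= facts)
    support = true_pos / len(examples) if examples else 0.0
    confidence = true_pos / covered if covered else 0.0
    return support, confidence


def select_rules(candidates: List[Facts], examples: List[Example],
                 min_support: float, min_confidence: float) -> List[ScoredRule]:
    """Keep only candidate rule bodies that meet both thresholds."""
    kept = []
    for body in candidates:
        support, confidence = support_confidence(body, examples)
        if support >= min_support and confidence >= min_confidence:
            kept.append((body, confidence))
    return kept


def score(facts: Facts, rules: List[ScoredRule]) -> float:
    """Score a frontier page by the most confident rule that covers it (0.0 if none)."""
    return max((conf for body, conf in rules if body <= facts), default=0.0)


def order_frontier(frontier: Dict[str, Facts], rules: List[ScoredRule]) -> List[str]:
    """Return frontier URLs, highest-scoring first."""
    return sorted(frontier, key=lambda url: score(frontier[url], rules), reverse=True)


# Hypothetical training data and candidate rule bodies.
examples: List[Example] = [
    (frozenset({"parent_relevant", "anchor_on_topic"}), True),
    (frozenset({"parent_relevant", "anchor_on_topic"}), True),
    (frozenset({"parent_relevant"}), False),
    (frozenset({"anchor_on_topic"}), True),
    (frozenset(), False),
]
candidates: List[Facts] = [
    frozenset({"parent_relevant"}),
    frozenset({"anchor_on_topic"}),
    frozenset({"parent_relevant", "anchor_on_topic"}),
]
rules = select_rules(candidates, examples, min_support=0.3, min_confidence=0.8)
```

A crawler built this way would call `order_frontier` each time it picks the next URL to fetch. A genuinely first-order version would replace the subset test with unification over relational facts (for example, facts linking a page to its parents), which this sketch deliberately leaves out.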

Qingyang Xu, Wanli Zuo

First-order Focused Crawler | Internet Technology | Relational Subgroup Discovery | Subgroup Discovery Technique | WWW 2007

Post Info

Added	21 Nov 2009
Updated	21 Nov 2009
Type	Conference
Year	2007
Where	WWW
Authors	Qingyang Xu, Wanli Zuo
