LAWEB 2006, IEEE

Analysis of Web Search Engine Clicked Documents

In this paper we process and analyze web search engine query and click data from the perspective of the documents (URLs) selected. We first define possible document categories and select descriptive variables to characterize the documents. The URL dataset is preprocessed and analyzed using traditional statistical methods, and then processed with the Kohonen SOM clustering technique [5], which we use to produce a two-level clustering. The clusters are interpreted in terms of the document categories and variables defined initially. We then apply the C4.5 [9] rule induction algorithm to produce a decision tree for the document category. The objective of the work is to apply a systematic data mining process to click data, contrasting unsupervised (Kohonen) and supervised (C4.5) methods to cluster and model the data, in order to identify document profiles that relate to theoretical user behavior and to document (URL) organization.
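The contrast between the unsupervised and supervised steps can be illustrated with a minimal sketch. It is not the authors' implementation: the two document variables (`clicks`, `hold_time`), the toy values, the category labels, and all function names are assumptions made up for illustration. The first part trains a tiny one-dimensional Kohonen map, a simplification of the two-level clustering in the abstract. The second part computes only the gain ratio that C4.5 uses to choose splits, not a full decision tree with pruning.

```python
import math
import random
from collections import Counter

# Hypothetical documents: [clicks, hold_time], both scaled to [0, 1].
DOCS = [[0.10, 0.15], [0.12, 0.10], [0.08, 0.20],
        [0.90, 0.85], [0.88, 0.92], [0.95, 0.80]]
# Hypothetical document categories (the supervised target).
CATEGORIES = ["reference", "reference", "reference",
              "commercial", "commercial", "commercial"]

def best_unit(units, x):
    """Index of the map unit whose weights are closest to x."""
    return min(range(len(units)),
               key=lambda j: sum((a - b) ** 2 for a, b in zip(units[j], x)))

def train_som(rows, n_units=2, epochs=50, lr0=0.5, seed=0):
    """Train a tiny 1-D Kohonen map and return its unit weight vectors."""
    rng = random.Random(seed)
    dim = len(rows[0])
    units = [[rng.random() for _ in range(dim)] for _ in range(n_units)]
    for epoch in range(epochs):
        frac = 1.0 - epoch / epochs
        lr = lr0 * frac                        # learning rate decays linearly
        radius = max(1, n_units // 2) * frac   # neighbourhood shrinks as well
        for x in rows:
            bmu = best_unit(units, x)
            for j, w in enumerate(units):
                d = abs(j - bmu)
                if d <= radius:
                    # Gaussian neighbourhood weight around the winning unit.
                    h = math.exp(-(d * d) / (2 * (radius + 1e-9) ** 2))
                    for k in range(dim):
                        w[k] += lr * h * (x[k] - w[k])
    return units

def entropy(values):
    """Shannon entropy (bits) of a list of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())

def gain_ratio(feature, labels):
    """Information gain of `feature` about `labels`, divided by split info."""
    n = len(labels)
    groups = {}
    for f, y in zip(feature, labels):
        groups.setdefault(f, []).append(y)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    split_info = entropy(feature)
    if split_info <= 0:
        return 0.0  # a feature with a single value cannot split the data
    return (entropy(labels) - conditional) / split_info

# Unsupervised step: assign each document to its closest map unit.
units = train_som(DOCS)
clusters = [best_unit(units, d) for d in DOCS]
# Supervised view: how well does cluster membership predict the category?
score = gain_ratio(clusters, CATEGORIES)
```

In this sketch the cluster index produced by the map is treated as one more discrete variable, and `score` indicates how informative it would be as a split on the document category. A full C4.5 run would evaluate every descriptive variable in this way, handle continuous attributes, and prune the resulting tree.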
Added 12 Jun 2010
Updated 12 Jun 2010
Type Conference
Year 2006
Where LAWEB
Authors David F. Nettleton, Liliana Calderón-Benavides, Ricardo A. Baeza-Yates