Sciweavers

IIS
2004

Conceptual Clustering Using Lingo Algorithm: Evaluation on Open Directory Project Data

14 years 25 days ago
Conceptual Clustering Using Lingo Algorithm: Evaluation on Open Directory Project Data
Search results clustering problem is defined as an automatic, on-line grouping of similar documents in a search hits list, returned from a search engine. In this paper we present the results of an experimental evaluation of a new algorithm named Lingo. We use Open Directory Project as a source of high-quality narrowtopic document references and mix them into several multi-topic test sets for the algorithm. We then compare the clusters acquired from Lingo to the expected set of ODP categories mixed in the input. Finally we discuss observations from the experiment, highlighting the algorithm's strengths and weaknesses and conclude with research directions for the future.
Stanislaw Osinski, Dawid Weiss
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2004
Where IIS
Authors Stanislaw Osinski, Dawid Weiss
Comments (0)