
ICADL
2005
Springer

A Method for Creating a High Quality Collection of Researchers' Homepages from the Web

This paper proposes a method for creating a high quality collection of researchers’ homepages. The proposed method consists of three phases: rough filtering of the possible web pages, accurate evaluation of the web pages and precise selection of the correct homepages. For the rough filtering, the authors first define content-based keyword-lists, then generate filtering rules and relax the rules with heuristics. For the evaluation and the selection, they use a support vector machine with the feature sets derived from the content words of the web pages and propose an approach utilizing web-specific properties for improving the measures.

Keywords: Web Mining, Web Information Retrieval, Machine Learning, Web Page Classification
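
The rough-filtering phase described in the abstract (keyword lists, filtering rules, rule relaxation) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption for illustration only: the keyword lists, the `min_lists` threshold, the function names and the example URLs are hypothetical and are not taken from the paper. The later evaluation and selection phases, which the abstract says use a support vector machine, are not covered.

```python
import re

# Hypothetical keyword lists; the abstract does not give the paper's actual lists.
KEYWORD_LISTS = {
    "role": {"professor", "researcher", "lecturer", "phd"},
    "activity": {"publications", "research", "papers", "projects"},
    "affiliation": {"university", "laboratory", "institute", "department"},
}


def matched_lists(text):
    """Return the names of keyword lists that have at least one word in the text."""
    words = set(re.findall(r"[a-z]+", text.lower()))
    return {name for name, keywords in KEYWORD_LISTS.items() if words & keywords}


def rough_filter(pages, min_lists=2):
    """Keep pages whose text hits at least `min_lists` keyword lists.

    Lowering `min_lists` relaxes the rule (an assumed stand-in for the
    heuristic relaxation mentioned in the abstract): more pages pass.
    """
    return [url for url, text in pages.items()
            if len(matched_lists(text)) >= min_lists]


# Toy input: URL -> page text (made-up examples).
pages = {
    "http://example.org/~alice": "Alice is a professor at Example University. Publications and projects.",
    "http://example.org/shop": "Buy shoes online. Free shipping.",
    "http://example.org/~bob": "Bob's page. Research notes.",
}

strict = rough_filter(pages, min_lists=2)   # stricter rule
relaxed = rough_filter(pages, min_lists=1)  # relaxed rule
print(strict)
print(relaxed)
```

In this toy setup the relaxed rule lets through a page that the strict rule rejects, which is the kind of trade-off a later, more accurate classification phase would then have to resolve.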
Yuxin Wang, Keizo Oyama
Added 27 Jun 2010
Updated 27 Jun 2010
Type Conference
Year 2005
Where ICADL
Authors Yuxin Wang, Keizo Oyama