Sciweavers

» Using Keyword Extraction for Web Site Clustering
PKDD
2007
Springer
14 years 3 months ago
Site-Independent Template-Block Detection
Detection of template and noise blocks in web pages is an important step in improving the performance of information retrieval and content extraction. Of the many approaches propos...
Aleksander Kolcz, Wen-tau Yih
IJCAI
2007
13 years 11 months ago
An Improved Probabilistic Ant based Clustering for Distributed Databases
In this paper we present an improved version of the Probabilistic Ant based Clustering Algorithm for Distributed Databases (PACE). The most important feature of this algorithm is ...
Chandrasekar Ramachandran, Thanukrishnan Srinivasa...
IADIS
2008
13 years 11 months ago
Obtaining User Profiles Via Web Usage Mining
In this paper, we present a model to obtain and analyze user profiles after a process of web usage mining where log files are processed. The web log files register the activity of...
Maria J. Martín-Bautista, María Ampa...
WWW
2009
ACM
14 years 2 months ago
Extracting data records from the web using tag path clustering
Fully automatic methods that extract lists of objects from the Web have been studied extensively. Record extraction, the first step of this object extraction process, identifies...
Gengxin Miao, Jun'ichi Tatemura, Wang-Pin Hsiung, ...
KDD
2002
ACM
14 years 10 months ago
Web site mining: a new way to spot competitors, customers and suppliers in the world wide web
When automatically extracting information from the world wide web, most established methods focus on spotting single HTML documents. However, the problem of spotting complete web s...
Martin Ester, Hans-Peter Kriegel, Matthias Schuber...