Sciweavers

ICDM
2002
IEEE
191views Data Mining» more  ICDM 2002»
14 years 1 months ago
Iterative Clustering of High Dimensional Text Data Augmented by Local Search
The k-means algorithm with cosine similarity, also known as the spherical k-means algorithm, is a popular method for clustering document collections. However, spherical k-means ca...
Inderjit S. Dhillon, Yuqiang Guan, J. Kogan
Neighborgram Clustering: Interactive Exploration of Cluster Neighborhoods
Proceedings of IEEE Data Mining, IEEE Press, pp. 581-584, 2002. We describe an interactive way to generate a set of clusters for a given data set. The clustering is done by constr...
Michael R. Berthold, Bernd Wiswedel, David E. Patt...
Extraction Techniques for Mining Services from Web Sources
The Web has established itself as the dominant medium for electronic commerce. Consequently, the number of service providers, both large and small, advertising their services...
Hasan Davulcu, Saikat Mukherjee, I. V. Ramakrishna...
Generating an informative cover for association rules
Mining association rules may generate a large number of rules, making the results hard to analyze manually. Pasquier et al. have discussed the generation of Guigues-Duquenne–Luxe...
Laurentiu Cristofor, Dan A. Simovici
Online Algorithms for Mining Semi-structured Data Stream
In this paper, we study an online data mining problem from streams of semi-structured data such as XML data. Modeling semi-structured data and patterns as labeled ordered trees, w...
Tatsuya Asai, Hiroki Arimura, Kenji Abe, Shinji Ka...
Speed-up Iterative Frequent Itemset Mining with Constraint Changes
Mining of frequent itemsets is a fundamental data mining task. Past research has proposed many efficient algorithms for the purpose. Recent work also highlighted the importance of...
Gao Cong, Bing Liu
Unsupervised Segmentation of Categorical Time Series into Episodes
This paper describes an unsupervised algorithm for segmenting categorical time series into episodes. The VOTING-EXPERTS algorithm first collects statistics about the frequency an...
Paul R. Cohen, Brent Heeringa, Niall M. Adams
Evolutionary Time Series Segmentation for Stock Data Mining
Korris Fu-Lai Chung, Tak-Chung Fu, Robert W. P. Lu...
Towards Automatic Generation of Query Taxonomy: A Hierarchical Query Clustering Approach
Previous work on automatic query clustering mostly generates a flat, un-nested partition of query terms. In this work, we aim to organize query terms into a hierarchical s...
Shui-Lung Chuang, Lee-Feng Chien
Learning with Progressive Transductive Support Vector Machine
The support vector machine (SVM) is a new learning method developed in recent years based on the foundations of statistical learning theory. By taking a transductive approach instead ...
Yisong Chen, Guoping Wang, Shihai Dong