Sciweavers

KDD
2005
ACM
160views Data Mining» more  KDD 2005»
14 years 12 months ago
Consistent bipartite graph co-partitioning for star-structured high-order heterogeneous data co-clustering
Heterogeneous data co-clustering has attracted more and more attention in recent years due to its high impact on various applications. While the co-clustering algorithms for two t...
Bin Gao, Tie-Yan Liu, Xin Zheng, QianSheng Cheng, ...
KDD
2005
ACM
117views Data Mining» more  KDD 2005»
14 years 12 months ago
Rule extraction from linear support vector machines
We describe an algorithm for converting linear support vector machines and any other arbitrary hyperplane-based linear classifiers into a set of non-overlapping rules that, unlike...
Glenn Fung, Sathyakama Sandilya, R. Bharat Rao
KDD
2005
ACM
193views Data Mining» more  KDD 2005»
14 years 12 months ago
An approach to spacecraft anomaly detection problem using kernel feature space
Development of advanced anomaly detection and failure diagnosis technologies for spacecraft is a quite significant issue in the space industry, because the space environment is ha...
Ryohei Fujimaki, Takehisa Yairi, Kazuo Machida
KDD
2005
ACM
157views Data Mining» more  KDD 2005»
14 years 12 months ago
A fast kernel-based multilevel algorithm for graph clustering
Graph clustering (also called graph partitioning) -- clustering the nodes of a graph -- is an important problem in diverse data mining applications. Traditional approaches involve...
Inderjit S. Dhillon, Yuqiang Guan, Brian Kulis
KDD
2005
ACM
166views Data Mining» more  KDD 2005»
14 years 12 months ago
A general model for clustering binary data
Clustering is the problem of identifying the distribution of patterns and intrinsic correlations in large data sets by partitioning the data points into similarity classes. This p...
Tao Li
KDD
2005
ACM
182views Data Mining» more  KDD 2005»
14 years 12 months ago
Making holistic schema matching robust: an ensemble approach
The Web has been rapidly "deepened" by myriad searchable databases online, where data are hidden behind query interfaces. As an essential task toward integrating these m...
Bin He, Kevin Chen-Chuan Chang
KDD
2005
ACM
90views Data Mining» more  KDD 2005»
14 years 12 months ago
Variable latent semantic indexing
Anirban Dasgupta, Ravi Kumar, Prabhakar Raghavan, ...
KDD
2005
ACM
124views Data Mining» more  KDD 2005»
14 years 12 months ago
Scalable discovery of hidden emails from large folders
The popularity of email has triggered researchers to look for ways to help users better organize the enormous amount of information stored in their email folders. One challenge th...
Giuseppe Carenini, Raymond T. Ng, Xiaodong Zhou
KDD
2005
ACM
170views Data Mining» more  KDD 2005»
14 years 12 months ago
Parallel mining of closed sequential patterns
Discovery of sequential patterns is an essential data mining task with broad applications. Among several variations of sequential patterns, closed sequential pattern is the most u...
Shengnan Cong, Jiawei Han, David A. Padua
KDD
2005
ACM
141views Data Mining» more  KDD 2005»
14 years 12 months ago
Fast window correlations over uncooperative time series
Richard Cole, Dennis Shasha, Xiaojian Zhao