PAKDD
2004
ACM
Separating Structure from Interestingness
Condensed representations of pattern collections have been recognized to be important building blocks of inductive databases, a promising theoretical framework for data mining, and...
Taneli Mielikäinen
PAKDD
2004
ACM
Secure Association Rule Sharing
The sharing of association rules is often beneficial in industry, but requires privacy safeguards. One may decide to disclose only part of the knowledge and conceal stra...
Stanley R. M. Oliveira, Osmar R. Zaïane, Yü...
PAKDD
2004
ACM
Compact Dual Ensembles for Active Learning
Generic ensemble methods can achieve excellent learning performance, but are not good candidates for active learning because of their different design purposes. We investigate how...
Amit Mandvikar, Huan Liu, Hiroshi Motoda
PAKDD
2004
ACM
Constraint-Based Graph Clustering through Node Sequencing and Partitioning
This paper proposes a two-step graph partitioning method to discover constrained clusters with an objective function that follows the well-known minmax clustering principle. Compar...
Yu Qian, Kang Zhang, Wei Lai
PAKDD
2004
ACM
Spectral Energy Minimization for Semi-supervised Learning
The use of unlabeled data to aid classification is important as labeled data is often available in limited quantity. Instead of utilizing training samples directly into semi-super...
Chun Hung Li, Zhi-Li Wu
PAKDD
2004
ACM
A Tree-Based Approach to the Discovery of Diagnostic Biomarkers for Ovarian Cancer
Computational diagnosis of cancer is a classification problem, and it has two special requirements on a learning algorithm: perfect accuracy and small number of features used in t...
Jinyan Li, Kotagiri Ramamohanarao
PAKDD
2004
ACM
Providing Diversity in K-Nearest Neighbor Query Results
Given a point query Q in multi-dimensional space, K-Nearest Neighbor (KNN) queries return the K closest answers in the database with respect to Q. In this scenario, it is...
Anoop Jain, Parag Sarda, Jayant R. Haritsa
PAKDD
2004
ACM
Mining of Web-Page Visiting Patterns with Continuous-Time Markov Models
This paper presents a new prediction model for predicting when an online customer leaves a current page and which next Web page the customer will visit. The model can forecast the ...
Qiming Huang, Qiang Yang, Joshua Zhexue Huang, Mic...
PAKDD
2004
ACM
Extracting and Explaining Biological Knowledge in Microarray Data
Paul J. Kennedy, Simeon J. Simoff, David B. Skilli...
PAKDD
2004
ACM
Clustering Multi-represented Objects with Noise
Traditional clustering algorithms are based on one representation space, usually a vector space. However, in a variety of modern applications, multiple representations ex...
Karin Kailing, Hans-Peter Kriegel, Alexey Pryakhin...