Sciweavers

ICDM
2006
IEEE
116views Data Mining» more  ICDM 2006»
14 years 2 months ago
Improving Personalization Solutions through Optimal Segmentation of Customer Bases
On the Web, where the search costs are low and the competition is just a mouse click away, it is crucial to segment the customers intelligently in order to offer more targeted and...
Tianyi Jiang, Alexander Tuzhilin
ICDM
2006
IEEE
Entropy-based Concept Shift Detection
When monitoring sensory data (e.g., from a wearable device) the context oftentimes changes abruptly: people move from one situation (e.g., working quietly in their office) to ano...
Peter Vorburger, Abraham Bernstein
ICDM
2006
IEEE
What is the Dimension of Your Binary Data?
Many 0/1 datasets have a very large number of variables; however, they are sparse and the dependency structure of the variables is simpler than the number of variables would sugge...
Nikolaj Tatti, Taneli Mielikäinen, Aristides ...
ICDM
2006
IEEE
TRIAS - An Algorithm for Mining Iceberg Tri-Lattices
In this paper, we present the foundations for mining frequent tri-concepts, which extend the notion of closed itemsets to three-dimensional data to allow for mining folksonomies. ...
Robert Jäschke, Andreas Hotho, Christoph Schm...
ICDM
2006
IEEE
Decision Trees for Functional Variables
Classification problems with functionally structured input variables arise naturally in many applications. In a clinical domain, for example, input variables could include a time...
Suhrid Balakrishnan, David Madigan
ICDM
2006
IEEE
Exploratory Under-Sampling for Class-Imbalance Learning
Under-sampling is a class-imbalance learning method which uses only a subset of major class examples and thus is very efficient. The main deficiency is that many major class exa...
Xu-Ying Liu, Jianxin Wu, Zhi-Hua Zhou
ICDM
2006
IEEE
Active Learning to Maximize Area Under the ROC Curve
In active learning, a machine learning algorithm is given an unlabeled set of examples U, and is allowed to request labels for a relatively small subset of U to use for training. ...
Matt Culver, Kun Deng, Stephen D. Scott
ICDM
2006
IEEE
Optimal k-Anonymity with Flexible Generalization Schemes through Bottom-up Searching
In recent years, a major thread of research on k-anonymity has focused on developing more flexible generalization schemes that produce higher-quality datasets. In this paper we in...
Tiancheng Li, Ninghui Li
ICDM
2006
IEEE
P3C: A Robust Projected Clustering Algorithm
Gabriela Moise, Jörg Sander, Martin Ester
ICDM
2006
IEEE
Window-based Tensor Analysis on High-dimensional and Multi-aspect Streams
Data stream values are often associated with multiple aspects. For example, each value from environmental sensors may have an associated type (e.g., temperature, humidity, etc.) as...
Jimeng Sun, Spiros Papadimitriou, Philip S. Yu