Sciweavers

732 search results - page 68 / 147
» Mining Postal Addresses
Sort
View
ICDM
2006
IEEE
193views Data Mining» more  ICDM 2006»
14 years 1 months ago
Local Correlation Tracking in Time Series
We address the problem of capturing and tracking local correlations among time evolving time series. Our approach is based on comparing the local auto-covariance matrices (via the...
Spiros Papadimitriou, Jimeng Sun, Philip S. Yu
ICDM
2005
IEEE
133views Data Mining» more  ICDM 2005»
14 years 1 months ago
Summarization - Compressing Data into an Informative Representation
In this paper, we formulate the problem of summarization of a dataset of transactions with categorical attributes as an optimization problem involving two objective functions - co...
Varun Chandola, Vipin Kumar
ICDM
2005
IEEE
146views Data Mining» more  ICDM 2005»
14 years 1 months ago
Merging Interface Schemas on the Deep Web via Clustering Aggregation
We consider the problem of integrating a large number of interface schemas over the Deep Web, The scale of the problem and the diversity of the sources present serious challenges ...
Wensheng Wu, AnHai Doan, Clement T. Yu
ICIC
2005
Springer
14 years 1 months ago
Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning
In recent years, mining with imbalanced data sets receives more and more attentions in both theoretical and practical aspects. This paper introduces the importance of imbalanced da...
Hui Han, Wenyuan Wang, Binghuan Mao
KDD
1998
ACM
181views Data Mining» more  KDD 1998»
14 years 3 days ago
Approaches to Online Learning and Concept Drift for User Identification in Computer Security
The task in the computer security domain of anomaly detection is to characterize the behaviors of a computer user (the `valid', or `normal' user) so that unusual occurre...
Terran Lane, Carla E. Brodley