Sciweavers

132 search results - page 6 / 27
» Handling of incomplete data sets using ICA and SOM in data m...
Sort
View
ICDM
2009
IEEE
200views Data Mining» more  ICDM 2009»
13 years 5 months ago
Improving SVM Classification on Imbalanced Data Sets in Distance Spaces
Abstract--Imbalanced data sets present a particular challenge to the data mining community. Often, it is the rare event that is of interest and the cost of misclassifying the rare ...
Suzan Koknar-Tezel, Longin Jan Latecki
ICDE
2009
IEEE
132views Database» more  ICDE 2009»
14 years 9 months ago
Using Anonymized Data for Classification
In recent years, anonymization methods have emerged as an important tool to preserve individual privacy when releasing privacy sensitive data sets. This interest in anonymization t...
Ali Inan, Murat Kantarcioglu, Elisa Bertino
SIGMOD
2000
ACM
212views Database» more  SIGMOD 2000»
14 years 5 days ago
SQLEM: Fast Clustering in SQL using the EM Algorithm
Clustering is one of the most important tasks performed in Data Mining applications. This paper presents an e cient SQL implementation of the EM algorithm to perform clustering in...
Carlos Ordonez, Paul Cereghini
ICRA
2010
IEEE
185views Robotics» more  ICRA 2010»
13 years 5 months ago
Heteroscedastic Gaussian processes for data fusion in large scale terrain modeling
This paper presents a novel approach to data fusion for stochastic processes that model spatial data. It addresses the problem of data fusion in the context of large scale terrain ...
Shrihari Vasudevan, Fabio T. Ramos, Eric Nettleton...
CIKM
2009
Springer
14 years 2 months ago
Frequent subgraph pattern mining on uncertain graph data
Graph data are subject to uncertainties in many applications due to incompleteness and imprecision of data. Mining uncertain graph data is semantically different from and computat...
Zhaonian Zou, Jianzhong Li, Hong Gao, Shuo Zhang