How efficient is estimation with missing data?

In this paper, we present a new evaluation approach for missing data techniques (MDTs), in which their efficiency is investigated using the listwise deletion method as a reference. We experiment on classification problems and calculate misclassification rates (MR) for different missing data percentages (MDP) using a missing completely at random (MCAR) scheme. We compare three MDTs: pairwise deletion (PW), mean imputation (MI) and a maximum likelihood method that we call complete expectation maximization (CEM). We use a synthetic dataset, the Iris dataset and the Pima Indians Diabetes dataset. We train a Gaussian mixture model (GMM) and test it in two cases, in which the test dataset either has missing values or is complete. The results show that CEM is the most efficient method in both cases, while MI is the worst performer of the three. PW and CEM prove to be more stable than MI, in particular for higher MDP values.
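To make the terms in the abstract concrete, the following is a minimal sketch, not the authors' implementation, of an MCAR masking step together with two of the simpler treatments: listwise deletion and mean imputation. The function names (`mcar_mask`, `listwise_delete`, `column_means`, `mean_impute`) and the use of `None` to mark a missing entry are assumptions made for illustration. `column_means` uses every observed value in a column, which is in the spirit of pairwise (available-case) deletion for first moments only; the GMM training and the CEM method from the paper are not reproduced here.

```python
import random


def mcar_mask(rows, mdp, seed=0):
    """Return a copy of rows in which each entry is replaced by None
    with probability mdp, independently of all values (MCAR)."""
    rng = random.Random(seed)
    return [[None if rng.random() < mdp else v for v in row] for row in rows]


def listwise_delete(rows):
    """Keep only the rows that have no missing entries."""
    return [row for row in rows if all(v is not None for v in row)]


def column_means(rows):
    """Per-column mean over observed entries only (available-case).
    A column with no observed entries falls back to 0.0 (an arbitrary choice)."""
    means = []
    for j in range(len(rows[0])):
        observed = [row[j] for row in rows if row[j] is not None]
        means.append(sum(observed) / len(observed) if observed else 0.0)
    return means


def mean_impute(rows):
    """Replace each missing entry with the observed mean of its column."""
    means = column_means(rows)
    return [[means[j] if v is None else v for j, v in enumerate(row)]
            for row in rows]


# Hypothetical toy data: four samples with two features each.
data = [[5.1, 3.5], [4.9, 3.0], [6.2, 2.9], [5.9, 3.0]]
masked = mcar_mask(data, mdp=0.3, seed=1)
complete_cases = listwise_delete(masked)  # reference treatment
imputed = mean_impute(masked)             # MI treatment
```

Listwise deletion discards every sample with any missing feature, so the number of usable samples shrinks quickly as the MDP grows, whereas mean imputation keeps all samples but pulls the imputed entries toward the column mean.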

Seliz G. Karadogan, Letizia Marchegiani, Lars Kai

Complete Expectation Maximization | Dataset | ICASSP 2011 | Listwise Deletion Method | Signal Processing

» Efficient Methods for Dealing with Missing Data in Supervised Learning

» Data envelopment analysis with missing values An interval DEA approach

» DEMS a data mining based technique to handle missing data in mobile sensor network applica...

» Nonlinear TimeSeries Prediction with Missing and Noisy Data

» How accurate are the time delay estimates in gravitational lensing

» Estimating Probabilities in Recommendation Systems

» Physically Consistent and Efficient Variational Denoising of Image Fluid Flow Estimates

» A New Hardware Monitor Design to Measure Data StructureSpecific Cache Eviction Information

Post Info

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Seliz G. Karadogan, Letizia Marchegiani, Lars Kai Hansen, Jan Larsen

Sciweavers
