A Fast Multivariate Nearest Neighbour Imputation Algorithm

14 years 19 days ago

Download www.cit.sunderland.ac.uk

— Imputation of missing data is important in many areas, such as reducing non-response bias in surveys and maintaining medical documentation. Nearest neighbour (NN) imputation algorithms replace the missing values within any particular observation by taking copies of the corresponding known values from the most similar observation found in the dataset. However, when NN algorithms are executed against large multivariate datasets the poor performance (program execution speed) of these algorithms can present major practical problems. We argue that these problems have not been sufficiently addressed, and we present a fast NN imputation algorithm that can employ any method for measuring the similarity between observations. The algorithm has been designed for the imputation of missing values in large multivariate datasets that contain many different missingness patterns with large proportions of missing data. The ideas underpinning the algorithm are explained in detail, and experiments are...

Norman Solomon, Giles Oatley, Kenneth McGarry

Real-time Traffic

Algorithm | Imputation Algorithm | Large Multivariate Datasets | WCE 2007 | WCE 2008 |

claim paper

Post Info
More Details (n/a)

Added	07 Nov 2010
Updated	07 Nov 2010
Type	Conference
Year	2007
Where	WCE
Authors	Norman Solomon, Giles Oatley, Kenneth McGarry

Comments (0)

Sciweavers

A Fast Multivariate Nearest Neighbour Imputation Algorithm

Algorithm | Imputation Algorithm | Large Multivariate Datasets | WCE 2007 | WCE 2008 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers