A Distributed Kernel Summation Framework for General-Dimension Machine Learning

12 years 9 months ago

Download www.cc.gatech.edu

Kernel summations are a ubiquitous key computational bottleneck in many data analysis methods. In this paper, we attempt to marry, for the ﬁrst time, the best relevant techniques in parallel computing, where kernel summations are in low dimensions, with the best general-dimension algorithms from the machine learning literature. We provide the ﬁrst distributed implementation of kernel summation framework that can utilize: 1) various types of deterministic and probabilistic approximations that may be suitable for low and high-dimensional problems with a large number of data points; 2) any multi-dimensional binary tree using both distributed memory and shared memory parallelism; 3) a dynamic load balancing scheme to adjust work imbalances during the computation. Our hybrid MPI/OpenMP codebase has wide applicability in providing a general framework to accelerate the computation of many popular machine learning methods. Our experiments show scalability results for kernel density estima...

Dongryeol Lee, Richard W. Vuduc, Alexander G. Gray

Real-time Traffic

Data Mining | Digital Sky Survey | Kernel Density Estimation | SDM 2012 | Sky Survey Data |

claim paper

Post Info
More Details (n/a)

Added	29 Sep 2012
Updated	29 Sep 2012
Type	Journal
Year	2012
Where	SDM
Authors	Dongryeol Lee, Richard W. Vuduc, Alexander G. Gray

Comments (0)

Sciweavers

A Distributed Kernel Summation Framework for General-Dimension Machine Learning

Data Mining | Digital Sky Survey | Kernel Density Estimation | SDM 2012 | Sky Survey Data |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers