Massive-Scale Kernel Discriminant Analysis: Mining for Quasars

We describe a fast algorithm for kernel discriminant analysis and empirically demonstrate an asymptotic speed-up over the previous best approach. The speed-up has two sources. The first is a new pattern of processing data stored in hierarchical trees, which incurs low overhead while pruning unnecessary work once classification results can be shown. The second is the use of the Epanechnikov kernel, which allows additional pruning between portions of data shown to be far apart or very near each other. Further, our algorithm can share work between multiple simultaneous bandwidth computations, providing a rudimentary but quick and effective means of bandwidth optimization. We apply a parallelized implementation of our algorithm to a large data set (40 million points in 4D) from the Sloan Digital Sky Survey, identifying approximately one million quasars with high accuracy. The resulting catalog is ten times the size of the previous largest catalog of quasars.
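To make the classification rule concrete, here is a minimal sketch of kernel discriminant analysis with an Epanechnikov kernel. It is an exhaustive, unoptimized baseline, not the tree-based algorithm the abstract describes, and the function names (`epanechnikov`, `class_score`, `kda_classify`) are hypothetical. It assumes one bandwidth per class and drops the kernel's normalizing constant, which depends only on the dimension and so is the same for every class.

```python
def epanechnikov(u2):
    """Unnormalized Epanechnikov kernel, given the squared scaled distance u2 = (dist / h)**2.

    The kernel is exactly zero once u2 >= 1, i.e. for points farther than
    one bandwidth away. That finite support is what makes pruning of
    far-apart data possible.
    """
    return max(0.0, 1.0 - u2)


def class_score(query, points, h, prior):
    """Prior-weighted kernel density estimate of one class at `query`."""
    dim = len(query)
    total = 0.0
    for p in points:
        sq_dist = sum((q - x) ** 2 for q, x in zip(query, p))
        total += epanechnikov(sq_dist / (h * h))
    # Dividing by h**dim keeps densities comparable when bandwidths differ.
    return prior * total / (len(points) * h ** dim)


def kda_classify(query, class_points, bandwidths, priors):
    """Return the index of the class with the largest score at `query`.

    Ties (including the case where every score is zero) go to the
    lowest class index.
    """
    scores = [
        class_score(query, pts, h, prior)
        for pts, h, prior in zip(class_points, bandwidths, priors)
    ]
    return max(range(len(scores)), key=scores.__getitem__)


# Toy 2D data (made up for illustration): two well-separated classes.
class_a = [[0.0, 0.0], [0.2, 0.0], [0.0, 0.2]]
class_b = [[5.0, 5.0], [5.2, 5.0], [5.0, 5.2]]
label = kda_classify([0.1, 0.1], [class_a, class_b], [1.0, 1.0], [0.5, 0.5])
```

In this sketch every query visits every training point. In the toy call, each point of `class_b` lies more than one bandwidth from the query and contributes exactly zero. A tree-based method could skip such a group of points without visiting each one.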

Ryan Riegel, Alexander Gray, Gordon Richards

Algorithm | Data Mining | Kernel Discriminant Analysis | SDM 2008 | Simultaneous Bandwidth Computations

» Dimensionality Reduction Using Kernel Pooled Local Discriminant Information

» Nonsparse Multiple Kernel Learning for Fisher Discriminant Analysis

» Constructing Nonlinear Discriminants from Multiple Data Views

» Hierarchical Linear Discriminant Analysis for Beamforming

Post Info

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2008
Where	SDM
Authors	Ryan Riegel, Alexander Gray, Gordon Richards
