Very sparse random projections



There has been considerable interest in random projections, an approximate algorithm for estimating distances between pairs of points in a high-dimensional vector space. Let A ∈ ℝ^(n×D) be our n points in D dimensions. The method multiplies A by a random matrix R ∈ ℝ^(D×k), reducing the D dimensions down to just k to speed up the computation. The entries of R are typically standard normal, N(0, 1). It is well known that random projections preserve pairwise distances (in expectation). Achlioptas proposed sparse random projections by replacing the N(0, 1) entries in R with entries in {-1, 0, 1} with probabilities {1/6, 2/3, 1/6}, achieving a threefold speedup in processing time. We recommend using R with entries in {-1, 0, 1} with probabilities {1/(2√D), 1 - 1/√D, 1/(2√D)} to achieve a significant √D-fold speedup, with little loss in accuracy.

Categories and Subject Descriptors: H.2.8 [Database Applications]: Data Mining
General Terms: Algorithms, Performance, Theory
Keywords: Rando...
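The sampling scheme in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names `sparse_projection_matrix` and `project` are hypothetical, and the sparsity parameter `s` is a convenience so that s = 3 gives the Achlioptas probabilities {1/6, 2/3, 1/6} and s = √D gives the very sparse ones. The abstract does not state any scaling; the sketch assumes each nonzero entry is multiplied by √s (giving unit variance) and the projected vector is divided by √k, so that squared distances are preserved in expectation.

```python
import math
import random


def sparse_projection_matrix(D, k, s, rng):
    """Return a D x k matrix (list of rows) with entries in sqrt(s) * {+1, 0, -1}.

    Each entry is +1 or -1 with probability 1/(2s) each and 0 with
    probability 1 - 1/s. The sqrt(s) factor is an assumption of this
    sketch; it gives every entry zero mean and unit variance.
    """
    scale = math.sqrt(s)
    p = 1.0 / (2.0 * s)
    R = []
    for _ in range(D):
        row = []
        for _ in range(k):
            u = rng.random()
            if u < p:
                row.append(scale)
            elif u < 2.0 * p:
                row.append(-scale)
            else:
                row.append(0.0)
        R.append(row)
    return R


def project(x, R):
    """Map a length-D vector x to k dimensions as (1/sqrt(k)) * x R."""
    k = len(R[0])
    out = [0.0] * k
    for xi, row in zip(x, R):
        for j, r in enumerate(row):
            if r != 0.0:  # zero entries contribute nothing, which is the source of the speedup
                out[j] += xi * r
    return [v / math.sqrt(k) for v in out]


def squared_distance(a, b):
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


if __name__ == "__main__":
    rng = random.Random(0)
    D, k = 400, 200
    R = sparse_projection_matrix(D, k, s=math.sqrt(D), rng=rng)
    x = [rng.gauss(0.0, 1.0) for _ in range(D)]
    y = [rng.gauss(0.0, 1.0) for _ in range(D)]
    print("true squared distance:     ", squared_distance(x, y))
    print("estimated squared distance:", squared_distance(project(x, R), project(y, R)))
```

With s = √D only about a 1/√D fraction of the entries of R is nonzero, so a sparse representation of R would touch correspondingly fewer entries per projected point; the dense list-of-lists layout here is chosen only for readability.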

Ping Li, Trevor Hastie, Kenneth Ward Church


Data Mining | KDD 2006 | Random Projections | Sparse Random Projections


Post Info

Added	30 Nov 2009
Updated	30 Nov 2009
Type	Conference
Year	2006
Where	KDD
Authors	Ping Li, Trevor Hastie, Kenneth Ward Church
