Large-scale deep unsupervised learning using graphics processors

15 years 3 months ago

Download ai.stanford.edu

The promise of unsupervised learning methods lies in their potential to use vast amounts of unlabeled data to learn complex, highly nonlinear models with millions of free parameters. We consider two well-known unsupervised learning models, deep belief networks (DBNs) and sparse coding, that have recently been applied to a flurry of machine learning applications (Hinton & Salakhutdinov, 2006; Raina et al., 2007). Unfortunately, current learning algorithms for both models are too slow for large-scale applications, forcing researchers to focus on smaller-scale models, or to use fewer training examples. In this paper, we suggest massively parallel methods to help resolve these problems. We argue that modern graphics processors far surpass the computational capabilities of multicore CPUs, and have the potential to revolutionize the applicability of deep unsupervised learning methods. We develop general principles for massively parallelizing unsupervised learning tasks using graphics pr...

Rajat Raina, Anand Madhavan, Andrew Y. Ng

Real-time Traffic

Deep Unsupervised Learning | ICML 2009 | Machine Learning | Unsupervised Learning Methods | Unsupervised Learning Models |

claim paper

Post Info
More Details (n/a)

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2009
Where	ICML
Authors	Rajat Raina, Anand Madhavan, Andrew Y. Ng

Comments (0)

Sciweavers

Large-scale deep unsupervised learning using graphics processors

Deep Unsupervised Learning | ICML 2009 | Machine Learning | Unsupervised Learning Methods | Unsupervised Learning Models |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers