Feature hashing for large scale multitask learning

15 years 9 months ago

Download www.cs.mcgill.ca

Empirical evidence suggests that hashing is an effective strategy for dimensionality reduction and practical nonparametric estimation. In this paper we provide exponential tail bounds for feature hashing and show that the interaction between random subspaces is negligible with high probability. We demonstrate the feasibility of this approach with experimental results for a new use case — multitask learning with hundreds of thousands of tasks.

Kilian Q. Weinberger, Anirban Dasgupta, John Langf

Real-time Traffic

Exponential Tail Bounds | Feature Hashing | ICML 2009 | Machine Learning | Practical Nonparametric Estimation |

claim paper

» Learning Forgiving Hash Functions Algorithms and Large Scale Tests

» Joint Feature Selection in Distributed Stochastic Learning for LargeScale Discriminative T...

» Boosting MultiTask Weak Learners with Applications to Textual and Social Data

» Minimum Description Length Penalization for Group and MultiTask Sparse Learning

» Multitask Feature Selection Using the Multiple Inclusion Criterion MIC

» CompactKdt Compact signatures for accurate large scale object recognition

» Boosted MultiTask Learning for Face Verification With Applications to Web Image and Video ...

Post Info
More Details (n/a)

Added	19 May 2010
Updated	19 May 2010
Type	Conference
Year	2009
Where	ICML
Authors	Kilian Q. Weinberger, Anirban Dasgupta, John Langford, Alexander J. Smola, Josh Attenberg

Comments (0)

Sciweavers

Feature hashing for large scale multitask learning

Exponential Tail Bounds | Feature Hashing | ICML 2009 | Machine Learning | Practical Nonparametric Estimation |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers