Sciweavers

NIPS
2004
13 years 11 months ago
Maximum-Margin Matrix Factorization
We present a novel approach to collaborative prediction, using low-norm instead of low-rank factorizations. The approach is inspired by, and has strong connections to, large-margi...
Nathan Srebro, Jason D. M. Rennie, Tommi Jaakkola
NIPS
2004
13 years 11 months ago
Density Level Detection is Classification
We show that anomaly detection can be interpreted as a binary classification problem. Using this interpretation we propose a support vector machine (SVM) for anomaly detection. We...
Ingo Steinwart, Don R. Hush, Clint Scovel
NIPS
2004
13 years 11 months ago
The Convergence of Contrastive Divergences
This paper analyses the Contrastive Divergence algorithm for learning statistical parameters. We relate the algorithm to the stochastic approximation literature. This enables us t...
Alan L. Yuille
NIPS
2001
13 years 11 months ago
Reinforcement Learning with Long Short-Term Memory
This paper presents reinforcement learning with a Long Short-Term Memory recurrent neural network: RL-LSTM. Model-free RL-LSTM using Advantage learning and directed exploration can...
Bram Bakker
NIPS
2001
13 years 11 months ago
Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning
We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...
Gregory Z. Grudic, Lyle H. Ungar