Sciweavers

208 search results - page 16 / 42
» nips 2008
Sort
View
NIPS
2008
13 years 11 months ago
Regularized Policy Iteration
In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...
Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...
NIPS
2008
13 years 11 months ago
Tracking Changing Stimuli in Continuous Attractor Neural Networks
Continuous attractor neural networks (CANNs) are emerging as promising models for describing the encoding of continuous stimuli in neural systems. Due to the translational invaria...
C. C. Alan Fung, K. Y. Michael Wong, Si Wu
NIPS
2008
13 years 11 months ago
Exact Convex Confidence-Weighted Learning
Confidence-weighted (CW) learning [6], an online learning method for linear classifiers, maintains a Gaussian distributions over weight vectors, with a covariance matrix that repr...
Koby Crammer, Mark Dredze, Fernando Pereira
NIPS
2008
13 years 11 months ago
Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation
Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...
Dotan Di Castro, Dmitry Volkinshtein, Ron Meir
NIPS
2008
13 years 11 months ago
Spectral Hashing
Semantic hashing[1] seeks compact binary codes of data-points so that the Hamming distance between codewords correlates with semantic similarity. In this paper, we show that the p...
Yair Weiss, Antonio Torralba, Robert Fergus