Sciweavers

EUROCAST
2007
Springer
14 years 2 months ago
A k-NN Based Perception Scheme for Reinforcement Learning
Abstract: Reinforcement Learning (RL) is a paradigm of modern Machine Learning (ML) that uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...
José Antonio Martin H., Javier de Lope Asia...
CDC
2010
IEEE
13 years 3 months ago
Adaptive bases for Q-learning
Abstract: We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...
Dotan Di Castro, Shie Mannor
ECML
2005
Springer
14 years 2 months ago
Natural Actor-Critic
This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...
Jan Peters, Sethu Vijayakumar, Stefan Schaal
TMI
2008
13 years 8 months ago
Dynamic Positron Emission Tomography Data-Driven Analysis Using Sparse Bayesian Learning
A method is presented for the analysis of dynamic positron emission tomography (PET) data using sparse Bayesian learning. Parameters are estimated in a compartmental framework usin...
Jyh-Ying Peng, John A. D. Aston, R. N. Gunn, Cheng...

Publication
14 years 5 months ago
Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration
Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...
Christos Dimitrakakis, Michail G. Lagoudakis