Sciweavers

827 search results - page 34 / 166
» Variational methods for Reinforcement Learning
Sort
View
ATAL
2006
Springer
13 years 11 months ago
Probabilistic policy reuse in a reinforcement learning agent
We contribute Policy Reuse as a technique to improve a reinforcement learning agent with guidance from past learned similar policies. Our method relies on using the past policies ...
Fernando Fernández, Manuela M. Veloso
ICML
2010
IEEE
13 years 5 months ago
Constructing States for Reinforcement Learning
POMDPs are the models of choice for reinforcement learning (RL) tasks where the environment cannot be observed directly. In many applications we need to learn the POMDP structure ...
M. M. Hassan Mahmud
CEC
2008
IEEE
14 years 2 months ago
Creating edge detectors by evolutionary reinforcement learning
— In this article we present results from experiments where a edge detector was learned from scratch by EANT2, a method for evolutionary reinforcement learning. The detector is c...
Nils T. Siebel, Sven Grünewald, Gerald Sommer
COLT
2004
Springer
14 years 1 months ago
Reinforcement Learning for Average Reward Zero-Sum Games
Abstract. We consider Reinforcement Learning for average reward zerosum stochastic games. We present and analyze two algorithms. The first is based on relative Q-learning and the ...
Shie Mannor
GECCO
2006
Springer
159views Optimization» more  GECCO 2006»
13 years 11 months ago
Standard and averaging reinforcement learning in XCS
This paper investigates reinforcement learning (RL) in XCS. First, it formally shows that XCS implements a method of generalized RL based on linear approximators, in which the usu...
Pier Luca Lanzi, Daniele Loiacono