Sciweavers

1234 search results - page 25 / 247
» Multi-criteria Reinforcement Learning
Sort
View
134
Voted
JAIR
2011
144views more  JAIR 2011»
14 years 10 months ago
Non-Deterministic Policies in Markovian Decision Processes
Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...
Mahdi Milani Fard, Joelle Pineau
EWRL
2008
15 years 5 months ago
Bayesian Reward Filtering
A wide variety of function approximation schemes have been applied to reinforcement learning. However, Bayesian filtering approaches, which have been shown efficient in other field...
Matthieu Geist, Olivier Pietquin, Gabriel Fricout
GECCO
2004
Springer
122views Optimization» more  GECCO 2004»
15 years 9 months ago
Gradient-Based Learning Updates Improve XCS Performance in Multistep Problems
This paper introduces a gradient-based reward prediction update mechanism to the XCS classifier system as applied in neuralnetwork type learning and function approximation mechani...
Martin V. Butz, David E. Goldberg, Pier Luca Lanzi
104
Voted
ICML
1997
IEEE
16 years 4 months ago
Exponentiated Gradient Methods for Reinforcement Learning
Doina Precup, Richard S. Sutton