Sciweavers

» Multi-criteria Reinforcement Learning
PRICAI
2000
Springer
15 years 8 months ago
Generating Hierarchical Structure in Reinforcement Learning from State Variables
This paper presents the CQ algorithm which decomposes and solves a Markov Decision Process (MDP) by automatically generating a hierarchy of smaller MDPs using state variables. The ...
Bernhard Hengst
AAAI
2007
15 years 6 months ago
Efficient Reinforcement Learning with Relocatable Action Models
Bethany R. Leffler, Michael L. Littman, Timothy Ed...