Sciweavers

1234 search results - page 60 / 247
» Multi-criteria Reinforcement Learning
Sort
View
ECML
2006
Springer
15 years 6 months ago
Reinforcement Learning for MDPs with Constraints
In this article, I will consider Markov Decision Processes with two criteria, each defined as the expected value of an infinite horizon cumulative return. The second criterion is e...
Peter Geibel
126
Voted
ECML
2004
Springer
15 years 9 months ago
Batch Reinforcement Learning with State Importance
Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions....
Lihong Li, Vadim Bulitko, Russell Greiner
KCAP
2009
ACM
15 years 10 months ago
Interactively shaping agents via human reinforcement: the TAMER framework
As computational learning agents move into domains that incur real costs (e.g., autonomous driving or financial investment), it will be necessary to learn good policies without n...
W. Bradley Knox, Peter Stone
137
Voted
COLT
1994
Springer
15 years 8 months ago
Efficient Reinforcement Learning
Realistic domains for learning possess regularities that make it possible to generalize experience across related states. This paper explores an environment-modeling framework tha...
Claude-Nicolas Fiechter
125
Voted
AAAI
1998
15 years 5 months ago
A Framework for Reinforcement Learning on Real Robots
Learning on real robots in an real, unaltered environment provides an extremely challenging problem. Many of the simplifying assumptions made in other areas of learning cannot be ...
William D. Smart, Leslie Pack Kaelbling