Sciweavers

771 search results - page 94 / 155
» Markov Decision Processes with Arbitrary Reward Processes
Sort
View
166
Voted
UAI
2007
15 years 5 months ago
Automatic Generation of Four-part Harmony
This paper introduces decision-theoretic planning techniques into automatic music generation. Markov decision processes (MDPs) are a mathematical model of planning under uncertain...
Liangrong Yi, Judy Goldsmith
ML
2002
ACM
146views Machine Learning» more  ML 2002»
15 years 3 months ago
Variable Resolution Discretization in Optimal Control
Abstract. The problemof state abstractionis of centralimportancein optimalcontrol,reinforcement learning and Markov decision processes. This paper studies the case of variable reso...
Rémi Munos, Andrew W. Moore
ICML
2001
IEEE
16 years 4 months ago
Off-Policy Temporal Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
139
Voted
AIPS
2009
15 years 5 months ago
Efficient Solutions to Factored MDPs with Imprecise Transition Probabilities
When modeling real-world decision-theoretic planning problems in the Markov decision process (MDP) framework, it is often impossible to obtain a completely accurate estimate of tr...
Karina Valdivia Delgado, Scott Sanner, Leliane Nun...
CDC
2009
IEEE
134views Control Systems» more  CDC 2009»
15 years 8 months ago
Event-based control using quadratic approximate value functions
Abstract— In this paper we consider several problems involving control with limited actuation and sampling rates. Event-based control has emerged as an attractive approach for ad...
Randy Cogill