Sciweavers

802 search results - page 5 / 161
» Experts in a Markov Decision Process
Sort
View
ICML
2006
IEEE
14 years 8 months ago
Qualitative reinforcement learning
When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...
Arkady Epshteyn, Gerald DeJong
IJCAI
2007
13 years 9 months ago
Bayesian Inverse Reinforcement Learning
Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an e...
Deepak Ramachandran, Eyal Amir
NIPS
2001
13 years 9 months ago
Infinite Mixtures of Gaussian Process Experts
We present an extension to the Mixture of Experts (ME) model, where the individual experts are Gaussian Process (GP) regression models. Using an input-dependent adaptation of the ...
Carl Edward Rasmussen, Zoubin Ghahramani
ICMAS
2000
13 years 9 months ago
Communication in Multi-Agent Markov Decision Processes
In this paper, we formulate agent's decision process under the framework of Markov decision processes, and in particular, the multi-agent extension to Markov decision process...
Ping Xuan, Victor R. Lesser, Shlomo Zilberstein
CORR
2010
Springer
127views Education» more  CORR 2010»
13 years 7 months ago
Mean field for Markov Decision Processes: from Discrete to Continuous Optimization
We study the convergence of Markov Decision Processes made of a large number of objects to optimization problems on ordinary differential equations (ODE). We show that the optimal...
Nicolas Gast, Bruno Gaujal, Jean-Yves Le Boudec