Search Sciweavers | Sciweavers

802 search results - page 5 / 161

» Experts in a Markov Decision Process

ICML
2006
IEEE

Machine Learning

Qualitative reinforcement learning

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

IJCAI
2007

Artificial Intelligence

Bayesian Inverse Reinforcement Learning

Download www.ijcai.org

Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an e...

Deepak Ramachandran, Eyal Amir

NIPS
2001

Information Technology

Infinite Mixtures of Gaussian Process Experts

Download books.nips.cc

We present an extension to the Mixture of Experts (ME) model, where the individual experts are Gaussian Process (GP) regression models. Using an input-dependent adaptation of the ...

Carl Edward Rasmussen, Zoubin Ghahramani

ICMAS
2000

Intelligent Agents

Communication in Multi-Agent Markov Decision Processes

Download mas.cs.umass.edu

In this paper, we formulate agent's decision process under the framework of Markov decision processes, and in particular, the multi-agent extension to Markov decision process...

Ping Xuan, Victor R. Lesser, Shlomo Zilberstein

CORR
2010
Springer

Education

Mean field for Markov Decision Processes: from Discrete to Continuous Optimization

Download infoscience.epfl.ch

We study the convergence of Markov Decision Processes made of a large number of objects to optimization problems on ordinary differential equations (ODE). We show that the optimal...

Nicolas Gast, Bruno Gaujal, Jean-Yves Le Boudec
