Search Sciweavers | Sciweavers

509 search results - page 21 / 102

» Compositional Models for Reinforcement Learning

158

click to vote

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

15 years 1 months ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

179

Voted

ATAL
2003
Springer

154views Intelligent Agents» more ATAL 2003»

Coordination in multiagent reinforcement learning: a Bayesian approach

16 years 1 days ago

Download www.cs.toronto.edu

Much emphasis in multiagent reinforcement learning (MARL) research is placed on ensuring that MARL algorithms (eventually) converge to desirable equilibria. As in standard reinfor...

Georgios Chalkiadakis, Craig Boutilier

claim paper

Read More »

192

click to vote

ICML
2007
IEEE

172views Machine Learning» more ICML 2007»

Conditional random fields for multi-agent reinforcement learning

16 years 7 months ago

Download www.machinelearning.org

Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of obser...

Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...

claim paper

Read More »

185

click to vote

CORR
2011
Springer

194views Education» more CORR 2011»

Accelerating Reinforcement Learning through Implicit Imitation

14 years 10 months ago

Download www.aaai.org

Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an agent’s ability to learn useful behaviors by making intelligent use of the kn...

Craig Boutilier, Bob Price

claim paper

Read More »

175

Voted

IROS
2007
IEEE

157views Robotics» more IROS 2007»

Autonomous blimp control using model-free reinforcement learning in a continuous state and action space

16 years 1 months ago

Download www.informatik.uni-freiburg.de

— In this paper, we present an approach that applies the reinforcement learning principle to the problem of learning height control policies for aerial blimps. In contrast to pre...

Axel Rottmann, Christian Plagemann, Peter Hilgers,...

claim paper

Read More »

« Prev « First page 21 / 102 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers