Reinforcement Learning via AIXI Approximation

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents. Previously, it has been unclear whether the theory of AIXI could motivate the design of practical algorithms. We answer this hitherto open question in the affirmative, by providing the first computationally feasible approximation to the AIXI agent. To develop our approximation, we introduce a Monte Carlo Tree Search algorithm along with an agent-specific extension of the Context Tree Weighting algorithm. Empirically, we present a set of encouraging results on a number of stochastic, unknown, and partially observable domains.
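The abstract names Context Tree Weighting but does not describe it. As a rough illustration only, the sketch below shows the Krichevsky-Trofimov (KT) estimator, which the standard binary formulation of Context Tree Weighting uses at each node of the context tree. It is not the authors' agent-specific extension. The names `KTEstimator` and `kt_sequence_probability` are hypothetical and chosen for this example.

```python
from fractions import Fraction


class KTEstimator:
    """Krichevsky-Trofimov estimator for a binary sequence (illustrative sketch)."""

    def __init__(self):
        self.zeros = 0                 # number of 0 bits seen so far
        self.ones = 0                  # number of 1 bits seen so far
        self.block_prob = Fraction(1)  # probability assigned to the sequence so far

    def predict(self, bit):
        """Probability that the next bit equals `bit`: (count + 1/2) / (n + 1)."""
        count = self.ones if bit == 1 else self.zeros
        total = self.zeros + self.ones
        return Fraction(2 * count + 1, 2 * (total + 1))

    def update(self, bit):
        """Fold the observed bit into the sequence probability, then into the counts."""
        self.block_prob *= self.predict(bit)
        if bit == 1:
            self.ones += 1
        else:
            self.zeros += 1


def kt_sequence_probability(bits):
    """KT probability of a whole bit sequence, computed one bit at a time."""
    estimator = KTEstimator()
    for bit in bits:
        estimator.update(bit)
    return estimator.block_prob
```

Exact fractions are used so the probabilities can be checked by hand. A practical implementation would more likely work with log-probabilities in floating point.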

Joel Veness, Kee Siong Ng, Marcus Hutter, David Silver

AAAI 2010 | General Reinforcement | Intelligent Agents | Reinforcement Learning Agents | Scalable General Reinforcement |

Related Content

» A Monte-Carlo AIXI Approximation

» Improving reinforcement learning function approximators via neuroevolution

» Reinforcement Learning in POMDPs via Direct Gradient Ascent

» Model-free reinforcement learning as mixture learning

» Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian p...

» Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

» Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

» On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learni...

» Action Selection in Bayesian Reinforcement Learning

Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	AAAI
Authors	Joel Veness, Kee Siong Ng, Marcus Hutter, David Silver

Sciweavers
