Bayesian sparse sampling for on-line reward optimization



We present an efficient "sparse sampling" technique for approximating Bayes-optimal decision making in reinforcement learning, addressing the well-known exploration versus exploitation tradeoff. Our approach combines sparse sampling with Bayesian exploration to achieve improved decision making while controlling computational cost. The idea is to grow a sparse lookahead tree intelligently by exploiting information in a Bayesian posterior, rather than enumerating action branches (standard sparse sampling) or compensating myopically (value of perfect information). The outcome is a flexible, practical technique for improving action selection in simple reinforcement learning scenarios.
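The tree-growing idea in the abstract can be illustrated with a small sketch. It is not the authors' published procedure: it assumes a Bernoulli bandit with independent Beta posteriors per arm, and it assumes that "exploiting the posterior" means Thompson-style action selection at each decision node (sample a model from the node's belief, act greedily on it). The names `Node`, `grow`, `q_values`, and `choose_action` are hypothetical and introduced only for illustration.

```python
import random

class Node:
    """Decision node holding one Beta posterior (alpha, beta) per arm (assumed model)."""
    def __init__(self, belief, depth):
        self.belief = belief    # tuple of (alpha, beta) pairs
        self.depth = depth
        self.children = {}      # (arm, reward) -> child Node
        self.counts = {}        # (arm, reward) -> number of sampled visits

def update(belief, arm, reward):
    """Return the belief after observing a 0/1 reward on one arm."""
    a, b = belief[arm]
    return belief[:arm] + ((a + reward, b + 1 - reward),) + belief[arm + 1:]

def grow(root, horizon, rng):
    """Add one sampled root-to-leaf path to the lookahead tree."""
    node = root
    while node.depth < horizon:
        # Sample a model from the posterior at this node and act greedily on it,
        # instead of enumerating every action branch.
        sampled = [rng.betavariate(a, b) for a, b in node.belief]
        arm = max(range(len(sampled)), key=sampled.__getitem__)
        # Sample the outcome from the posterior predictive for that arm.
        a, b = node.belief[arm]
        reward = 1 if rng.random() < a / (a + b) else 0
        key = (arm, reward)
        node.counts[key] = node.counts.get(key, 0) + 1
        if key not in node.children:
            node.children[key] = Node(update(node.belief, arm, reward), node.depth + 1)
        node = node.children[key]

def q_values(node):
    """Average sampled returns per arm; only arms that were sampled appear."""
    totals, sums = {}, {}
    for (arm, reward), n in node.counts.items():
        child = node.children[(arm, reward)]
        totals[arm] = totals.get(arm, 0) + n
        sums[arm] = sums.get(arm, 0.0) + n * (reward + value(child))
    return {arm: sums[arm] / totals[arm] for arm in totals}

def value(node):
    """Back up values: max over sampled arms, 0 at the horizon."""
    q = q_values(node)
    return max(q.values()) if q else 0.0

def choose_action(belief, horizon=3, n_paths=200, seed=0):
    """Grow a sparse tree from the current belief and pick the best root arm."""
    rng = random.Random(seed)
    root = Node(tuple(belief), 0)
    for _ in range(n_paths):
        grow(root, horizon, rng)
    q = q_values(root)
    return max(q, key=q.get)
```

Calling `choose_action([(1, 1), (3, 2)])` returns the index of the arm with the highest backed-up value estimate. In this sketch the computational cost is controlled by `n_paths` and `horizon` rather than by the number of actions, which is the trade-off the abstract describes.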

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, Dale Schuurmans


Bayes Optimal Decision | ICML 2005 | Machine Learning | Sparse Lookahead Tree | Standard Sparse


» Optimal Rewards versus Leaf-Evaluation Heuristics in Planning Agents

» Large Scale Bayesian Inference and Experimental Design for Sparse Linear Models

» Near-Bayesian exploration in polynomial time

» Action Selection in Bayesian Reinforcement Learning

» Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

» Visual Learning Given Sparse Data of Unknown Complexity

» Multi-activity Tracking in LLE Body Pose Space

Post Info

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2005
Where	ICML
Authors	Tao Wang, Daniel J. Lizotte, Michael H. Bowling, Dale Schuurmans
