The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning

15 years 1 months ago

Download www.research.rutgers.edu

The purpose of this paper is three-fold. First, we formalize and study a problem of learning probabilistic concepts in the recently proposed KWIK framework. We give details of an algorithm, known as the Adaptive k-Meteorologists Algorithm, analyze its sample-complexity upper bound, and give a matching lower bound. Second, this algorithm is used to create a new reinforcement-learning algorithm for factoredstate problems that enjoys significant improvement over the previous state-of-the-art algorithm. Finally, we apply the Adaptive k-Meteorologists Algorithm to remove a limiting assumption in an existing reinforcement-learning algorithm. The effectiveness of our approaches is demonstrated empirically in a couple benchmark domains as well as a robotics navigation problem.

Carlos Diuk, Lihong Li, Bethany R. Leffler

Real-time Traffic

ICML 2009 | Lower Bound | Machine Learning | Robotics Navigation Problem | Sample-complexity Upper Bound |

claim paper

Post Info
More Details (n/a)

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2009
Where	ICML
Authors	Carlos Diuk, Lihong Li, Bethany R. Leffler

Comments (0)

Sciweavers

The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning

ICML 2009 | Lower Bound | Machine Learning | Robotics Navigation Problem | Sample-complexity Upper Bound |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers