Robust Learning for Adaptive Programs by Leveraging Program Structure

13 years 10 months ago

Download web.engr.oregonstate.edu

Abstract--We study how to effectively integrate reinforcement learning (RL) and programming languages via adaptation-based programming, where programs can include non-deterministic structures that can be automatically optimized via RL. Prior work has optimized adaptive programs by defining an induced sequential decision process to which standard RL is applied. Here we show that the success of this approach is highly sensitive to the specific program structure, where even seemingly minor program transformations can lead to failure. This sensitivity makes it extremely difficult for a non-RL-expert to write effective adaptive programs. In this paper, we study a more robust learning approach, where the key idea is to leverage information about program structure in order to define a more informative decision process and to improve the SARSA() RL algorithm. Our empirical results show significant benefits for this approach.

Jervis Pinto, Alan Fern, Tim Bauer, Martin Erwig

Real-time Traffic

Adaptive Programs | Decision Process | ICMLA 2010 | Machine Learning | Program Structure |

claim paper

Post Info
More Details (n/a)

Added	12 Feb 2011
Updated	12 Feb 2011
Type	Journal
Year	2010
Where	ICMLA
Authors	Jervis Pinto, Alan Fern, Tim Bauer, Martin Erwig

Comments (0)

Sciweavers

Robust Learning for Adaptive Programs by Leveraging Program Structure

Adaptive Programs | Decision Process | ICMLA 2010 | Machine Learning | Program Structure |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers