Intra-Option Learning about Temporally Abstract Actions

Richard S. Sutton
Department of Computer Science
University of Massachusetts
Amherst, MA 01003-4610
rich@cs.umass.edu

Doina Precup
Department of Computer Science
University of Massachusetts
Amherst, MA 01003-4610
dprecup@cs.umass.edu

Satinder Singh
Department of Computer Science
University of Colorado
Boulder, CO 80309-0430
baveja@cs.colorado.edu

Several researchers have proposed modeling temporally abstract actions in reinforcement learning by the combination of a policy and a termination condition, which we refer to as an option. Value functions over options and models of options can be learned using methods designed for semi-Markov decision processes (SMDPs). However, all these methods require an option to be executed to termination. In this paper we explore methods that learn about an option from small fragments of experience consistent with that option, even if the option itself is not executed. We call these methods intra-option learning methods...
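As a rough illustration (not taken from the paper itself), an option in the sense described above can be sketched as a policy paired with a stochastic termination condition. The names `Option`, `go_right`, and `execute` below are hypothetical, and the toy environment is a deterministic 1-D line of columns:

```python
from dataclasses import dataclass
from typing import Callable, Hashable

State = Hashable
Action = Hashable

@dataclass
class Option:
    """A temporally abstract action: a policy plus a termination condition."""
    policy: Callable[[State], Action]       # pi: state -> action
    termination: Callable[[State], float]   # beta: state -> probability of stopping

# Hypothetical option: always move "right", terminating with certainty
# once the agent reaches column 3.
go_right = Option(
    policy=lambda s: "right",
    termination=lambda s: 1.0 if s[0] >= 3 else 0.0,
)

def execute(option: Option,
            step: Callable[[State, Action], State],
            s: State) -> list:
    """Run the option to termination (beta is deterministic in this sketch),
    returning the visited states."""
    trajectory = [s]
    while option.termination(s) < 1.0:
        s = step(s, option.policy(s))
        trajectory.append(s)
    return trajectory

# Toy deterministic environment: "right" increments the column index.
traj = execute(go_right, lambda s, a: (s[0] + 1,) if a == "right" else s, (0,))
# traj visits columns 0 through 3
```

An SMDP-style learner would treat the whole trajectory produced by `execute` as one atomic transition; the intra-option methods the abstract refers to instead learn from each one-step fragment of such trajectories, whether or not the option was actually the one being executed.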