Approximate Policy Iteration

52

IJCAI
2003

147views Artificial Intelligence» more IJCAI 2003»

Approximate Policy Iteration using Large-Margin Classifiers

14 years 7 months ago

We present an approximate policy iteration algorithm that uses rollouts to estimate the value of each action under a given policy in a subset of states and a classifier to general...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

57

click to vote

CDC
2008
IEEE

206views Control Systems» more CDC 2008»

Approximate dynamic programming using support vector regression

15 years 11 days ago

Download web.mit.edu

— This paper presents a new approximate policy iteration algorithm based on support vector regression (SVR). It provides an overview of commonly used cost approximation architect...

Brett Bethke, Jonathan P. How, Asuman E. Ozdaglar

claim paper

Read More »

96

click to vote

Publication

334views

Rollout Sampling Approximate Policy Iteration

15 years 2 months ago

Download www.springerlink.com

Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schem...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

39

click to vote

ICML
2003
IEEE

174views Machine Learning» more ICML 2003»

Error Bounds for Approximate Policy Iteration

15 years 6 months ago