Search Sciweavers | Sciweavers

6 search results - page 1 / 2

ICRA
2009
IEEE

143 views, Robotics

Least absolute policy iteration for robust value function approximation

16 years 2 months ago

Download sugiyama-www.cs.titech.ac.jp

Abstract: Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...

AAAI
2007

126 views, Intelligent Agents

Point-Based Policy Iteration

15 years 9 months ago

Download www.cs.duke.edu

We describe a point-based policy iteration (PBPI) algorithm for infinite-horizon POMDPs. PBPI replaces the exact policy improvement step of Hansen’s policy iteration with point...

Shihao Ji, Ronald Parr, Hui Li, Xuejun Liao, Lawre...

PKDD
2009
Springer

169 views, Data Mining

Hybrid Least-Squares Algorithms for Approximate Policy Evaluation

16 years 1 month ago

Download www.cs.umass.edu

The goal of approximate policy evaluation is to “best” represent a target value function according to a specific criterion. Temporal difference methods and Bellman residual m...

Jeffrey Johns, Marek Petrik, Sridhar Mahadevan

ICML
1999
IEEE

168 views, Machine Learning

Least-Squares Temporal Difference Learning

16 years 8 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

CORR
2010
Springer

170 views, Education

Global Optimization for Value Function Approximation

15 years 7 months ago

Download www.cs.umass.edu

Existing value function approximation methods have been successfully used in many applications, but they often lack useful a priori error bounds. We propose a new approximate bili...

Marek Petrik, Shlomo Zilberstein
