Search Sciweavers | Sciweavers

361 search results - page 7 / 73

» Approximate counting by dynamic programming

click to vote

CDC
2010
IEEE

136views Control Systems» more CDC 2010»

Pathologies of temporal difference methods in approximate dynamic programming

13 years 2 months ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas

claim paper

Read More »

click to vote

ICML
2006
IEEE

256views Machine Learning» more ICML 2006»

Automatic basis function construction for approximate dynamic programming and reinforcement learning

14 years 1 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

claim paper

Read More »

click to vote

FLAIRS
2009

156views Artificial Intelligence» more FLAIRS 2009»

Dynamic Programming Approximations for Partially Observable Stochastic Games

13 years 5 months ago

Download rbr.cs.umass.edu

Partially observable stochastic games (POSGs) provide a rich mathematical framework for planning under uncertainty by a group of agents. However, this modeling advantage comes wit...

Akshat Kumar, Shlomo Zilberstein

claim paper

Read More »

click to vote

AUTOMATICA
2010

76views more AUTOMATICA 2010»

Approximate dynamic programming with a fuzzy parameterization

13 years 7 months ago