Sciweavers

361 search results - page 7 / 73
» Approximate counting by dynamic programming
Sort
View
CDC
2010
IEEE
136views Control Systems» more  CDC 2010»
13 years 2 months ago
Pathologies of temporal difference methods in approximate dynamic programming
Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...
Dimitri P. Bertsekas
ICML
2006
IEEE
14 years 1 months ago
Automatic basis function construction for approximate dynamic programming and reinforcement learning
We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...
Philipp W. Keller, Shie Mannor, Doina Precup
FLAIRS
2009
13 years 5 months ago
Dynamic Programming Approximations for Partially Observable Stochastic Games
Partially observable stochastic games (POSGs) provide a rich mathematical framework for planning under uncertainty by a group of agents. However, this modeling advantage comes wit...
Akshat Kumar, Shlomo Zilberstein
AUTOMATICA
2010
76views more  AUTOMATICA 2010»
13 years 7 months ago
Approximate dynamic programming with a fuzzy parameterization
Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...
AUTOMATICA
2006
115views more  AUTOMATICA 2006»
13 years 7 months ago
Approximate robust dynamic programming and robustly stable MPC
Jakob Björnberg, Moritz Diehl