Search Sciweavers | Sciweavers

377 search results - page 5 / 76

» Convergence of Stochastic Iterative Dynamic Programming Algo...

click to vote

STOC
2007
ACM

132views Algorithms» more STOC 2007»

On the convergence of Newton's method for monotone systems of polynomial equations

14 years 8 months ago

Download www7.informatik.tu-muenchen.de

Monotone systems of polynomial equations (MSPEs) are systems of fixed-point equations X1 = f1(X1, . . . , Xn), . . . , Xn = fn(X1, . . . , Xn) where each fi is a polynomial with p...

Stefan Kiefer, Michael Luttenberger, Javier Esparz...

claim paper

Read More »

click to vote

CDC
2010
IEEE

136views Control Systems» more CDC 2010»

Pathologies of temporal difference methods in approximate dynamic programming

13 years 2 months ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas

claim paper

Read More »

click to vote

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

13 years 9 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decisions processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

claim paper

Read More »

click to vote

CIA
2007
Springer

143views Intelligent Agents» more CIA 2007»

Multi-agent Learning Dynamics: A Survey

14 years 1 months ago

Download michaelkaisers.com

Abstract. In this paper we compare state-of-the-art multi-agent reinforcement learning algorithms in a wide variety of games. We consider two types of algorithms: value iteration a...

H. Jaap van den Herik, Daniel Hennes, Michael Kais...

claim paper

Read More »

click to vote

AOR
2010

122views Artificial Intelligence» more AOR 2010»

Speeding up Stochastic Dynamic Programming with Zero-Delay Convolution

13 years 5 months ago

Download www.cs.clemson.edu

We show how a technique from signal processing known as zero-delay convolution can be used to develop more efficient dynamic programming algorithms for a broad class of stochastic...

Brian C. Dean

claim paper

Read More »

« Prev « First page 5 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers