Search Sciweavers | Sciweavers

85 search results - page 5 / 17

» Approximate Policy Iteration with a Policy Language Bias

click to vote

ESOP
2007
Springer

152views Programming Languages» more ESOP 2007»

Static Analysis by Policy Iteration on Relational Domains

14 years 1 months ago

Download minimal.inria.fr

We give a new practical algorithm to compute, in ﬁnite time, a ﬁxpoint (and often the least ﬁxpoint) of a system of equations in the abstract numerical domains of zones and t...

Stephane Gaubert, Eric Goubault, Ankur Taly, Sarah...

claim paper

Read More »

click to vote

CORR
2010
Springer

119views Education» more CORR 2010»

Dynamic Policy Programming

13 years 7 months ago

Download www.snn.ru.nl

In this paper, we consider the problem of planning and learning in the infinite-horizon discounted-reward Markov decision problems. We propose a novel iterative direct policysearc...

Mohammad Gheshlaghi Azar, Hilbert J. Kappen

claim paper

Read More »

click to vote

TIT
2010

115views Education» more TIT 2010»

On resource allocation in fading multiple-access channels-an efficient approximate projection approach

13 years 2 months ago

Download web.mit.edu

We consider the problem of rate and power allocation in a multiple-access channel. Our objective is to obtain rate and power allocation policies that maximize a general concave ut...

Ali ParandehGheibi, Atilla Eryilmaz, Asuman E. Ozd...

claim paper

Read More »

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

14 years 8 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

click to vote

NIPS
2000

121views Information Technology» more NIPS 2000»

APRICODD: Approximate Policy Construction Using Decision Diagrams

13 years 8 months ago

Download www.cs.ubc.ca

We propose a method of approximate dynamic programming for Markov decision processes (MDPs) using algebraic decision diagrams (ADDs). We produce near-optimal value functions and p...

Robert St-Aubin, Jesse Hoey, Craig Boutilier

claim paper

Read More »

« Prev « First page 5 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers