Search Sciweavers | Sciweavers

31 search results - page 2 / 7

» Algorithms and Bounds for Rollout Sampling Approximate Polic...

click to vote

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

13 years 8 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

click to vote

NIPS
2008

165views Information Technology» more NIPS 2008»

Regularized Policy Iteration

13 years 8 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

click to vote

AAAI
2012

249views Intelligent Agents» more AAAI 2012»

Generalized Sampling and Variance in Counterfactual Regret Minimization

11 years 9 months ago

Download poker.cs.ualberta.ca

In large extensive form games with imperfect information, Counterfactual Regret Minimization (CFR) is a popular, iterative algorithm for computing approximate Nash equilibria. Whi...

Richard G. Gibson, Marc Lanctot, Neil Burch, Duane...

claim paper

Read More »

click to vote

CDC
2008
IEEE

206views Control Systems» more CDC 2008»

Approximate dynamic programming using support vector regression

14 years 1 months ago

Download web.mit.edu

— This paper presents a new approximate policy iteration algorithm based on support vector regression (SVR). It provides an overview of commonly used cost approximation architect...

Brett Bethke, Jonathan P. How, Asuman E. Ozdaglar

claim paper

Read More »

click to vote

CORR
2008
Springer

151views Education» more CORR 2008»

CoSaMP: Iterative signal recovery from incomplete and inaccurate samples

13 years 7 months ago

Download www.acm.caltech.edu

Compressive sampling offers a new paradigm for acquiring signals that are compressible with respect to an orthonormal basis. The major algorithmic challenge in compressive sampling...

Joel A. Tropp, Deanna Needell

claim paper

Read More »

« Prev « First page 2 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers