Search Sciweavers | Sciweavers

248 search results - page 6 / 50

» Rate of Convergence for Constrained Stochastic Approximation...

189

click to vote

NIPS
1993

128views Information Technology» more NIPS 1993»

Convergence of Stochastic Iterative Dynamic Programming Algorithms

15 years 8 months ago

Download www.bitsavers.org

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms,includ...

Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...

claim paper

Read More »

179

click to vote

MP
2006

107views more MP 2006»

Convergence theory for nonconvex stochastic programming with an application to mixed logit

15 years 7 months ago

Download www.fundp.ac.be

Monte Carlo methods have been used extensively in the area of stochastic programming. As with other methods that involve a level of uncertainty, theoretical properties are required...

Fabian Bastin, Cinzia Cirillo, Philippe L. Toint

claim paper

Read More »

172

click to vote

NIPS
2008

150views Information Technology» more NIPS 2008»

Fast Rates for Regularized Objectives

15 years 8 months ago

Download ttic.uchicago.edu

We study convergence properties of empirical minimization of a stochastic strongly convex objective, where the stochastic component is linear. We show that the value attained by t...

Karthik Sridharan, Shai Shalev-Shwartz, Nathan Sre...

claim paper

Read More »

200

click to vote

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 8 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

161

click to vote

WSC
2004

74views Modeling And Simulation» more WSC 2004»

Retrospective Approximation Algorithms for the Multidimensional Stochastic Root-Finding Problem

15 years 8 months ago

Download www.informs-sim.org

The stochastic root-finding problem (SRFP) is that of solving a system of q equations in q unknowns using only an oracle that provides estimates of the function values. This paper...

Raghu Pasupathy, Bruce W. Schmeiser

claim paper

Read More »

« Prev « First page 6 / 50 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers