Search Sciweavers | Sciweavers

248 search results - page 9 / 50

» Rate of Convergence for Constrained Stochastic Approximation...

143

click to vote

GLOBECOM
2006
IEEE

86views Communications» more GLOBECOM 2006»

Power Optimal Opportunistic Scheduling

16 years 1 months ago

Download www.tcs.tifr.res.in

Abstract— In this paper, we propose a power optimal opportunistic scheduling scheme for a multiuser single hop Time Division Multiple Access (TDMA) system. We formulate the probl...

Abhijeet Bhorkar, Abhay Karandikar, Vivek S. Borka...

claim paper

Read More »

187

click to vote

CDC
2010
IEEE

104views Control Systems» more CDC 2010»

Single timescale regularized stochastic approximation schemes for monotone Nash games under uncertainty

15 years 2 months ago

Download netfiles.uiuc.edu

Abstract-- In this paper, we consider the distributed computation of equilibria arising in monotone stochastic Nash games over continuous strategy sets. Such games arise in setting...

Jayash Koshal, Angelia Nedic, Uday V. Shanbhag

claim paper

Read More »

193

click to vote

CDC
2010
IEEE

167views Control Systems» more CDC 2010»

Numerical methods for the optimization of nonlinear stochastic delay systems, and an application to internet regulation

15 years 2 months ago

Download www.dam.brown.edu

The Markov chain approximation method is an effective and widely used approach for computing optimal values and controls for stochastic systems. It was extended to nonlinear (and p...

Harold J. Kushner

claim paper

Read More »

271

click to vote

AI
2002
Springer

171views Artificial Intelligence» more AI 2002»

Multiagent learning using a variable learning rate

15 years 7 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

188

click to vote

TOMACS
2010

79views more TOMACS 2010»

A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

15 years 1 months ago

Download legacy.orie.cornell.edu

In this paper, we develop a stochastic approximation method to solve a monotone estimation problem and use this method to enhance the empirical performance of the Q-learning algor...

Sumit Kunnumkal, Huseyin Topaloglu

claim paper

Read More »

« Prev « First page 9 / 50 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers