Search Sciweavers | Sciweavers

248 search results - page 18 / 50

» Rate of Convergence for Constrained Stochastic Approximation...

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 11 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (POMDP), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

15 years 2 months ago

Download webee.technion.ac.il

We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

ICPPW
2009
IEEE

196views Distributed And Parallel Com...» more ICPPW 2009»

Dynamic Control and Resource Allocation in Wireless-Infrastructured Distributed Cellular Networks with OFDMA

16 years 1 months ago

Download www.signal.uu.se

In this paper, we consider joint optimization of end-to-end data transmission and resource allocation for Wireless-Infrastructured Distributed Cellular Networks (WIDCNs), where ...

Lei You, Ping Wu, Mei Song, Junde Song, Yong Zhang

CORR
2011
Springer

204views Education» more CORR 2011»

Accelerated Dual Descent for Network Optimization

15 years 1 months ago

Download web.mit.edu

Dual descent methods are commonly used to solve network optimization problems because their implementation can be distributed through the network. However, their convergence rat...

Michael Zargham, A. Ribeiro, Ali Jadbabaie, Asuman...

IAT
2006
IEEE

148views Intelligent Agents» more IAT 2006»

An Approximate Algorithm for Resource Allocation Using Combinatorial Auctions

16 years 1 months ago

Download tmullen.ist.psu.edu

Combinatorial Auctions (CAs), where users bid on combinations of items, have emerged as a useful tool for resource allocation in distributed systems. However, two main difficulties...

Viswanath Avasarala, Himanshu Polavarapu, Tracy Mu...
