Search Sciweavers | Sciweavers

337 search results - page 50 / 68

» Mean-Variance Optimization in Markov Decision Processes

GLOBECOM
2006
IEEE

160 views, Communications

Adaptive Learning of Transmission Control Policies for MIMO Fading Channels under Delay Constraint

15 years 12 months ago

Download www.ece.ubc.ca

This paper addresses learning-based adaptive resource allocation for wireless MIMO channels with Markovian fading. The problem is posed as a Constrained Markov Decision Process w...

Dejan V. Djonin, Vikram Krishnamurthy

CORR
2010
Springer

98 views, Education

Structure-Aware Stochastic Control for Transmission Scheduling

15 years 5 months ago

Download medianetlab.ee.ucla.edu

In this report, we consider the problem of real-time transmission scheduling over time-varying channels. We first formulate the transmission scheduling problem as a Markov decisio...

Fangwen Fu, Mihaela van der Schaar

ICML
2003
IEEE

124 views, Machine Learning

Exploration in Metric State Spaces

16 years 6 months ago

Download www.cis.upenn.edu

We present metric-E³, a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

AIMSA
2004
Springer

104 views, Artificial Intelligence

Towards Well-Defined Multi-agent Reinforcement Learning

15 years 9 months ago

Download userweb.port.ac.uk

Multi-agent reinforcement learning (MARL) is an emerging area of research. However, it lacks two important elements: a coherent view on MARL, and a well-defined problem objective. ...

Rinat Khoussainov

AAAI
2004

167 views, Intelligent Agents

Dynamic Programming for Partially Observable Stochastic Games

15 years 7 months ago

Download anytime.cs.umass.edu

We develop an exact dynamic programming algorithm for partially observable stochastic games (POSGs). The algorithm is a synthesis of dynamic programming for partially observable M...

Eric A. Hansen, Daniel S. Bernstein, Shlomo Zilber...
