Search Sciweavers | Sciweavers

38 search results - page 4 / 8

» On the Convergence of Optimistic Policy Iteration

click to vote

ICML
2010
IEEE

219views Machine Learning» more ICML 2010»

Convergence of Least Squares Temporal Difference Methods Under General Conditions

13 years 8 months ago

Download www.cs.helsinki.fi

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...

Huizhen Yu

claim paper

Read More »

click to vote

CORR
2008
Springer

115views Education» more CORR 2008»

Adaptive Sum Power Iterative Waterfilling for MIMO Cognitive Radio Channels

13 years 7 months ago

Download wncg.org

Abstract--In this paper, the sum capacity of the Gaussian Multiple Input Multiple Output (MIMO) Cognitive Radio Channel (MCC) is expressed as a convex problem with finite number of...

Rajiv Soundararajan, Sriram Vishwanath

claim paper

Read More »

click to vote

MICRO
2006
IEEE

73views Hardware» more MICRO 2006»

Merging Head and Tail Duplication for Convergent Hyperblock Formation

14 years 1 months ago

Download userweb.cs.utexas.edu

VLIW and EDGE (Explicit Data Graph Execution) architectures rely on compilers to form high-quality hyperblocks for good performance. These compilers typically perform hyperblock f...

Bertrand A. Maher, Aaron Smith, Doug Burger, Kathr...

claim paper

Read More »

click to vote

GLOBECOM
2009
IEEE

126views Communications» more GLOBECOM 2009»

Stochastic Resource Allocation over Fading Multiple Access and Broadcast Channels

13 years 11 months ago

Download www.ee.fau.edu

In this paper, we consider the optimal rate and power allocation that maximizes a general utility function of average user rates in a fading multiple-access or broadcast channel. B...

Na Gao, Xin Wang

claim paper

Read More »

click to vote

AI
2002
Springer

171views Artificial Intelligence» more AI 2002»

Multiagent learning using a variable learning rate

13 years 7 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

« Prev « First page 4 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers