Search Sciweavers | Sciweavers

91 search results - page 14 / 19

» Parameter-exploring policy gradients

164

click to vote

GLOBECOM
2009
IEEE

126views Communications» more GLOBECOM 2009»

Stochastic Resource Allocation over Fading Multiple Access and Broadcast Channels

15 years 10 months ago

Download www.ee.fau.edu

In this paper, we consider the optimal rate and power allocation that maximizes a general utility function of average user rates in a fading multiple-access or broadcast channel. B...

Na Gao, Xin Wang

claim paper

Read More »

175

click to vote

ICAC
2008
IEEE

123views Applied Computing» more ICAC 2008»

Generating Adaptation Policies for Multi-tier Applications in Consolidated Server Environments

16 years 19 days ago

Download www.cc.gatech.edu

Creating good adaptation policies is critical to building complex autonomic systems since it is such policies that deﬁne the system conﬁguration used in any given situation. W...

Gueyoung Jung, Kaustubh R. Joshi, Matti A. Hiltune...

claim paper

Read More »

162

click to vote

NIPS
2008

159views Information Technology» more NIPS 2008»

Policy Search for Motor Primitives in Robotics

15 years 7 months ago

Download www.kyb.tuebingen.mpg.de

Many motor skills in humanoid robotics can be learned using parametrized motor primitives as done in imitation learning. However, most interesting motor learning problems are high...

Jens Kober, Jan Peters

claim paper

Read More »

163

click to vote

TIT
2010

115views Education» more TIT 2010»

On resource allocation in fading multiple-access channels-an efficient approximate projection approach

15 years 27 days ago

Download web.mit.edu

We consider the problem of rate and power allocation in a multiple-access channel. Our objective is to obtain rate and power allocation policies that maximize a general concave ut...

Ali ParandehGheibi, Atilla Eryilmaz, Asuman E. Ozd...

claim paper

Read More »

170

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 7 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

« Prev « First page 14 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers