Search Sciweavers | Sciweavers

473 search results - page 29 / 95

» Optimal policy switching algorithms for reinforcement learni...

146

click to vote

NIPS
2000

150views Information Technology» more NIPS 2000»

Programmable Reinforcement Learning Agents

15 years 7 months ago

Download reference.kfupm.edu.sa

We present an expressive agent design language for reinforcement learning that allows the user to constrain the policies considered by the learning process.The language includes s...

David Andre, Stuart J. Russell

claim paper

Read More »

200

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 22 days ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

169

click to vote

CORR
2010
Springer

152views Education» more CORR 2010»

Neuroevolutionary optimization

15 years 6 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná

claim paper

Read More »

154

click to vote

ICAC
2006
IEEE

112views Applied Computing» more ICAC 2006»

A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation

15 years 12 months ago

Download userweb.cs.utexas.edu

— Reinforcement Learning (RL) provides a promising new approach to systems performance management that differs radically from standard queuing-theoretic approaches making use of ...

Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mo...

claim paper

Read More »

170

click to vote

ICML
2000
IEEE

165views Machine Learning» more ICML 2000»

A Bayesian Framework for Reinforcement Learning

15 years 10 months ago

Download www.ece.uvic.ca

The reinforcement learning problem can be decomposed into two parallel types of inference: (i) estimating the parameters of a model for the underlying process; (ii) determining be...

Malcolm J. A. Strens

claim paper

Read More »

« Prev « First page 29 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers