Search Sciweavers | Sciweavers

160 search results - page 6 / 32

» Optimization on a Budget: A Reinforcement Learning Approach

158

click to vote

GECCO
2006
Springer

133views Optimization» more GECCO 2006»

On-line evolutionary computation for reinforcement learning in stochastic domains

15 years 9 months ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

claim paper

Read More »

131

click to vote

UAI
2001

129views Artificial Intelligence» more UAI 2001»

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

15 years 6 months ago

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao

claim paper

Read More »

138

Voted

ICAI
2004

116views Artificial Intelligence» more ICAI 2004»

Action Inhibition

15 years 6 months ago

Download mysite.verizon.net

An explicit exploration strategy is necessary in reinforcement learning (RL) to balance the need to reduce the uncertainty associated with the expected outcome of an action and the...

Myriam Abramson

claim paper

Read More »

177

click to vote

ICML
2007
IEEE

200views Machine Learning» more ICML 2007»

Multi-task reinforcement learning: a hierarchical Bayesian approach

16 years 6 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...

claim paper

Read More »

175

click to vote

ICMLA
2010

207views Machine Learning» more ICMLA 2010»

Multi-Agent Inverse Reinforcement Learning

15 years 3 months ago

Download ftp.cs.wisc.edu

Learning the reward function of an agent by observing its behavior is termed inverse reinforcement learning and has applications in learning from demonstration or apprenticeship l...

Sriraam Natarajan, Gautam Kunapuli, Kshitij Judah,...

claim paper

Read More »

« Prev « First page 6 / 32 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers