Search Sciweavers | Sciweavers

178 search results - page 8 / 36

» Probabilistic policy reuse in a reinforcement learning agent

click to vote

AAMAS
2007
Springer

142views Intelligent Agents» more AAMAS 2007»

Parallel Reinforcement Learning with Linear Function Approximation

13 years 8 months ago

Download www.aamas-conference.org

In this paper, we investigate the use of parallelization in reinforcement learning (RL), with the goal of learning optimal policies for single-agent RL problems more quickly by us...

Matthew Grounds, Daniel Kudenko

claim paper

Read More »

click to vote

KCAP
2009
ACM

171views Information Technology» more KCAP 2009»

Interactively shaping agents via human reinforcement: the TAMER framework

14 years 3 months ago

Download userweb.cs.utexas.edu

As computational learning agents move into domains that incur real costs (e.g., autonomous driving or ﬁnancial investment), it will be necessary to learn good policies without n...

W. Bradley Knox, Peter Stone

claim paper

Read More »

click to vote

NN
2006
Springer

72views Neural Networks» more NN 2006»

Neural systems implicated in delayed and probabilistic reinforcement

13 years 8 months ago

Download egret.psychol.cam.ac.uk

This review considers the theoretical problems facing agents that must learn and choose on the basis of reward or reinforcement that is uncertain or delayed, in implicit or proced...

Rudolf N. Cardinal

claim paper

Read More »

click to vote

GECCO
2006
Springer

133views Optimization» more GECCO 2006»

On-line evolutionary computation for reinforcement learning in stochastic domains

14 years 8 days ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

claim paper

Read More »

click to vote

ATAL
2007
Springer

147views Intelligent Agents» more ATAL 2007»

A reinforcement learning based distributed search algorithm for hierarchical peer-to-peer information retrieval systems

14 years 18 days ago

Download www.haizhengzhang.com

The dominant existing routing strategies employed in peerto-peer(P2P) based information retrieval(IR) systems are similarity-based approaches. In these approaches, agents depend o...

Haizheng Zhang, Victor R. Lesser

claim paper

Read More »

« Prev « First page 8 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers