Search Sciweavers | Sciweavers

81 search results - page 5 / 17

» The Optimal Reward Baseline for Gradient-Based Reinforcement...

196

click to vote

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 8 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

226

click to vote

GECCO
2006
Springer

198views Optimization» more GECCO 2006»

Reward allotment in an event-driven hybrid learning classifier system for online soccer games

15 years 11 months ago

Download www.cs.bham.ac.uk

This paper describes our study into the concept of using rewards in a classifier system applied to the acquisition of decision-making algorithms for agents in a soccer game. Our a...

Yuji Sato, Yosuke Akatsuka, Takenori Nishizono

claim paper

Read More »

208

click to vote

GECCO
2006
Springer

133views Optimization» more GECCO 2006»

On-line evolutionary computation for reinforcement learning in stochastic domains

15 years 11 months ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

claim paper

Read More »

215

click to vote

ICML
1999
IEEE

138views Machine Learning» more ICML 1999»

Using Reinforcement Learning to Spider the Web Efficiently

16 years 8 months ago

Download www.cs.iastate.edu

Consider the task of exploring the Web in order to find pages of a particular kind or on a particular topic. This task arises in the construction of search engines and Web knowled...

Jason Rennie, Andrew McCallum

claim paper

Read More »

205

click to vote

IAT
2007
IEEE

92views Intelligent Agents» more IAT 2007»

Noise Tolerance in Reinforcement Learning Algorithms

16 years 1 months ago

Download www.ppgia.pucpr.br

This paper proposes a mechanism of noise tolerance for reinforcement learning algorithms. An adaptive agent that employs reinforcement learning algorithms may receive and accumula...

Richardson Ribeiro, Alessandro L. Koerich, Fabr&ia...

claim paper

Read More »

« Prev « First page 5 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers