Search Sciweavers | Sciweavers

1236 search results - page 8 / 248

» Opposition-Based Reinforcement Learning

211

Voted

ICCCI
2011
Springer

223views Intelligent Agents» more ICCCI 2011»

Evolving Equilibrium Policies for a Multiagent Reinforcement Learning Problem with State Attractors

14 years 2 months ago

Download florinleon.byethost24.com

Multiagent reinforcement learning problems are especially difficult because of their dynamism and the size of joint state space. In this paper a new benchmark problem is proposed, ...

Florin Leon

claim paper

Read More »

126

Voted

NECO
2010

97views more NECO 2010»

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

15 years 1 months ago

Download www.kyb.tuebingen.mpg.de

Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...

claim paper

Read More »

149

Voted

AGI
2011

231views Artificial Intelligence» more AGI 2011»

Reinforcement Learning and the Bayesian Control Rule

14 years 6 months ago

Download metatip.com

We present an actor-critic scheme for reinforcement learning in complex domains. The main contribution is to show that planning and I/O dynamics can be separated such that an intra...

Pedro Alejandro Ortega, Daniel Alexander Braun, Si...

claim paper

Read More »

138

Voted

JAIR
2000

131views more JAIR 2000»

An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email

15 years 2 months ago

Download www.jair.org

This paper describes a novel method by which a spoken dialogue system can learn to choose an optimal dialogue strategy from its experience interacting with human users. The method...

Marilyn A. Walker

claim paper

Read More »

197

Voted

GECCO
2011
Springer

276views Optimization» more GECCO 2011»

Evolution of reward functions for reinforcement learning

14 years 6 months ago

Download hampshire.edu

The reward functions that drive reinforcement learning systems are generally derived directly from the descriptions of the problems that the systems are being used to solve. In so...

Scott Niekum, Lee Spector, Andrew G. Barto

claim paper

Read More »

« Prev « First page 8 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers