Search Sciweavers | Sciweavers

651 search results - page 37 / 131

» Algorithms for Inverse Reinforcement Learning

163

Voted

NECO
2010

97views more NECO 2010»

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

15 years 4 months ago

Download www.kyb.tuebingen.mpg.de

Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...

claim paper

Read More »

172

click to vote

AIIDE
2006

123views Artificial Intelligence» more AIIDE 2006»

The Self Organization of Context for Learning in MultiAgent Games

15 years 7 months ago

Download www.aaai.org

Reinforcement learning is an effective machine learning paradigm in domains represented by compact and discrete state-action spaces. In high-dimensional and continuous domains, ti...

Christopher D. White, Dave Brogan

claim paper

Read More »

176

Voted

SGAI
2004
Springer

101views Artificial Intelligence» more SGAI 2004»

Interactive Selection of Visual Features through Reinforcement Learning

15 years 11 months ago

Download www.montefiore.ulg.ac.be

We introduce a new class of Reinforcement Learning algorithms designed to operate in perceptual spaces containing images. They work by classifying the percepts using a computer vi...

Sébastien Jodogne, Justus H. Piater

claim paper

Read More »

168

click to vote

GECCO
2006
Springer

208views Optimization» more GECCO 2006»

Comparing evolutionary and temporal difference methods in a reinforcement learning domain

15 years 9 months ago

Download www.cs.bham.ac.uk

Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

174

Voted

ECAI
2010
Springer

238views Artificial Intelligence» more ECAI 2010»

The Dynamics of Multi-Agent Reinforcement Learning

15 years 7 months ago

Download www.doc.ic.ac.uk

Abstract. Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...

Luke Dickens, Krysia Broda, Alessandra Russo

claim paper

Read More »

« Prev « First page 37 / 131 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers