Search Sciweavers | Sciweavers

9 search results - page 1 / 2

» An empirical analysis of value function-based and policy sea...

click to vote

ATAL
2009
Springer

135views Intelligent Agents» more ATAL 2009»

An empirical analysis of value function-based and policy search reinforcement learning

14 years 2 months ago

Download userweb.cs.utexas.edu

In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

click to vote

CCIA
2005
Springer

117views Artificial Intelligence» more CCIA 2005»

Direct Policy Search Reinforcement Learning for Robot Control

14 years 29 days ago

Download vicorob.udg.es

— This paper proposes a high-level Reinforcement Learning (RL) control system for solving the action selection problem of an autonomous robot. Although the dominant approach, whe...

Andres El-Fakdi, Marc Carreras, Narcís Palo...

claim paper

Read More »

click to vote

ATAL
2009
Springer

135views Intelligent Agents» more ATAL 2009»

Stronger CDA strategies through empirical game-theoretic analysis and reinforcement learning

14 years 2 months ago

Download ai.eecs.umich.edu

We present a general methodology to automate the search for equilibrium strategies in games derived from computational experimentation. Our approach interleaves empirical game-the...

L. Julian Schvartzman, Michael P. Wellman

claim paper

Read More »

click to vote

ICML
2002
IEEE

113views Machine Learning» more ICML 2002»

Learning from Scarce Experience

14 years 8 months ago

Download www.cs.ucr.edu

Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each ...

Leonid Peshkin, Christian R. Shelton

claim paper

Read More »

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

14 years 1 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

« Prev « First page 1 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers