Search Sciweavers | Sciweavers

61 search results - page 10 / 13

» Market-Based Reinforcement Learning in Partially Observable ...

172

click to vote

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

15 years 3 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identiﬁcation. In practical applications...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

163

click to vote

AIIDE
2009

297views Artificial Intelligence» more AIIDE 2009»

IMPLANT: An Integrated MDP and POMDP Learning AgeNT for Adaptive Games

15 years 2 months ago

Download www.comp.nus.edu.sg

This paper proposes an Integrated MDP and POMDP Learning AgeNT (IMPLANT) architecture for adaptation in modern games. The modern game world basically involves a human player actin...

Chek Tien Tan, Ho-Lun Cheng

claim paper

Read More »

136

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

15 years 10 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

145

click to vote

JMLR
2008

141views more JMLR 2008»

Accelerated Neural Evolution through Cooperatively Coevolved Synapses

15 years 4 months ago

Download www.idsia.ch

Many complex control problems require sophisticated solutions that are not amenable to traditional controller design. Not only is it difficult to model real world systems, but oft...

Faustino J. Gomez, Jürgen Schmidhuber, Risto ...

claim paper

Read More »

149

click to vote

ICMLA
2004

114views Machine Learning» more ICMLA 2004»

Planning with predictive state representations

15 years 5 months ago

Download www.eecs.umich.edu

Predictive state representation (PSR) models for controlled dynamical systems have recently been proposed as an alternative to traditional models such as partially observable Mark...

Michael R. James, Satinder P. Singh, Michael L. Li...

claim paper

Read More »

« Prev « First page 10 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers