Search Sciweavers | Sciweavers

61 search results - page 9 / 13

» Market-Based Reinforcement Learning in Partially Observable ...

ICML
2002
IEEE

Machine Learning

Learning from Scarce Experience

16 years 5 months ago

Download www.cs.ucr.edu

Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each ...

Leonid Peshkin, Christian R. Shelton

ICML
2004
IEEE

Machine Learning

Learning low dimensional predictive representations

16 years 5 months ago

Download www.cs.cmu.edu

Predictive state representations (PSRs) have recently been proposed as an alternative to partially observable Markov decision processes (POMDPs) for representing the state of a dy...

Matthew Rosencrantz, Geoffrey J. Gordon, Sebastian...

ICANN
2007
Springer

Neural Networks

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 10 months ago

Download www.idsia.ch

This paper presents Recurrent Policy Gradients, a model-free reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

IROS
2007
IEEE

Robotics

Emulation and behavior understanding through shared values

15 years 10 months ago

Download www.er.ams.eng.osaka-u.ac.jp

Neurophysiology has revealed the existence of mirror neurons in the brain of macaque monkeys, and they show similar activities during execution and observation of goal-directed move...

Yasutake Takahashi, Teruyasu Kawamata, Minoru Asad...

SIGECOM
2006
ACM

ECommerce

Playing games in many possible worlds

15 years 10 months ago

Download lpd.epfl.ch

In traditional game theory, players are typically endowed with exogenously given knowledge of the structure of the game, either full omniscient knowledge or partial but fixed in...

Matt Lepinski, David Liben-Nowell, Seth Gilbert, A...
