Search Sciweavers | Sciweavers

A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...

Aristidis Likas, Isaac E. Lagaris


NECO
2002

Multiple Model-Based Reinforcement Learning

15 years 5 months ago

Download www.cns.atr.jp

We propose a modular reinforcement learning architecture for non-linear, nonstationary control tasks, which we call multiple model-based reinforcement learning (MMRL). The basic i...

Kenji Doya, Kazuyuki Samejima, Ken-ichi Katagiri, ...


IAT
2003
IEEE

Asymmetric Multiagent Reinforcement Learning

15 years 11 months ago

Download lib.tkk.fi

A gradient-based method for both symmetric and asymmetric multiagent reinforcement learning is introduced in this paper. Symmetric multiagent reinforcement learning addresses the ...

Ville Könönen

