Search Sciweavers | Sciweavers

473 search results - page 10 / 95

» Optimal policy switching algorithms for reinforcement learni...

ICAC
2009
IEEE

226 views, Applied Computing

Using distributed w-learning for multi-policy optimization in decentralized autonomic systems

15 years 3 months ago

Download www.scss.tcd.ie

Distributed W-Learning (DWL) is a reinforcement learning-based algorithm for multi-policy optimization in agent-based systems. In this poster we propose the use of DWL for decentra...

Ivana Dusparic, Vinny Cahill

SMC
2007
IEEE

102 views, Control Systems

An improved immune Q-learning algorithm

16 years 9 days ago

Download web2.uwindsor.ca

Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...

Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...

CORR
1998
Springer

164 views, Education

Training Reinforcement Neurocontrollers Using the Polytope Algorithm

15 years 5 months ago

Download zeus.cs.uoi.gr

A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...

Aristidis Likas, Isaac E. Lagaris

JAIR
2008

119 views

A Multiagent Reinforcement Learning Algorithm with Non-linear Dynamics

15 years 6 months ago

Download www.ece.utk.edu

Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Due to the complexity of the problem, the majority of the previo...

Sherief Abdallah, Victor R. Lesser

ICONIP
2009

107 views, Information Technology

Tracking in Reinforcement Learning

15 years 3 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout
