Search Sciweavers | Sciweavers

779 search results - page 18 / 156

» Reinforcement Using Supervised Learning for Policy Generaliz...

NECO
2007

150 views

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 6 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

ATAL
2008
Springer

160 views, Intelligent Agents

Sequential decision making in repeated coalition formation under uncertainty

15 years 9 months ago

Download www.aamas-conference.org

The problem of coalition formation when agents are uncertain about the types or capabilities of their potential partners is a critical one. In [3] a Bayesian reinforcement learnin...

Georgios Chalkiadakis, Craig Boutilier

RAS
2010

131 views

Probabilistic Policy Reuse for inter-task transfer learning

15 years 5 months ago

Download scalab.uc3m.es

Policy Reuse is a reinforcement learning technique that efficiently learns a new policy by using past similar learned policies. The Policy Reuse learner improves its exploration b...

Fernando Fernández, Javier García, M...

AI
2006
Springer

119 views, Artificial Intelligence

Partial Local FriendQ Multiagent Learning: Application to Team Automobile Coordination Problem

15 years 11 months ago

Download damas.ift.ulaval.ca

Real world multiagent coordination problems are important issues for reinforcement learning techniques. In general, these problems are partially observable and this characteristic ...

Julien Laumonier, Brahim Chaib-draa

IWLCS
2005
Springer

161 views, Machine Learning

Counter Example for Q-Bucket-Brigade Under Prediction Problem

16 years 22 days ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classifier System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara
