Search Sciweavers | Sciweavers

236 search results - page 32 / 48

» Non-linear dynamics in multiagent reinforcement learning alg...

210

Voted

AAAI
2000

104views Intelligent Agents» more AAAI 2000»

Inter-Layer Learning Towards Emergent Cooperative Behavior

15 years 8 months ago

Download www.cs.cmu.edu

As applications for artificially intelligent agents increase in complexity we can no longer rely on clever heuristics and hand-tuned behaviors to develop their programming. Even t...

Shawn Arseneau, Wei Sun, Changpeng Zhao, Jeremy R....

claim paper

Read More »

240

Voted

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

16 years 1 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

197

click to vote

ATAL
2009
Springer

151views Intelligent Agents» more ATAL 2009»

Multiagent learning in large anonymous games

16 years 2 months ago

Download people.seas.harvard.edu

In large systems, it is important for agents to learn to act effectively, but sophisticated multi-agent learning algorithms generally do not scale. An alternative approach is to �...

Ian A. Kash, Eric J. Friedman, Joseph Y. Halpern

claim paper

Read More »

200

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 7 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

189

click to vote

EUROCAST
2007
Springer

182views Hardware» more EUROCAST 2007»

A k-NN Based Perception Scheme for Reinforcement Learning

16 years 1 months ago

Download www.dia.fi.upm.es

Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...

claim paper

Read More »

« Prev « First page 32 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers