On Learning Soccer Strategies


Download igitur-archive.library.uu.nl

We use simulated soccer to study multiagent learning. The players (agents) of each team share an action set and a policy but may behave differently because their inputs depend on position. All agents on a team are rewarded or punished collectively when a goal is scored. We run simulations with varying team sizes and compare two learning algorithms: TD-Q learning with linear neural networks (TD-Q) and Probabilistic Incremental Program Evolution (PIPE). TD-Q is based on evaluation functions (EFs) that map input/action pairs to expected reward, while PIPE searches policy space directly. PIPE uses an adaptive probability distribution to synthesize programs that calculate action probabilities from current inputs. Our results show that TD-Q has difficulty learning appropriate shared EFs. PIPE, however, does not depend on EFs and finds good policies faster and more reliably.
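
To make the idea of a shared linear evaluation function concrete, the following is a minimal sketch of a generic one-step Q-learning update with one linear unit per action, where every agent on a team updates the same weights using the same team reward. It is not taken from the paper: the class name `SharedLinearQ`, the helper `team_update`, the learning rate, the discount factor, and the toy inputs are all assumptions for illustration, and the paper's TD-Q variant may differ in details the abstract does not give.

```python
class SharedLinearQ:
    """Hypothetical shared EF: Q(x, a) = w[a] . x, one weight vector per action."""

    def __init__(self, n_inputs, n_actions, lr=0.1, gamma=0.9):
        # lr and gamma are illustrative values, not taken from the paper.
        self.w = [[0.0] * n_inputs for _ in range(n_actions)]
        self.lr = lr
        self.gamma = gamma

    def q(self, x, a):
        # Linear estimate of expected reward for input vector x and action a.
        return sum(wi * xi for wi, xi in zip(self.w[a], x))

    def update(self, x, a, reward, x_next, terminal):
        # One-step Q-learning target: reward plus discounted best next value.
        if terminal:
            best_next = 0.0
        else:
            best_next = max(self.q(x_next, b) for b in range(len(self.w)))
        td_error = reward + self.gamma * best_next - self.q(x, a)
        # Gradient step on the weights of the action that was taken.
        self.w[a] = [wi + self.lr * td_error * xi
                     for wi, xi in zip(self.w[a], x)]
        return td_error


def team_update(ef, transitions, team_reward, terminal):
    """Apply the same collective reward to every agent's transition.

    transitions: list of (x, action, x_next) tuples, one per agent.
    All agents write into the same EF, which is what "shared" means here.
    """
    for x, a, x_next in transitions:
        ef.update(x, a, team_reward, x_next, terminal)
```

Because all agents write into one set of weights, updates driven by different position-dependent inputs can pull the same EF in different directions, which is one plausible reading of why shared EFs may be hard to learn.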

Rafal Salustowicz, Marco Wiering, Jürgen Schmidhuber


Appropriate Shared Efs | ICANN 1997 | Neural Networks | PIPE Searches Policy | Probabilistic Incremental Program |


» Game Theorybased Data Mining Technique for Strategy Making of a Soccer Simulation Coach Ag...

» Making a Robot Learn to Play Soccer Using Reward and Punishment

» Learning to Select Negotiation Strategies in Multiagent Meeting Scheduling

» Reinforcement Learning Soccer Teams with Incomplete World Models

» Eventdriven learning classifier systems for online soccer games

» Reward allotment in an eventdriven hybrid learning classifier system for online soccer gam...

» The Evolution of a Robot Soccer Team

» Making soccer kicks better a study in particle swarm optimization and evolution strategies

Post Info

Added	08 Aug 2010
Updated	08 Aug 2010
Type	Conference
Year	1997
Where	ICANN
Authors	Rafal Salustowicz, Marco Wiering, Jürgen Schmidhuber


Sciweavers
