Search Sciweavers | Sciweavers

226 search results - page 42 / 46

» A Convergent Reinforcement Learning Algorithm in the Continu...

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

14 years 8 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

click to vote

ATAL
2010
Springer

129views Intelligent Agents» more ATAL 2010»

Learning multi-agent state space representations

13 years 8 months ago

Download como.vub.ac.be

This paper describes an algorithm, called CQ-learning, which learns to adapt the state representation for multi-agent systems in order to coordinate with other agents. We propose ...

Yann-Michaël De Hauwere, Peter Vrancx, Ann No...

claim paper

Read More »

click to vote

AAMAS
2006
Springer

179views Intelligent Agents» more AAMAS 2006»

An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games

13 years 7 months ago

Download www.personeel.unimaas.nl

In this paper, we investigate Reinforcement learning (RL) in multi-agent systems (MAS) from an evolutionary dynamical perspective. Typical for a MAS is that the environment is not ...

Karl Tuyls, Pieter Jan't Hoen, Bram Vanschoenwinke...

claim paper

Read More »

click to vote

GECCO
2006
Springer

195views Optimization» more GECCO 2006»

Studying XCS/BOA learning in Boolean functions: structure encoding and random Boolean functions

13 years 11 months ago

Download www.coboslab.psychologie.uni-wuerzburg.de

Recently, studies with the XCS classifier system on Boolean functions have shown that in certain types of functions simple crossover operators can lead to disruption and, conseque...

Martin V. Butz, Martin Pelikan

claim paper

Read More »

click to vote

EC
2006

121views ECommerce» more EC 2006»

A Study of Structural and Parametric Learning in XCS

13 years 7 months ago

Download www.cs.bris.ac.uk

The performance of a learning classifier system is due to its two main components. First, it evolves new structures by generating new rules in a genetic process; second, it adjust...

Tim Kovacs, Manfred Kerber

claim paper

Read More »

« Prev « First page 42 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers