Search Sciweavers | Sciweavers

1235 search results - page 139 / 247

» Reinforcement learning in a nutshell

Voted

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

Relational temporal difference learning

16 years 4 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

108

click to vote

ICCBR
2009
Springer

159views Automated Reasoning» more ICCBR 2009»

Case-Based Reasoning in Transfer Learning

15 years 10 months ago

Download www.knexusresearch.com

Positive transfer learning (TL) occurs when, after gaining experience from learning how to solve a (source) task, the same learner can exploit this experience to improve performanc...

David W. Aha, Matthew Molineaux, Gita Sukthankar

claim paper

Read More »

113

Voted

IJCNN
2006
IEEE

121views Neural Networks» more IJCNN 2006»

Learning a Rendezvous Task with Dynamic Joint Action Perception

15 years 9 months ago

Download axon.cs.byu.edu

Abstract— Groups of reinforcement learning agents interacting in a common environment often fail to learn optimal behaviors. Poor performance is particularly common in environmen...

Nancy Fulda, Dan Ventura

claim paper

Read More »

119

Voted

ECML
1997
Springer

79views Machine Learning» more ECML 1997»

Ibots Learn Genuine Team Solutions

15 years 7 months ago

Download www.idsia.ch

\Ibots" (Integrating roBOTS) is a computer experiment in group learning. It is designed to understand how to use reinforcement learning to program automatically a team of robo...

Cristina Versino, Luca Maria Gambardella

claim paper

Read More »

126

Voted

ICMAS
1998

157views Intelligent Agents» more ICMAS 1998»

The Moving Target Function Problem in Multi-Agent Learning

15 years 5 months ago

Download jmvidal.cse.sc.edu

We describe a framework that can be used to model and predict the behavior of MASs with learning agents. It uses a difference equation for calculating the progression of an agent&...

José M. Vidal, Edmund H. Durfee

claim paper

Read More »

« Prev « First page 139 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers