Sciweavers

1235 search results - page 170 / 247
» Reinforcement learning in a nutshell
Sort
View
ISNN
2007
Springer
14 years 4 months ago
Online Dynamic Value System for Machine Learning
A novel online dynamic value system for machine learning is proposed in this paper. The proposed system has a dual network structure: data processing network (DPN) and information ...
Haibo He, Janusz A. Starzyk
ATAL
2004
Springer
14 years 3 months ago
When to Apply the Fifth Commandment: The Effects of Parenting on Genetic and Learning Agents
This paper explores hybrid agents that use a variety of techniques to improve their performance in an environment over time. We considered, specifically, geneticlearning-parentin...
Michael Berger, Jeffrey S. Rosenschein
ESANN
2007
13 years 11 months ago
Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning
In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists out of actor updates which are achieved using natur...
Jan Peters, Stefan Schaal
ML
1998
ACM
136views Machine Learning» more  ML 1998»
13 years 9 months ago
Co-Evolution in the Successful Learning of Backgammon Strategy
Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...
Jordan B. Pollack, Alan D. Blair
MAGS
2010
81views more  MAGS 2010»
13 years 4 months ago
Task allocation learning in a multiagent environment: Application to the RoboCupRescue simulation
Coordinating agents in a complex environment is a hard problem, but it can become even harder when certain characteristics of the tasks, like the required number of agents, are un...
Sébastien Paquet, Brahim Chaib-draa, Patric...