Sciweavers

1630 search results - page 189 / 326
» Coordinated Reinforcement Learning
Sort
View
ISNN
2007
Springer
15 years 10 months ago
Online Dynamic Value System for Machine Learning
A novel online dynamic value system for machine learning is proposed in this paper. The proposed system has a dual network structure: data processing network (DPN) and information ...
Haibo He, Janusz A. Starzyk
ATAL
2004
Springer
15 years 9 months ago
When to Apply the Fifth Commandment: The Effects of Parenting on Genetic and Learning Agents
This paper explores hybrid agents that use a variety of techniques to improve their performance in an environment over time. We considered, specifically, geneticlearning-parentin...
Michael Berger, Jeffrey S. Rosenschein
ESANN
2007
15 years 5 months ago
Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning
In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists out of actor updates which are achieved using natur...
Jan Peters, Stefan Schaal
ML
1998
ACM
136views Machine Learning» more  ML 1998»
15 years 3 months ago
Co-Evolution in the Successful Learning of Backgammon Strategy
Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...
Jordan B. Pollack, Alan D. Blair
ICML
2002
IEEE
16 years 4 months ago
Learning from Scarce Experience
Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each ...
Leonid Peshkin, Christian R. Shelton