Sciweavers

343 search results - page 21 / 69
» Action discovery for reinforcement learning
Sort
View
NN
2002
Springer
113views Neural Networks» more  NN 2002»
13 years 7 months ago
Control of exploitation-exploration meta-parameter in reinforcement learning
In reinforcement learning (RL), the duality between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance betwe...
Shin Ishii, Wako Yoshida, Junichiro Yoshimoto
ECML
2006
Springer
13 years 11 months ago
Skill Acquisition Via Transfer Learning and Advice Taking
We describe a reinforcement learning system that transfers skills from a previously learned source task to a related target task. The system uses inductive logic programming to ana...
Lisa Torrey, Jude W. Shavlik, Trevor Walker, Richa...
NN
2006
Springer
13 years 7 months ago
The misbehavior of value and the discipline of the will
Most reinforcement learning models of animal conditioning operate under the convenient, though fictive, assumption that Pavlovian conditioning concerns prediction learning whereas...
Peter Dayan, Yael Niv, Ben Seymour, Nathaniel D. D...
AR
2008
118views more  AR 2008»
13 years 7 months ago
Efficient Behavior Learning Based on State Value Estimation of Self and Others
The existing reinforcement learning methods have been seriously suffering from the curse of dimension problem especially when they are applied to multiagent dynamic environments. ...
Yasutake Takahashi, Kentarou Noma, Minoru Asada
IWANN
1999
Springer
13 years 12 months ago
Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning
To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...
R. Matthew Kretchmar, Charles W. Anderson