Sciweavers

2045 search results - page 18 / 409
» Learning programming with Erlang
Sort
View
ESANN
2006
13 years 9 months ago
Reducing policy degradation in neuro-dynamic programming
We focus on neuro-dynamic programming methods to learn state-action value functions and outline some of the inherent problems to be faced, when performing reinforcement learning in...
Thomas Gabel, Martin Riedmiller
COLT
1992
Springer
13 years 11 months ago
PAC-Learnability of Determinate Logic Programs
Saso Dzeroski, Stephen Muggleton, Stuart J. Russel...
GECCO
2011
Springer
276views Optimization» more  GECCO 2011»
12 years 11 months ago
Evolution of reward functions for reinforcement learning
The reward functions that drive reinforcement learning systems are generally derived directly from the descriptions of the problems that the systems are being used to solve. In so...
Scott Niekum, Lee Spector, Andrew G. Barto