Sciweavers

1236 search results - page 210 / 248
» Opposition-Based Reinforcement Learning
Sort
View
ICML
2008
IEEE
14 years 10 months ago
Sample-based learning and search with permanent and transient memories
We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...
David Silver, Martin Müller 0003, Richard S. ...
ATAL
2008
Springer
13 years 12 months ago
Adaptive Kanerva-based function approximation for multi-agent systems
In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving largescale instanc...
Cheng Wu, Waleed Meleis
SIGCSE
2009
ACM
119views Education» more  SIGCSE 2009»
14 years 10 months ago
Implications of integrating test-driven development into CS1/CS2 curricula
Many academic and industry professionals have called for more testing in computer science curricula. Test-driven development (TDD) has been proposed as a solution to improve testi...
Chetan Desai, David S. Janzen, John Clements
GECCO
2009
Springer
14 years 4 months ago
On the scalability of XCS(F)
Many successful applications have proven the potential of Learning Classifier Systems and the XCS classifier system in particular in datamining, reinforcement learning, and func...
Patrick O. Stalph, Martin V. Butz, David E. Goldbe...
IROS
2007
IEEE
164views Robotics» more  IROS 2007»
14 years 4 months ago
Emulation and behavior understanding through shared values
— Neurophysiology has revealed the existence of mirror neurons in brain of macaque monkeys and they shows similar activities during executing an observation of goal directed move...
Yasutake Takahashi, Teruyasu Kawamata, Minoru Asad...