Sciweavers

748 search results - page 22 / 150
» A Reinforcement Learning Algorithm for Spiking Neural Networ...
Sort
View
ICANN
2010
Springer
13 years 7 months ago
Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients
Abstract. Developing superior artificial board-game players is a widelystudied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...
Mandy Grüttner, Frank Sehnke, Tom Schaul, J&u...
AI
2002
Springer
13 years 7 months ago
Programming backgammon using self-teaching neural nets
TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results. Starting from random initial play, TD...
Gerald Tesauro
ICANN
2003
Springer
14 years 16 days ago
Optimal Hebbian Learning: A Probabilistic Point of View
Many activity dependent learning rules have been proposed in order to model long-term potentiation (LTP). Our aim is to derive a spike time dependent learning rule from a probabili...
Jean-Pascal Pfister, David Barber, Wulfram Gerstne...
NN
2002
Springer
113views Neural Networks» more  NN 2002»
13 years 7 months ago
Control of exploitation-exploration meta-parameter in reinforcement learning
In reinforcement learning (RL), the duality between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance betwe...
Shin Ishii, Wako Yoshida, Junichiro Yoshimoto
NN
2007
Springer
105views Neural Networks» more  NN 2007»
13 years 6 months ago
Guiding exploration by pre-existing knowledge without modifying reward
Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many app...
Kary Främling