Sciweavers

44 search results - page 4 / 9
» Robot reinforcement learning using EEG-based reward signals
Sort
View
NECO
2007
150views more  NECO 2007»
13 years 6 months ago
Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule
Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...
Dorit Baras, Ron Meir
EUROCAST
2007
Springer
182views Hardware» more  EUROCAST 2007»
14 years 26 days ago
A k-NN Based Perception Scheme for Reinforcement Learning
Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...
José Antonio Martin H., Javier de Lope Asia...
NECO
2007
258views more  NECO 2007»
13 years 6 months ago
Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity
The persistent modification of synaptic efficacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spiketiming-dependent plasticity (...
Razvan V. Florian
IROS
2009
IEEE
154views Robotics» more  IROS 2009»
14 years 1 months ago
Consideration on robotic giant-swing motion generated by reinforcement learning
—This study attempts to make a compact humanoid robot acquire a giant-swing motion without any robotic models by using reinforcement learning; only the interaction with environme...
Masayuki Hara, Naoto Kawabe, Naoki Sakai, Jian Hua...
ICRA
2009
IEEE
143views Robotics» more  ICRA 2009»
14 years 1 months ago
Least absolute policy iteration for robust value function approximation
Abstract— Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...
Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...