Sciweavers

164 search results - page 13 / 33
» Self-Optimizing Memory Controllers: A Reinforcement Learning...
Sort
View
ATAL
2009
Springer
14 years 2 months ago
SarsaLandmark: an algorithm for learning in POMDPs with landmarks
Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...
Michael R. James, Satinder P. Singh
ICML
2008
IEEE
14 years 8 months ago
Reinforcement learning in the presence of rare events
We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the controlling agent. If these ev...
Jordan Frank, Shie Mannor, Doina Precup
ICCV
2003
IEEE
14 years 25 days ago
Reinforcement Learning for Combining Relevance Feedback Techniques
Relevance feedback (RF) is an interactive process which refines the retrievals by utilizing user’s feedback history. Most researchers strive to develop new RF techniques and ign...
Peng-Yeng Yin, Bir Bhanu, Kuang-Cheng Chang, Anlei...
ICES
2003
Springer
125views Hardware» more  ICES 2003»
14 years 22 days ago
Evolving Reinforcement Learning-Like Abilities for Robots
Abstract. In [8] Yamauchi and Beer explored the abilities of continuous time recurrent neural networks (CTRNNs) to display reinforcementlearning like abilities. The investigated ta...
Jesper Blynel
KBS
2006
105views more  KBS 2006»
13 years 7 months ago
Robot docking based on omnidirectional vision and reinforcement learning
We present a system for visual robotic docking using an omnidirectional camera coupled with the actor critic reinforcement learning algorithm. The system enables a PeopleBot robot...
David Muse, Cornelius Weber, Stefan Wermter