Search Sciweavers | Sciweavers

164 search results - page 13 / 33

» Self-Optimizing Memory Controllers: A Reinforcement Learning...

143

click to vote

ATAL
2009
Springer

198views Intelligent Agents» more ATAL 2009»

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

16 years 1 months ago

Download www.aamas-conference.org

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh

claim paper

Read More »

170

click to vote

ICML
2008
IEEE

122views Machine Learning» more ICML 2008»

Reinforcement learning in the presence of rare events

16 years 7 months ago

Download www.ece.mcgill.ca

We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the controlling agent. If these ev...

Jordan Frank, Shie Mannor, Doina Precup

claim paper

Read More »

202

click to vote

ICCV
2003
IEEE

156views Computer Vision» more ICCV 2003»

Reinforcement Learning for Combining Relevance Feedback Techniques

15 years 12 months ago

Download lear.inrialpes.fr

Relevance feedback (RF) is an interactive process which refines the retrievals by utilizing user’s feedback history. Most researchers strive to develop new RF techniques and ign...

Peng-Yeng Yin, Bir Bhanu, Kuang-Cheng Chang, Anlei...

claim paper

Read More »

159

click to vote

ICES
2003
Springer

125views Hardware» more ICES 2003»

Evolving Reinforcement Learning-Like Abilities for Robots

15 years 12 months ago

Download lis.epfl.ch

Abstract. In [8] Yamauchi and Beer explored the abilities of continuous time recurrent neural networks (CTRNNs) to display reinforcementlearning like abilities. The investigated ta...

Jesper Blynel

claim paper

Read More »

167

Voted

KBS
2006

105views more KBS 2006»

Robot docking based on omnidirectional vision and reinforcement learning

15 years 6 months ago

Download www.eecs.wsu.edu

We present a system for visual robotic docking using an omnidirectional camera coupled with the actor critic reinforcement learning algorithm. The system enables a PeopleBot robot...

David Muse, Cornelius Weber, Stefan Wermter

claim paper

Read More »

« Prev « First page 13 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers