Search Sciweavers | Sciweavers

473 search results - page 56 / 95

» Optimal policy switching algorithms for reinforcement learni...

164

click to vote

ICRA
2008
IEEE

173views Robotics» more ICRA 2008»

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

16 years 4 days ago

Download www.cs.cmu.edu

— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

137

click to vote

GECCO
2005
Springer

155views Optimization» more GECCO 2005»

Co-evolving recurrent neurons learn deep memory POMDPs

15 years 11 months ago

Download www.idsia.ch

Recurrent neural networks are theoretically capable of learning complex temporal sequences, but training them through gradient-descent is too slow and unstable for practical use i...

Faustino J. Gomez, Jürgen Schmidhuber

claim paper

Read More »

162

click to vote

AAAI
2006

116views Intelligent Agents» more AAAI 2006»

Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping

15 years 7 months ago

Download www.cs.utexas.edu

Transfer learning concerns applying knowledge learned in one task (the source) to improve learning another related task (the target). In this paper, we use structure mapping, a ps...

Yaxin Liu, Peter Stone

claim paper

Read More »

151

click to vote

ICML
1999
IEEE

129views Machine Learning» more ICML 1999»

Implicit Imitation in Multiagent Reinforcement Learning

16 years 6 months ago

Download www.cs.toronto.edu

Imitation is actively being studied as an effective means of learning in multi-agent environments. It allows an agent to learn how to act well (perhaps optimally) by passively obs...

Bob Price, Craig Boutilier

claim paper

Read More »

202

click to vote

ICML
1995
IEEE

196views Machine Learning» more ICML 1995»

Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem

16 years 6 months ago

Download www.idsia.ch

In this paper we introduce Ant-Q, a family of algorithms which present many similarities with Q-learning (Watkins, 1989), and which we apply to the solution of symmetric and asymm...

Luca Maria Gambardella, Marco Dorigo

claim paper

Read More »

« Prev « First page 56 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers