Sciweavers

» Reinforcement learning in a nutshell

ATAL
2009
Springer

Intelligent Agents

Learning with whom to communicate using relational reinforcement learning

15 years 9 months ago

Download www.aamas-conference.org

Marc J. V. Ponsen, Tom Croonenborghs, Karl Tuyls, ...

FLAIRS
2003

Artificial Intelligence

Subgoal Discovery for Hierarchical Reinforcement Learning Using Learned Policies

15 years 4 months ago

Download www.cse.uta.edu

Sandeep Goel, Manfred Huber

NIPS
1996

Information Technology

Learning Decision Theoretic Utilities through Reinforcement Learning

15 years 4 months ago

Download papers.cnl.salk.edu

Magnus Stensmo, Terrence J. Sejnowski

NIPS
1998

Information Technology

Gradient Descent for General Reinforcement Learning

15 years 4 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcement-learning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

AUSAI
2005
Springer

Artificial Intelligence

Global Versus Local Constructive Function Approximation for On-Line Reinforcement Learning

15 years 8 months ago

Download eprints.utas.edu.au

In order to scale to problems with large or continuous state-spaces, reinforcement learning algorithms need to be combined with function approximation techniques. The majority of...

Peter Vamplew, Robert Ollington
