Search Sciweavers | Sciweavers

» Tracking in Reinforcement Learning

ICML
2006
IEEE

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

Download animatlab.lip6.fr

Recent decision-theoretic planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (fmdps). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

ICMLA
2009

The Neuro Slot Car Racer: Reinforcement Learning in a Real World Setting

Download www.timkietzmann.de

This paper describes a novel real-world reinforcement learning application: The Neuro Slot Car Racer. In addition to presenting the system and first results based on Neural Fitted...

Tim C. Kietzmann, Martin Riedmiller

ABIALS
2008
Springer

Anticipatory Learning Classifier Systems and Factored Reinforcement Learning

Download www.isir.upmc.fr

Factored Reinforcement Learning (frl) is a new technique to solve Factored Markov Decision Problems (fmdps) when the structure of the problem is not known in advance. Like Anticipa...

Olivier Sigaud, Martin V. Butz, Olga Kozlova, Chri...

ICML
2004
IEEE

Bellman goes relational

Download people.csail.mit.edu

Motivated by the interest in relational reinforcement learning, we introduce a novel relational Bellman update operator called ReBel. It employs a constraint logic programming lan...

Kristian Kersting, Martijn Van Otterlo, Luc De Rae...

ECML
2004
Springer

Model Approximation for HEXQ Hierarchical Reinforcement Learning

Download www.cse.unsw.edu.au

HEXQ is a reinforcement learning algorithm that discovers hierarchical structure automatically. The generated task hierarchy represents the problem at different levels of abstraction. In ...

Bernhard Hengst
