Search Sciweavers | Sciweavers

1236 search results - page 34 / 248

» Opposition-Based Reinforcement Learning

174

click to vote

ICML
2006
IEEE

156views Machine Learning» more ICML 2006»

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

16 years 7 months ago

Download animatlab.lip6.fr

Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (fmdps). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

210

click to vote

ABIALS
2008
Springer

281views Artificial Intelligence» more ABIALS 2008»

Anticipatory Learning Classifier Systems and Factored Reinforcement Learning

15 years 8 months ago

Download www.isir.upmc.fr

Factored Reinforcement Learning (frl) is a new technique to solve Factored Markov Decision Problems (fmdps) when the structure of the problem is not known in advance. Like Anticipa...

Olivier Sigaud, Martin V. Butz, Olga Kozlova, Chri...

claim paper

Read More »

155

click to vote

ICML
2004
IEEE

167views Machine Learning» more ICML 2004»

Bellman goes relational

16 years 7 months ago

Download people.csail.mit.edu

Motivated by the interest in relational reinforcement learning, we introduce a novel relational Bellman update operator called ReBel. It employs a constraint logic programming lan...

Kristian Kersting, Martijn Van Otterlo, Luc De Rae...

claim paper

Read More »

141

click to vote

TSMC
2002

136views more TSMC 2002»

Expertness based cooperative Q-learning

15 years 5 months ago

Download birg2.epfl.ch

By using other agents' experiences and knowledge, a learning agent may learn faster, make fewer mistakes, and create some rules for unseen situations. These benefits would be ...

Majid Nili Ahmadabadi, Masoud Asadpour

claim paper

Read More »

152

click to vote

ECML
2004
Springer

157views Machine Learning» more ECML 2004»

Model Approximation for HEXQ Hierarchical Reinforcement Learning

15 years 11 months ago

Download www.cse.unsw.edu.au

HEXQ is a reinforcement learning algorithm that discovers hierarchical structure automatically. The generated task hierarchy repthe problem at diﬀerent levels of abstraction. In ...

Bernhard Hengst

claim paper

Read More »

« Prev « First page 34 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers