Search Sciweavers | Sciweavers

178 search results - page 16 / 36

» Probabilistic policy reuse in a reinforcement learning agent

click to vote

ATAL
2009
Springer

198views Intelligent Agents» more ATAL 2009»

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

14 years 3 months ago

Download www.aamas-conference.org

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh

claim paper

Read More »

click to vote

ICML
1997
IEEE

194views Machine Learning» more ICML 1997»

Hierarchical Explanation-Based Reinforcement Learning

14 years 9 months ago

Download reference.kfupm.edu.sa

Explanation-Based Reinforcement Learning (EBRL) was introduced by Dietterich and Flann as a way of combining the ability of Reinforcement Learning (RL) to learn optimal plans with...

Prasad Tadepalli, Thomas G. Dietterich

claim paper

Read More »

click to vote

ICMLA
2009

185views Machine Learning» more ICMLA 2009»

Automatic Feature Selection for Model-Based Reinforcement Learning in Factored MDPs

13 years 6 months ago

Download staff.science.uva.nl

Abstract--Feature selection is an important challenge in machine learning. Unfortunately, most methods for automating feature selection are designed for supervised learning tasks a...

Mark Kroon, Shimon Whiteson

claim paper

Read More »

click to vote

ECAI
2008
Springer

124views Artificial Intelligence» more ECAI 2008»

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

13 years 10 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo

claim paper

Read More »

click to vote

ATAL
2008
Springer

138views Intelligent Agents» more ATAL 2008»

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

13 years 10 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

« Prev « First page 16 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers