Search Sciweavers | Sciweavers

215 search results - page 8 / 43

» Model-Based Reinforcement Learning with Continuous States an...

144

click to vote

ICANNGA
2007
Springer

105views Algorithms» more ICANNGA 2007»

Reinforcement Learning in Fine Time Discretization

15 years 11 months ago

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...

Pawel Wawrzynski

claim paper

Read More »

154

click to vote

ICANN
2005
Springer

151views Neural Networks» more ICANN 2005»

Reinforcement Learning in MirrorBot

15 years 10 months ago

Download fias.uni-frankfurt.de

For this special session of EU projects in the area of NeuroIT, we will review the progress of the MirrorBot project with special emphasis on its relation to reinforcement learning...

Cornelius Weber, David Muse, Mark Elshaw, Stefan W...

claim paper

Read More »

143

click to vote

AAAI
2010

173views Intelligent Agents» more AAAI 2010»

Integrating Sample-Based Planning and Model-Based Reinforcement Learning

15 years 6 months ago

Download paul.rutgers.edu

Recent advancements in model-based reinforcement learning have shown that the dynamics of many structured domains (e.g. DBNs) can be learned with tractable sample complexity, desp...

Thomas J. Walsh, Sergiu Goschin, Michael L. Littma...

claim paper

Read More »

122

click to vote

PRIMA
2009
Springer

102views Intelligent Agents» more PRIMA 2009»

Recursive Adaptation of Stepsize Parameter for Non-stationary Environments

15 years 11 months ago

Download teamcore.usc.edu

In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...

Itsuki Noda

claim paper

Read More »

123

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 5 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

« Prev « First page 8 / 43 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers