Search Sciweavers | Sciweavers

171

CORR
2011
Springer

136views Education» more CORR 2011»

Reinforcement Learning for Agents with Many Sensors and Actuators Acting in Categorizable Environments

14 years 9 months ago

In this paper, we confront the problem of applying reinforcement learning to agents that perceive the environment through many sensors and that can perform parallel actions using ...

Enric Celaya, Josep M. Porta

claim paper

Read More »

126

click to vote

ECML
2005
Springer

95views Machine Learning» more ECML 2005»

Towards Finite-Sample Convergence of Direct Reinforcement Learning

15 years 11 months ago

Download www.cs.uiuc.edu

Abstract. While direct, model-free reinforcement learning often performs better than model-based approaches in practice, only the latter have yet supported theoretical guarantees f...

Shiau Hong Lim, Gerald DeJong

claim paper

Read More »

148

click to vote

ATAL
2006
Springer

103views Intelligent Agents» more ATAL 2006»

Rule value reinforcement learning for cognitive agents

15 years 9 months ago

Download vega.soi.city.ac.uk

RVRL (Rule Value Reinforcement Learning) is a new algorithm which extends an existing learning framework that models the environment of a situated agent using a probabilistic rule...

Christopher Child, Kostas Stathis

claim paper

Read More »

139

click to vote

ILP
2000
Springer

130views Automated Reasoning» more ILP 2000»

Using ILP to Improve Planning in Hierarchical Reinforcement Learning

15 years 9 months ago

Download mark.reid.name

Hierarchical reinforcement learning has been proposed as a solution to the problem of scaling up reinforcement learning. The RLTOPs Hierarchical Reinforcement Learning System is an...

Mark D. Reid, Malcolm R. K. Ryan

claim paper

Read More »

191

click to vote

WSC
2007

166views Modeling And Simulation» more WSC 2007»

Optimizing time warp simulation with reinforcement learning techniques

15 years 8 months ago

Download www.informs-sim.org

Adaptive Time Warp protocols in the literature are usually based on a pre-deﬁned analytic model of the system, expressed as a closed form function that maps system state to cont...

Jun Wang, Carl Tropper

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers