Search Sciweavers | Sciweavers

473 search results - page 23 / 95

» Optimal policy switching algorithms for reinforcement learni...

133

click to vote

AAAI
2007

142views Intelligent Agents» more AAAI 2007»

Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison

15 years 8 months ago

Download staff.science.uva.nl

Reinforcement learning (RL) methods have become popular in recent years because of their ability to solve complex tasks with minimal feedback. Both genetic algorithms (GAs) and te...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

168

click to vote

ICMLA
2008

195views Machine Learning» more ICMLA 2008»

Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture

15 years 7 months ago

Download www.grappa.univ-lille3.fr

In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data i...

Sertan Girgin, Philippe Preux

claim paper

Read More »

199

click to vote

AAAI
2011

206views Intelligent Agents» more AAAI 2011»

Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs

14 years 6 months ago

Download www.cs.umass.edu

In many multi-agent applications such as distributed sensor nets, a network of agents act collaboratively under uncertainty and local interactions. Networked Distributed POMDP (ND...

Chongjie Zhang, Victor R. Lesser

claim paper

Read More »

147

click to vote

ECAI
2006
Springer

89views Artificial Intelligence» more ECAI 2006»

Learning by Automatic Option Discovery from Conditionally Terminating Sequences

15 years 9 months ago

Download www.ceng.metu.edu.tr

Abstract. This paper proposes a novel approach to discover options in the form of conditionally terminating sequences, and shows how they can be integrated into reinforcement learn...

Sertan Girgin, Faruk Polat, Reda Alhajj

claim paper

Read More »

166

click to vote

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

16 years 6 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

« Prev « First page 23 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers