Search Sciweavers | Sciweavers

106 search results - page 10 / 22

» Performance Bounded Reinforcement Learning in Strategic Inte...

121

click to vote

IJCNN
2006
IEEE

121views Neural Networks» more IJCNN 2006»

Learning a Rendezvous Task with Dynamic Joint Action Perception

15 years 10 months ago

Download axon.cs.byu.edu

Abstract— Groups of reinforcement learning agents interacting in a common environment often fail to learn optimal behaviors. Poor performance is particularly common in environmen...

Nancy Fulda, Dan Ventura

claim paper

Read More »

146

click to vote

NIPS
2008

165views Information Technology» more NIPS 2008»

Regularized Policy Iteration

15 years 5 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

129

click to vote

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Finite-Sample Analysis of LSTD

15 years 5 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

134

click to vote

ATAL
2008
Springer

131views Intelligent Agents» more ATAL 2008»

A new perspective to the keepaway soccer: the takers

15 years 6 months ago

Download www.aamas-conference.org

Keepaway is a sub-problem of RoboCup Soccer Simulator in which 'the keepers' try to maintain the possession of the ball, while 'the takers' try to steal the ba...

Atil Iscen, Umut Erogul

claim paper

Read More »

159

click to vote

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

16 years 1 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

« Prev « First page 10 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers