Search Sciweavers | Sciweavers

536 search results - page 8 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...

119

Voted

JMLR
2006

153views more JMLR 2006»

Collaborative Multiagent Reinforcement Learning by Payoff Propagation

15 years 2 months ago

Download jmlr.csail.mit.edu

In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...

Jelle R. Kok, Nikos A. Vlassis

claim paper

Read More »

124

Voted

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

14 years 9 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

103

Voted

ATAL
2008
Springer

146views Intelligent Agents» more ATAL 2008»

Adaptive Kanerva-based function approximation for multi-agent systems

15 years 4 months ago

Download www.aamas-conference.org

In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving largescale instanc...

Cheng Wu, Waleed Meleis

claim paper

Read More »

103

click to vote

ICMLA
2007

92views Machine Learning» more ICMLA 2007»

Control of a re-entrant line manufacturing model with a reinforcement learning approach

15 years 4 months ago

Download www.smitlab.uc.edu

This paper presents the application of a reinforcement learning (RL) approach for the near-optimal control of a re-entrant line manufacturing (RLM) model. The RL approach utilizes...

José A. Ramírez-Hernández, Em...

claim paper

Read More »

125

Voted

ECML
2006
Springer

141views Machine Learning» more ECML 2006»

Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks

15 years 6 months ago

Download www.montefiore.ulg.ac.be

Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...

Sébastien Jodogne, Cyril Briquet, Justus H....

claim paper

Read More »

« Prev « First page 8 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers