Search Sciweavers | Sciweavers

17 search results - page 1 / 4


JMLR
2008


Value Function Based Reinforcement Learning in Changing Markovian Environments

15 years 6 months ago

Download jmlr.csail.mit.edu

Balázs Csanád Csáji, Lá...


ML
2008
ACM


Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 6 months ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, Ré...


IAT
2005
IEEE


Self-Organizing Cognitive Agents and Reinforcement Learning in Multi-Agent Environment

16 years 15 days ago

Download www3.ntu.edu.sg

This paper presents a self-organizing cognitive architecture, known as TD-FALCON, that learns to function through its interaction with the environment. TD-FALCON learns the value ...

Ah-Hwee Tan, Dan Xiao


ATAL
2008
Springer


Transfer of task representation in reinforcement learning using policy-based proto-value functions

15 years 9 months ago

Download www.aamas-conference.org

Reinforcement Learning research is traditionally devoted to solving single-task problems. Therefore, any time a new task is faced, learning must be restarted from scratch. Recently, ...

Eliseo Ferrante, Alessandro Lazaric, Marcello Rest...


CDC
2010
IEEE


Adaptive bases for Q-learning

15 years 1 month ago

Download webee.technion.ac.il

Abstract. We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor
