Search Sciweavers | Sciweavers

71 search results - page 1 / 15

ICML
1998
IEEE

An Analysis of Direct Reinforcement Learning in Non-Markovian Domains

14 years 8 months ago

Mark D. Pendrith, Michael McGarity

NIPS
2001

Reinforcement Learning with Long Short-Term Memory

13 years 9 months ago

This paper presents reinforcement learning with a Long Short-Term Memory recurrent neural network: RL-LSTM. Model-free RL-LSTM using Advantage learning and directed exploration can...

Bram Bakker

AGENTS
1999
Springer

Team-Partitioned, Opaque-Transition Reinforcement Learning

13 years 12 months ago

In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of usin...

Peter Stone, Manuela M. Veloso

JCP
2007

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

13 years 7 months ago

We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-best-paths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

ICML
2008
IEEE

A worst-case comparison between temporal difference and residual gradient with linear function approximation

14 years 8 months ago

Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them ex...

Lihong Li
