Search Sciweavers | Sciweavers

226 search results - page 12 / 46

» A Convergent Reinforcement Learning Algorithm in the Continu...

click to vote

COLT
2008
Springer

132views Machine Learning» more COLT 2008»

Adaptive Aggregation for Reinforcement Learning with Efficient Exploration: Deterministic Domains

13 years 8 months ago

Download colt2008.cs.helsinki.fi

We propose a model-based learning algorithm, the Adaptive Aggregation Algorithm (AAA), that aims to solve the online, continuous state space reinforcement learning problem in a de...

Andrey Bernstein, Nahum Shimkin

claim paper

Read More »

click to vote

ML
2008
ACM

152views Machine Learning» more ML 2008»

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

13 years 6 months ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, R&ea...

claim paper

Read More »

click to vote

ICML
2007
IEEE

162views Machine Learning» more ICML 2007»

Automatic shaping and decomposition of reward functions

14 years 7 months ago

Download www.machinelearning.org

This paper investigates the problem of automatically learning how to restructure the reward function of a Markov decision process so as to speed up reinforcement learning. We begi...

Bhaskara Marthi

claim paper

Read More »

click to vote

NIPS
2007

149views Information Technology» more NIPS 2007»

Online Linear Regression and Its Application to Model-Based Reinforcement Learning

13 years 8 months ago

Download books.nips.cc

We provide a provably efﬁcient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Speciﬁcally, we take a mo...

Alexander L. Strehl, Michael L. Littman

claim paper

Read More »

click to vote

AI
2002
Springer

171views Artificial Intelligence» more AI 2002»

Multiagent learning using a variable learning rate

13 years 6 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

« Prev « First page 12 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers