Search Sciweavers | Sciweavers

113 search results - page 8 / 23

» Learning Representation and Control in Continuous Markov Dec...

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

14 years 1 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

click to vote

CORR
2012
Springer

235views Education» more CORR 2012»

An Incremental Sampling-based Algorithm for Stochastic Optimal Control

12 years 4 months ago

Download www.mit.edu

Abstract— In this paper, we consider a class of continuoustime, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation ...

Vu Anh Huynh, Sertac Karaman, Emilio Frazzoli

claim paper

Read More »

click to vote

ICRA
2007
IEEE

126views Robotics» more ICRA 2007»

A formal framework for robot learning and control under model uncertainty

14 years 3 months ago

Download www.cs.mcgill.ca

— While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

click to vote

GECCO
2005
Springer

130views Optimization» more GECCO 2005»

ATNoSFERES revisited

14 years 2 months ago

Download www.cs.bham.ac.uk

ATNoSFERES is a Pittsburgh style Learning Classiﬁer System (LCS) in which the rules are represented as edges of an Augmented Transition Network. Genotypes are strings of tokens ...

Samuel Landau, Olivier Sigaud, Marc Schoenauer

claim paper

Read More »

click to vote

NIPS
2007

149views Information Technology» more NIPS 2007»

Online Linear Regression and Its Application to Model-Based Reinforcement Learning

13 years 10 months ago

Download books.nips.cc

We provide a provably efﬁcient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Speciﬁcally, we take a mo...

Alexander L. Strehl, Michael L. Littman

claim paper

Read More »

« Prev « First page 8 / 23 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers