Search Sciweavers | Sciweavers

85 search results - page 11 / 17

» Solving Stochastic Planning Problems with Large State and Ac...

click to vote

ICML
2001
IEEE

159views Machine Learning» more ICML 2001»

Direct Policy Search using Paired Statistical Tests

14 years 8 months ago

Download www.autonlab.org

Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...

Malcolm J. A. Strens, Andrew W. Moore

claim paper

Read More »

click to vote

ATAL
2007
Springer

162views Intelligent Agents» more ATAL 2007»

Model-based function approximation in reinforcement learning

14 years 1 months ago

Download userweb.cs.utexas.edu

Reinforcement learning promises a generic method for adapting agents to arbitrary tasks in arbitrary stochastic environments, but applying it to new real-world problems remains di...

Nicholas K. Jong, Peter Stone

claim paper

Read More »

click to vote

IROS
2009
IEEE

125views Robotics» more IROS 2009»

A tale of two planners: Modular robotic planning with LDP

14 years 2 months ago

Download www.cs.cmu.edu

Abstract— LDP (Locally Distributed Predicates) is a distributed, high-level language for programming modular reconﬁgurable robot systems (MRRs). In this paper we present the im...

Michael DeRosa, Seth Copen Goldstein, Peter Lee, P...

claim paper

Read More »

click to vote

AIPS
2000

130views Artificial Intelligence» more AIPS 2000»

New Results about LCGP, a Least Committed GraphPlan

13 years 8 months ago

Download v.vidal.free.fr

Planners from the family of Graphplan (Graphplan, IPP, STAN...) are presently considered as the most efficient ones on numerous planning domains. Their partially ordered plans can...

Michel Cayrol, Pierre Régnier, Vincent Vida...

claim paper

Read More »

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

13 years 2 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

« Prev « First page 11 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers