Search Sciweavers | Sciweavers

93 search results - page 11 / 19

» Trajectory Optimization using Reinforcement Learning for Map...

click to vote

ICML
2009
IEEE

155views Machine Learning» more ICML 2009»

Near-Bayesian exploration in polynomial time

14 years 8 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

click to vote

ALIFE
2002

176views Modeling And Simulation» more ALIFE 2002»

Ant Colony Optimization and Stochastic Gradient Descent

13 years 7 months ago

Download ti.arc.nasa.gov

In this paper, we study the relationship between the two techniques known as ant colony optimization (aco) and stochastic gradient descent. More precisely, we show that some empir...

Nicolas Meuleau, Marco Dorigo

claim paper

Read More »

click to vote

NIPS
2004

92views Information Technology» more NIPS 2004»

Responding to Modalities with Different Latencies

13 years 8 months ago

Download books.nips.cc

Motor control depends on sensory feedback in multiple modalities with different latencies. In this paper we consider within the framework of reinforcement learning how different s...

Fredrik Bissmarck, Hiroyuki Nakahara, Kenji Doya, ...

claim paper

Read More »

click to vote

RAS
2010

131views more RAS 2010»

Probabilistic Policy Reuse for inter-task transfer learning

13 years 5 months ago

Download scalab.uc3m.es

Policy Reuse is a reinforcement learning technique that eﬃciently learns a new policy by using past similar learned policies. The Policy Reuse learner improves its exploration b...

Fernando Fernández, Javier García, M...

claim paper

Read More »

click to vote

ICML
2006
IEEE

193views Machine Learning» more ICML 2006»

Maximum margin planning

14 years 8 months ago

Download www.cs.cmu.edu

Mobile robots often rely upon systems that render sensor data and perceptual features into costs that can be used in a planner. The behavior that a designer wishes the planner to ...

Nathan D. Ratliff, J. Andrew Bagnell, Martin Zinke...

claim paper

Read More »

« Prev « First page 11 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers