Search Sciweavers | Sciweavers

178 search results - page 14 / 36

» Probabilistic policy reuse in a reinforcement learning agent

FLAIRS
2007

164 views, Artificial Intelligence

A Generalizing Spatial Representation for Robot Navigation with Reinforcement Learning

13 years 11 months ago

Download www.aussagekraft.de

In robot navigation tasks, the representation of the surrounding world plays an important role, especially in reinforcement learning approaches. This work presents a qualitative r...

Lutz Frommberger

AAAI
2000

139 views, Intelligent Agents

Localizing Search in Reinforcement Learning

13 years 10 months ago

Download www.cs.colorado.edu

Reinforcement learning (RL) can be impractical for many high-dimensional problems because of the computational cost of stochastic search in large state spaces. We propose a ...

Gregory Z. Grudic, Lyle H. Ungar

ATAL
2009
Springer

146 views, Intelligent Agents

Online exploration in least-squares policy iteration

14 years 3 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

IROS
2006
IEEE

187 views, Robotics

Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic

14 years 2 months ago

Download hawaii.aist-nara.ac.jp

Recently, many researchers in humanoid robotics have become interested in Quasi-Passive-Dynamic Walking (Quasi-PDW), which is similar to human walking. It is desirable that control para...

Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, To...

ICML
2005
IEEE

119 views, Machine Learning

Dynamic preferences in multi-criteria reinforcement learning

14 years 9 months ago

Download www.machinelearning.org

The current framework of reinforcement learning is based on maximizing the expected return computed from scalar rewards. But in many real-world situations, tradeoffs must be made amon...

Sriraam Natarajan, Prasad Tadepalli
