Search Sciweavers | Sciweavers

61 search results - page 11 / 13

» Market-Based Reinforcement Learning in Partially Observable ...

155

click to vote

ICRA
2010
IEEE

153views Robotics» more ICRA 2010»

Learning to navigate through crowded environments

15 years 3 months ago

Download www.cs.washington.edu

— The goal of this research is to enable mobile robots to navigate through crowded environments such as indoor shopping malls, airports, or downtown side walks. The key research ...

Peter Henry, Christian Vollmer, Brian Ferris, Diet...

claim paper

Read More »

151

click to vote

ICML
2004
IEEE

120views Machine Learning» more ICML 2004»

Utile distinction hidden Markov models

16 years 5 months ago

Download www.idsia.ch

This paper addresses the problem of constructing good action selection policies for agents acting in partially observable environments, a class of problems generally known as Part...

Daan Wierstra, Marco Wiering

claim paper

Read More »

140

click to vote

ECML
2006
Springer

84views Machine Learning» more ECML 2006»

Efficient Non-linear Control Through Neuroevolution

15 years 8 months ago

Download www.idsia.ch

Abstract. Many complex control problems are not amenable to traditional controller design. Not only is it difficult to model real systems, but often it is unclear what kind of beha...

Faustino J. Gomez, Jürgen Schmidhuber, Risto ...

claim paper

Read More »

144

click to vote

NN
2010
Springer

125views Neural Networks» more NN 2010»

Parameter-exploring policy gradients

15 years 3 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

claim paper

Read More »

208

click to vote

PKDD
2010
Springer

164views Data Mining» more PKDD 2010»

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

15 years 2 months ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

claim paper

Read More »

« Prev « First page 11 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers