Search Sciweavers | Sciweavers

231 search results - page 33 / 47

» Active Learning in Partially Observable Markov Decision Proc...

NECO
2007

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

13 years 7 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

RSS
2007

Active Policy Learning for Robot Planning and Exploration under Uncertainty

13 years 9 months ago

Download www.roboticsproceedings.org

This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partially-observed sequential decision processes. The algorithm is tested i...

Ruben Martinez-Cantin, Nando de Freitas, Arnaud Do...

JAIR
2008

Online Planning Algorithms for POMDPs

13 years 7 months ago

Download www.jair.org

Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, solving a POMDP i...

Stéphane Ross, Joelle Pineau, Sébast...

NN
2010
Springer

Parameter-exploring policy gradients

13 years 5 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

ATAL
2007
Springer

Subjective approximate solutions for decentralized POMDPs

14 years 1 months ago

Download www.cs.cmu.edu

A problem of planning for cooperative teams under uncertainty is a crucial one in multiagent systems. Decentralized partially observable Markov decision processes (DEC-POMDPs) prov...

Anton Chechetka, Katia P. Sycara
