Search Sciweavers | Sciweavers

181 search results - page 13 / 37

» On Policy Learning in Restricted Policy Spaces

click to vote

NIPS
1994

152views Information Technology» more NIPS 1994»

Finding Structure in Reinforcement Learning

13 years 9 months ago

Download www.ri.cmu.edu

Reinforcement learning addresses the problem of learning to select actions in order to maximize one's performance inunknownenvironments. Toscale reinforcement learning to com...

Sebastian Thrun, Anton Schwartz

claim paper

Read More »

click to vote

ATAL
2003
Springer

151views Intelligent Agents» more ATAL 2003»

Representation and reasoning for DAML-based policy and domain services in KAoS and nomads

14 years 26 days ago

Download www.ihmc.us

To increase the assurance with which agents can be deployed in operational settings, we have been developing the KAoS policy and domain services. In conjunction with Nomads strong...

Jeffrey M. Bradshaw, Andrzej Uszok, Renia Jeffers,...

claim paper

Read More »

click to vote

NN
2010
Springer

125views Neural Networks» more NN 2010»

Parameter-exploring policy gradients

13 years 6 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

claim paper

Read More »

click to vote

SECON
2008
IEEE

174views Communications» more SECON 2008»

Optimal Buffer Management Policies for Delay Tolerant Networks

14 years 2 months ago

Download people.ee.ethz.ch

—Delay Tolerant Networks are wireless networks where disconnections may occur frequently due to propagation phenomena, node mobility, and power outages. Propagation delays may al...

Amir Krifa, Chadi Barakat, Thrasyvoulos Spyropoulo...

claim paper

Read More »

click to vote

ATAL
2006
Springer

107views Intelligent Agents» more ATAL 2006»

Winning back the CUP for distributed POMDPs: planning over continuous belief spaces

13 years 11 months ago

Download teamcore.usc.edu

Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are evolving as a popular approach for modeling multiagent systems, and many different algorithms ha...

Pradeep Varakantham, Ranjit Nair, Milind Tambe, Ma...

claim paper

Read More »

« Prev « First page 13 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers