Sciweavers

83 search results (page 14 of 17) » Planning and Acting in Partially Observable Stochastic Domains

NIPS 2008
Multi-Agent Filtering with Infinitely Nested Beliefs
In partially observable worlds with many agents, nested beliefs are formed when agents simultaneously reason about the unknown state of the world and the beliefs of the other agents...
Luke S. Zettlemoyer, Brian Milch, Leslie Pack Kaelbling

ECML 2007 (Springer)
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov Decision Processes...
Daan Wierstra, Jürgen Schmidhuber

HICSS 2003 (IEEE)
Formalizing Multi-Agent POMDP's in the context of network routing
This paper uses partially observable Markov decision processes (POMDPs) as a basic framework for multi-agent planning. We distinguish three perspectives: the first is that of a...
Bharaneedharan Rathnasabapathy, Piotr J. Gmytrasiewicz

AAAI 2010
Reinforcement Learning via AIXI Approximation
This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian...
Joel Veness, Kee Siong Ng, Marcus Hutter, David Silver

ECML 2005 (Springer)
Model-Based Online Learning of POMDPs
Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free methods...
Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony
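
All of the papers listed here build on the POMDP framework, in which an agent maintains a belief (a probability distribution over the hidden state) and updates it after each action and observation. Purely as an illustration, and not code from any of the papers above, the following is a minimal sketch of that discrete belief update in Python; the transition array T, observation array O, and the two-state example values are assumptions chosen for illustration.

# Minimal POMDP belief-update sketch (illustrative; not from any paper above).
# Discrete model: T[a, s, s'] = P(s' | s, a), O[a, s', o] = P(o | s', a).
import numpy as np

def belief_update(b, a, o, T, O):
    # Bayes filter step: b'(s') is proportional to O(o | s', a) * sum_s T(s' | s, a) * b(s)
    predicted = T[a].T @ b                  # predict the next-state distribution
    unnormalized = O[a, :, o] * predicted   # weight by the observation likelihood
    return unnormalized / unnormalized.sum()

# Hypothetical two-state example with a single "listen" action.
T = np.array([[[1.0, 0.0],
               [0.0, 1.0]]])                # listening leaves the hidden state unchanged
O = np.array([[[0.85, 0.15],
               [0.15, 0.85]]])              # observation matches the true state 85% of the time

b = np.array([0.5, 0.5])                    # start from a uniform belief
for obs in (0, 0, 1):                       # a short sequence of observations
    b = belief_update(b, a=0, o=obs, T=T, O=O)
    print(b)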