Search Sciweavers | Sciweavers

97 search results - page 18 / 20

» An epsilon-Optimal Grid-Based Algorithm for Partially Observ...

click to vote

HICSS
2003
IEEE

207views Biometrics» more HICSS 2003»

Formalizing Multi-Agent POMDP's in the context of network routing

14 years 23 days ago

Download www.hicss.hawaii.edu

This paper uses partially observable Markov decision processes (POMDP’s) as a basic framework for MultiAgent planning. We distinguish three perspectives: ﬁrst one is that of a...

Bharaneedharan Rathnasabapathy, Piotr J. Gmytrasie...

claim paper

Read More »

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

14 years 1 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

click to vote

ICML
2006
IEEE

136views Machine Learning» more ICML 2006»

An analytic solution to discrete Bayesian reinforcement learning

14 years 8 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

claim paper

Read More »

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

13 years 7 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

click to vote

FSR
2003
Springer

94views Robotics» more FSR 2003»

Planning under Uncertainty for Reliable Health Care Robotics

14 years 21 days ago

Download www.cs.cmu.edu

We describe a mobile robot system, designed to assist residents of an retirement facility. This system is being developed to respond to an aging population and a predicted shortage...

Nicholas Roy, Geoffrey J. Gordon, Sebastian Thrun

claim paper

Read More »

« Prev « First page 18 / 20 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers