Search Sciweavers | Sciweavers

» Approximate Linear Programming for Constrained Partially Obs...

ICML
2006
IEEE

Automatic basis function construction for approximate dynamic programming and reinforcement learning

14 years 1 month ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

IJCAI
2007

Learning from Partial Observations

13 years 9 months ago

Download www.ijcai.org

We present a general machine learning framework for modelling the phenomenon of missing information in data. We propose a masking process model to capture the stochastic nature of...

Loizos Michael

UAI
2008

Partitioned Linear Programming Approximations for MDPs

13 years 9 months ago

Download uai2008.cs.helsinki.fi

Approximate linear programming (ALP) is an efficient approach to solving large factored Markov decision processes (MDPs). The main idea of the method is to approximate the optimal...

Branislav Kveton, Milos Hauskrecht

IJRR
2010

Planning under Uncertainty for Robotic Tasks with Mixed Observability

13 years 6 months ago

Download motion.comp.nus.edu.sg

Partially observable Markov decision processes (POMDPs) provide a principled, general framework for robot motion planning in uncertain and dynamic environments. They have been app...

Sylvie C. W. Ong, Shao Wei Png, David Hsu, Wee Sun...

IJCAI
2003

Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

13 years 8 months ago

Download dli.iiit.ac.in

The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...

Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...
