Search Sciweavers | Sciweavers

87 search results - page 8 / 18

» A policy iteration algorithm for Markov decision processes s...

click to vote

CORR
2006
Springer

113views Education» more CORR 2006»

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

13 years 7 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(), LSTD()...

Manuel Loth, Philippe Preux

claim paper

Read More »

click to vote

ATAL
2006
Springer

157views Intelligent Agents» more ATAL 2006»

Decentralized planning under uncertainty for teams of communicating agents

13 years 11 months ago

Download www.cs.cmu.edu

Decentralized partially observable Markov decision processes (DEC-POMDPs) form a general framework for planning for groups of cooperating agents that inhabit a stochastic and part...

Matthijs T. J. Spaan, Geoffrey J. Gordon, Nikos A....

claim paper

Read More »

click to vote

ICTAI
2007
IEEE

96views Artificial Intelligence» more ICTAI 2007»

Multi-criteria Decision Making for Local Coordination in Multi-agent Systems

14 years 1 months ago

Download users.info.unicaen.fr

Unlike mono-agent systems, multi-agent planing addresses the problem of resolving conﬂicts between individual and group interests. In this paper, we are using a Decentralized Ve...

Matthieu Boussard, Maroua Bouzid, Abdel-Illah Moua...

claim paper

Read More »

click to vote

ICMLA
2008

106views Machine Learning» more ICMLA 2008»

Prediction-Directed Compression of POMDPs

13 years 9 months ago

Download damas.ift.ulaval.ca

High dimensionality of belief space in Partially Observable Markov Decision Processes (POMDPs) is one of the major causes that severely restricts the applicability of this model. ...

Abdeslam Boularias, Masoumeh T. Izadi, Brahim Chai...

claim paper

Read More »

click to vote

NIPS
2008

171views Information Technology» more NIPS 2008»

MDPs with Non-Deterministic Policies

13 years 9 months ago

Download www.cs.mcgill.ca

Markov Decision Processes (MDPs) have been extensively studied and used in the context of planning and decision-making, and many methods exist to find the optimal policy for probl...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

« Prev « First page 8 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers