Search Sciweavers | Sciweavers

» Solving Concurrent Markov Decision Processes

ATAL 2007, Springer (Intelligent Agents)

Combinatorial resource scheduling for multiagent MDPs

16 years 27 days ago

Optimal resource scheduling in multiagent systems is a computationally challenging task, particularly when the values of resources are not additive. We consider the combinatorial ...

Dmitri A. Dolgov, Michael R. James, Michael E. Sam...

ICML 1996, IEEE (Machine Learning)

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

15 years 11 months ago

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

AI 2006, Springer (Artificial Intelligence)

Belief Selection in Point-Based Planning Algorithms for POMDPs

15 years 10 months ago

Current point-based planning algorithms for solving partially observable Markov decision processes (POMDPs) have demonstrated that a good approximation of the value funct...

Masoumeh T. Izadi, Doina Precup, Danielle Azar

AAAI 2007 (Intelligent Agents)

Thresholded Rewards: Acting Optimally in Timed, Zero-Sum Games

15 years 9 months ago

In timed, zero-sum games, the goal is to maximize the probability of winning, which is not necessarily the same as maximizing our expected reward. We consider cumulative intermedi...

Colin McMillen, Manuela M. Veloso

ATAL 2008, Springer (Intelligent Agents)

MB-AIM-FSI: a model based framework for exploiting gradient ascent multiagent learners in strategic interactions

15 years 8 months ago

Future agent applications will increasingly represent human users autonomously or semi-autonomously in strategic interactions with similar entities. Hence, there is a growing need...

Doran Chakraborty, Sandip Sen
