Search Sciweavers | Sciweavers

337 search results - page 56 / 68

» Mean-Variance Optimization in Markov Decision Processes

137

click to vote

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Combinatorial resource scheduling for multiagent MDPs

15 years 10 months ago

Download ai.stanford.edu

Optimal resource scheduling in multiagent systems is a computationally challenging task, particularly when the values of resources are not additive. We consider the combinatorial ...

Dmitri A. Dolgov, Michael R. James, Michael E. Sam...

claim paper

Read More »

162

click to vote

ICML
1996
IEEE

196views Machine Learning» more ICML 1996»

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

15 years 8 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

claim paper

Read More »

151

click to vote

ATAL
2008
Springer

134views Intelligent Agents» more ATAL 2008»

MB-AIM-FSI: a model based framework for exploiting gradient ascent multiagent learners in strategic interactions

15 years 6 months ago

Download www.cs.utexas.edu

Future agent applications will increasingly represent human users autonomously or semi-autonomously in strategic interactions with similar entities. Hence, there is a growing need...

Doran Chakraborty, Sandip Sen

claim paper

Read More »

163

click to vote

AUTOMATICA
2007

124views more AUTOMATICA 2007»

Motion planning in uncertain environments with vision-like sensors

15 years 4 months ago

Download dnc.tamu.edu

In this work we present a methodology for intelligent path planning in an uncertain environment using vision like sensors, i.e., sensors that allow the sensing of the environment ...

Suman Chakravorty, John L. Junkins

claim paper

Read More »

142

click to vote

JAIR
2006

101views more JAIR 2006»

Resource Allocation Among Agents with MDP-Induced Preferences

15 years 4 months ago

Download www.jair.org

Allocating scarce resources among agents to maximize global utility is, in general, computationally challenging. We focus on problems where resources enable agents to execute acti...

Dmitri A. Dolgov, Edmund H. Durfee

claim paper

Read More »

« Prev « First page 56 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers