Search Sciweavers | Sciweavers

27 search results - page 3 / 6

» Policy Gradient Method for Team Markov Games

NIPS
2001

Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

ICANNGA
2007
Springer

Algorithms

Reinforcement Learning in Fine Time Discretization

16 years 1 month ago

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...

Pawel Wawrzynski

ATAL
2008
Springer

Intelligent Agents

Interaction-driven Markov games for decentralized multiagent planning under uncertainty

15 years 9 months ago

Download users.isr.ist.utl.pt

In this paper we propose interaction-driven Markov games (IDMGs), a new model for multiagent decision making under uncertainty. IDMGs aim at describing multiagent decision problem...

Matthijs T. J. Spaan, Francisco S. Melo

ROBOCUP
2001
Springer

Robotics

Strategy Learning for a Team in Adversary Environments

15 years 12 months ago

Download www.er.ams.eng.osaka-u.ac.jp

Team strategy acquisition is one of the most important issues of multiagent systems, especially in an adversary environment. RoboCup has been providing such an environment for AI a...

Yasutake Takahashi, Takashi Tamura, Minoru Asada

ATAL
2006
Springer

Intelligent Agents

Decentralized planning under uncertainty for teams of communicating agents

15 years 11 months ago

Download www.cs.cmu.edu

Decentralized partially observable Markov decision processes (DEC-POMDPs) form a general framework for planning for groups of cooperating agents that inhabit a stochastic and part...

Matthijs T. J. Spaan, Geoffrey J. Gordon, Nikos A....
