ICML
2001
IEEE


Symmetry in Markov Decision Processes and its Implications for Single Agent and Multiagent Learning

This paper examines the notion of symmetry in Markov decision processes (MDPs). We define symmetry for an MDP and show how it can be exploited for more effective learning in single-agent, multiagent, and multirobot systems. We prove that if an MDP possesses a symmetry, then the optimal value function and Q-function are similarly symmetric and there exists a symmetric optimal policy. If an MDP is known to possess a symmetry, this knowledge can be applied to decrease the number of training examples needed for algorithms like Q-learning and value iteration. It can also be used to directly restrict the hypothesis space.
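As a rough illustration of how a known symmetry could cut the number of training examples for Q-learning, the sketch below reuses each observed transition for its mirror image. Everything in it is an assumption made for illustration and is not taken from the paper: the toy corridor MDP, the reflection symmetry, and the names `step`, `mirror_state`, `mirror_action`, and `train_symmetric_q` are all hypothetical.

```python
import random

# Hypothetical toy MDP (not from the paper): a corridor of 7 cells with the
# reward in the middle. Reflecting the corridor and swapping LEFT/RIGHT
# leaves transitions and rewards unchanged, so it is a symmetry of the MDP.
N_STATES = 7
GOAL = N_STATES // 2
LEFT, RIGHT = 0, 1


def step(state, action):
    """Deterministic move; reward 1.0 on entering the goal cell."""
    delta = -1 if action == LEFT else 1
    nxt = min(max(state + delta, 0), N_STATES - 1)
    done = nxt == GOAL
    return nxt, (1.0 if done else 0.0), done


def mirror_state(state):
    return N_STATES - 1 - state


def mirror_action(action):
    return RIGHT if action == LEFT else LEFT


def td_target(q, reward, nxt, done, gamma):
    """One-step Q-learning target for a transition ending in `nxt`."""
    return reward if done else reward + gamma * max(q[nxt])


def train_symmetric_q(episodes=500, alpha=0.5, gamma=0.9, seed=0):
    """Tabular Q-learning under a random behaviour policy.

    Each real transition also updates its mirrored state-action pair,
    so one sample does the work of two.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    starts = [s for s in range(N_STATES) if s != GOAL]
    for _episode in range(episodes):
        state = rng.choice(starts)
        for _t in range(50):  # cap on episode length
            action = rng.choice((LEFT, RIGHT))
            nxt, reward, done = step(state, action)
            m_state, m_action = mirror_state(state), mirror_action(action)
            # Compute both targets before writing, so the mirrored update
            # never reads a value changed by the real update.
            real = td_target(q, reward, nxt, done, gamma)
            mirrored = td_target(q, reward, mirror_state(nxt), done, gamma)
            q[state][action] += alpha * (real - q[state][action])
            q[m_state][m_action] += alpha * (mirrored - q[m_state][m_action])
            if done:
                break
            state = nxt
    return q


if __name__ == "__main__":
    for s, row in enumerate(train_symmetric_q()):
        print(s, [round(v, 3) for v in row])
```

In this sketch the learned table stays symmetric by construction, which mirrors the abstract's claim that the optimal Q-function of a symmetric MDP is itself symmetric. The same idea could instead be applied by storing only one representative per pair of mirrored state-action entries, which would be one way of restricting the hypothesis space.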

Martin Zinkevich, Tucker R. Balch

ICML 2001 | Machine Learning | Optimal Value Function | Single Agent Systems | Symmetric Optimal Policy

Post Info

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2001
Where	ICML
Authors	Martin Zinkevich, Tucker R. Balch
