Representation Discovery for MDPs Using Bisimulation Metrics

10 years 3 months ago

Download www.cs.mcgill.ca

We provide a novel, ﬂexible, iterative reﬁnement algorithm to automatically construct an approximate statespace representation for Markov Decision Processes (MDPs). Our approach leverages bisimulation metrics, which have been used in prior work to generate features to represent the state space of MDPs. We address a drawback of this approach, which is the expensive computation of the bisimulation metrics. We propose an algorithm to generate an iteratively improving sequence of state space partitions. Partial metric computations guide the representation search and provide much lower space and computational complexity, while maintaining strong convergence properties. We provide theoretical results guaranteeing convergence as well as experimental illustrations of the accuracy and savings (in time and memory usage) of the new algorithm, compared to traditional bisimulation metric computation.

Sherry Shanshan Ruan, Gheorghe Comanici, Prakash P

Real-time Traffic

AAAI 2015 | Intelligent Agents |

claim paper

» Model Minimization in Markov Decision Processes

» Determining Molecular Similarity for Drug Discovery using the Wavelet Riemannian Metric

» Algorithms for Network Topology Discovery using EndtoEnd Measurements

» Discovery of Collocation Patterns from Visual Words to Visual Phrases

» DDPIn Distance and density based protein indexing

» Comprehensible and Accurate Cluster Labels in Text Clustering

Post Info
More Details (n/a)

Added	27 Mar 2016
Updated	27 Mar 2016
Type	Journal
Year	2015
Where	AAAI
Authors	Sherry Shanshan Ruan, Gheorghe Comanici, Prakash Panangaden, Doina Precup

Comments (0)

Sciweavers

Representation Discovery for MDPs Using Bisimulation Metrics

AAAI 2015 | Intelligent Agents |

Explore & Download

Productivity Tools

Sciweavers