Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

216

ICML
2006
IEEE

156views Machine Learning» more ICML 2006»

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

16 years 8 months ago

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

Download animatlab.lip6.fr

Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (fmdps). However, these algorithms need a perfect knowledge of the structure of the problem. In this paper, we propose sdyna, a general framework for addressing large reinforcement learning problems by trial-and-error and with no initial knowledge of their structure. sdyna integrates incremental planning algorithms based on fmdps with supervised learning techniques building structured representations of the problem. We describe spiti, an instantiation of sdyna, that uses incremental decision tree induction to learn the structure of a problem combined with an incremental version of the Structured Value Iteration algorithm. We show that spiti can build a factored representation of a reinforcement learning problem and may improve the policy faster than tabular reinforcement learning algorithms by exploiting the generalization property of decision tree...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille

Real-time Traffic

ICML 2006 | Incremental Planning Algorithms | Machine Learning | Reinforcement Learning Algorithms | Reinforcement Learning Problem |

claim paper

Related Content

» A Simulationbased Approach for Solving Generalized SemiMarkov Decision Processes

» Considering Unseen States as Impossible in Factored Reinforcement Learning

» Anticipatory Learning Classifier Systems and Factored Reinforcement Learning

» Automatic Feature Selection for ModelBased Reinforcement Learning in Factored MDPs

» Solving multiagent assignment Markov decision processes

» Generating Hierarchical Structure in Reinforcement Learning from State Variables

» Optimism in Reinforcement Learning Based on KullbackLeibler Divergence

» Using Free Energies to Represent Qvalues in a Multiagent Reinforcement Learning Task

» Reinforcement learning with limited reinforcement using Bayes risk for active learning in ...

Post Info
More Details (n/a)

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2006
Where	ICML
Authors	Thomas Degris, Olivier Sigaud, Pierre-Henri Wuillemin

Comments (0)