Search Sciweavers | Sciweavers

20

ICML
2001
IEEE

145views Machine Learning» more ICML 2001»

Symmetry in Markov Decision Processes and its Implications for Single Agent and Multiagent Learning

14 years 8 months ago

This paper examines the notion of symmetry in Markov decision processes (MDPs). We define symmetry for an MDP and show how it can be exploited for more effective learning in singl...

Martin Zinkevich, Tucker R. Balch

claim paper

Read More »

27

click to vote

ICML
1996
IEEE

159views Machine Learning» more ICML 1996»

Discretizing Continuous Attributes While Learning Bayesian Networks

14 years 8 months ago

Download www.cs.huji.ac.il

We introduce a method for learning Bayesian networks that handles the discretization of continuous variables as an integral part of the learning process. The main ingredient in th...

Moisés Goldszmidt, Nir Friedman

claim paper

Read More »

23

click to vote

IJCNN
2008
IEEE

113views Neural Networks» more IJCNN 2008»

Uncertainty propagation for quality assurance in Reinforcement Learning

14 years 2 months ago

Download www.inb.uni-luebeck.de

— In this paper we address the reliability of policies derived by Reinforcement Learning on a limited amount of observations. This can be done in a principled manner by taking in...

Daniel Schneegaß, Steffen Udluft, Thomas Mar...

claim paper

Read More »

24

click to vote

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

13 years 9 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decisions processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

claim paper

Read More »

22

click to vote

ICONIP
2009

107views Information Technology» more ICONIP 2009»

Tracking in Reinforcement Learning

13 years 5 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers