Search Sciweavers | Sciweavers

135 search results - page 8 / 27

» Bounded Parameter Markov Decision Processes

157

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 3 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

158

click to vote

ICMLA
2009

181views Machine Learning» more ICMLA 2009»

Sensitivity Analysis of POMDP Value Functions

15 years 3 months ago

Download www.cs.cmu.edu

In sequential decision making under uncertainty, as in many other modeling endeavors, researchers observe a dynamical system and collect data measuring its behavior over time. The...

Stéphane Ross, Masoumeh T. Izadi, Mark Merc...

claim paper

Read More »

145

click to vote

NIPS
2004

112views Information Technology» more NIPS 2004»

Learning first-order Markov models for control

15 years 6 months ago

Download books.nips.cc

First-order Markov models have been successfully applied to many problems, for example in modeling sequential data using Markov chains, and modeling control problems using the Mar...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

330

click to vote

Publication

151views

Robust Bayesian reinforcement learning through tight lower bounds

14 years 3 months ago

Download arxiv.org

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinfo...

Christos Dimitrakakis

posted by olethros

Read More »

142

click to vote

CORR
2010
Springer

112views Education» more CORR 2010»

Efficient Approximation of Optimal Control for Markov Games

15 years 5 months ago

Download react.cs.uni-sb.de

The success of probabilistic model checking for discrete-time Markov decision processes and continuous-time Markov chains has led to rich academic and industrial applications. The ...

Markus Rabe, Sven Schewe, Lijun Zhang

claim paper

Read More »

« Prev « First page 8 / 27 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers