Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

109

ICML
2007
IEEE

favoriteEmaildiscussreport

171views Machine Learning» more ICML 2007»

Percentile optimization in uncertain Markov decision processes with application to efficient exploration

16 years 2 months ago

Percentile optimization in uncertain Markov decision processes with application to efficient exploration

Download www.machinelearning.org

Markov decision processes are an effective tool in modeling decision-making in uncertain dynamic environments. Since the parameters of these models are typically estimated from data, learned from experience, or designed by hand, it is not surprising that the actual performance of a chosen strategy often significantly differs from the designer's initial expectations due to unavoidable model uncertainty. In this paper, we present a percentile criterion that captures the trade-off between optimistic and pessimistic points of view on MDP with parameter uncertainty. We describe tractable methods that take parameter uncertainty into account in the process of decision making. Finally, we propose a costeffective exploration strategy when it is possible to invest (money, time or computation efforts) in actions that will reduce the uncertainty in the parameters.

Erick Delage, Shie Mannor

Real-time Traffic

ICML 2007 | Machine Learning | Markov Decision Processes | Parameter Uncertainty | Unavoidable Model Uncertainty |

claim paper

Related Content

» Decision Making in Uncertain RealWorld Domains Using DTGolog

» An intrinsic reward mechanism for efficient exploration

» Efficient Approximation of Optimal Control for Markov Games

» Motion planning in uncertain environments with visionlike sensors

» Optimizing mpf queries decision support and probabilistic inference

» QoS Routing in Networks with Uncertain Parameters

» Efficient Algorithms for Decision Tree Crossvalidation

» Symmetric approximate linear programming for factored MDPs with application to constrained...

» Winning back the CUP for distributed POMDPs planning over continuous belief spaces

Post Info
More Details (n/a)

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2007
Where	ICML
Authors	Erick Delage, Shie Mannor

Comments (0)