Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

109

ATAL
2010
Springer

favoriteEmaildiscussreport

141views Intelligent Agents» more ATAL 2010»

Risk-sensitive planning in partially observable environments

15 years 3 months ago

Risk-sensitive planning in partially observable environments

Download www.aamas-conference.org

Partially Observable Markov Decision Process (POMDP) is a popular framework for planning under uncertainty in partially observable domains. Yet, the POMDP model is riskneutral in that it assumes that the agent is maximizing the expected reward of its actions. In contrast, in domains like financial planning, it is often required that the agent decisions are risk-sensitive (maximize the utility of agent actions, for non-linear utility functions). Unfortunately, existing POMDP solvers cannot solve such planning problems exactly. By considering piecewise linear approximations of utility functions, this paper addresses this shortcoming in three contributions: (i) It defines the Risk-Sensitive POMDP model; (ii) It derives the fundamental properties of the underlying value functions and provides a functional value iteration technique to compute them exactly and (c) It proposes an efficient procedure to determine the dominated value functions, to speed up the algorithm. Our experiments show t...

Janusz Marecki, Pradeep Varakantham

Real-time Traffic

ATAL 2010 | Intelligent Agents | Partially Observable Markov Decision Process | POMDP Model | Utility Functions |

claim paper

Related Content

» Planning with Continuous Actions in Partially Observable Environments

» Learning Planning Operators in RealWorld Partially Observable Environments

» Action representation and partially observable planning using epistemic logic

» Planning under Uncertainty for Robotic Tasks with Mixed Observability

» Acting Optimally in Partially Observable Stochastic Domains

» Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

» Supervision and diagnosis of joint actions in multiagent plans

» Planning and Acting in Uncertain Environments using Probabilistic Inference

» A Planning Algorithm for Predictive State Representations

Post Info
More Details (n/a)

Added	08 Nov 2010
Updated	08 Nov 2010
Type	Conference
Year	2010
Where	ATAL
Authors	Janusz Marecki, Pradeep Varakantham

Comments (0)