Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

93

AAAI
2015

favoriteEmaildiscussreport

43views Intelligent Agents» more AAAI 2015»

Optimizing the CVaR via Sampling

9 years 3 months ago

Optimizing the CVaR via Sampling

Download tx.technion.ac.il

Conditional Value at Risk (CVaR) is a prominent risk measure that is being used extensively in various domains. We develop a new formula for the gradient of the CVaR in the form of a conditional expectation. Based on this formula, we propose a novel sampling-based estimator for the gradient of the CVaR, in the spirit of the likelihood-ratio method. We analyze the bias of the estimator, and prove the convergence of a corresponding stochastic gradient descent algorithm to a local CVaR optimum. Our method allows to consider CVaR optimization in new domains. As an example, we consider a reinforcement learning application, and learn a risksensitive controller for the game of Tetris.

Aviv Tamar, Yonatan Glassner, Shie Mannor

Real-time Traffic

AAAI 2015 | Computer Science |

claim paper

Related Content

» Sample average approximation of expected value constrained stochastic programs

» OnLine Search for Solving Markov Decision Processes via Heuristic Sampling

» Active Sampling for Rank Learning via Optimizing the Area under the ROC Curve

» Detecting Network Intrusions via Sampling A Game Theoretic Approach

» Jitter compensation in sampling via polynomial least squares estimation

» Robust bounds for classification via selective sampling

» Tracking of a NonRigid Object via Patchbased Dynamic Appearance Modeling and Adaptive Basi...

» Optimized distributed 2D transforms for irregularly sampled sensor network grids using wav...

» Improved Internet traffic analysis via optimized sampling

» Characterizing Optimal Sampling of Binary Contingency Tables via the Configuration Model

Post Info
More Details (n/a)

Added	27 Mar 2016
Updated	27 Mar 2016
Type	Journal
Year	2015
Where	AAAI
Authors	Aviv Tamar, Yonatan Glassner, Shie Mannor

Comments (0)