Sciweavers

ICML 2010
IEEE
Nonparametric Return Distribution Approximation for Reinforcement Learning
Standard Reinforcement Learning (RL) aims to optimize decision-making rules in terms of the expected return. However, especially for risk-management purposes, other criteria such ...
Tetsuro Morimura, Masashi Sugiyama, Hisashi Kashim...