CORR
2011
Springer
Doubly Robust Policy Evaluation and Learning
We study decision making in environments where the reward is only partially observed, but can be modeled as a function of an action and an observed context. This setting, known as...
Miroslav Dudík, John Langford, Lihong Li