Sciweavers

CORR
2011
Springer
161views Education» more  CORR 2011»
12 years 10 months ago
Doubly Robust Policy Evaluation and Learning
We study decision making in environments where the reward is only partially observed, but can be modeled as a function of an action and an observed context. This setting, known as...
Miroslav Dudík, John Langford, Lihong Li