minimax regret | Sciweavers

COLT
2010
Springer

217 views, Machine Learning

Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback

13 years 10 months ago

Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...

Alekh Agarwal, Ofer Dekel, Lin Xiao

AI
2006
Springer

109 views, Artificial Intelligence

Constraint-based optimization and utility elicitation using the minimax decision criterion

14 years 20 days ago

Download www.cs.duke.edu

In many situations, a set of hard constraints encodes the feasible configurations of some system or product over which multiple users have distinct preferences. However, making su...

Craig Boutilier, Relu Patrascu, Pascal Poupart, Da...

IJCAI
2003

119 views, Artificial Intelligence

Incremental Utility Elicitation with the Minimax Regret Decision Criterion

14 years 2 months ago

Download www.cs.toronto.edu

Utility elicitation is a critical function of any automated decision aid, allowing decisions to be tailored to the preferences of a specific user. However, the size and complexit...

Tianhan Wang, Craig Boutilier

AAAI
2010

136 views, Intelligent Agents

Robust Policy Computation in Reward-Uncertain MDPs Using Nondominated Policies

14 years 2 months ago

Download www.cs.toronto.edu

The precise specification of reward functions for Markov decision processes (MDPs) is often extremely difficult, motivating research into both reward elicitation and the robust so...

Kevin Regan, Craig Boutilier

CDC
2009
IEEE

169 views, Control Systems

Parametric regret in uncertain Markov decision processes

14 years 5 months ago

Download www.cim.mcgill.ca

We consider decision making in a Markovian setup where the reward parameters are not known in advance. Our performance criterion is the gap between the performance of the best ...

Huan Xu, Shie Mannor

SIGECOM
2010
ACM

183 views, ECommerce

Assessing regret-based preference elicitation with the UTPREF recommendation system

14 years 5 months ago

Download www.cs.toronto.edu

Product recommendation and decision support systems must generally develop a model of user preferences by querying or otherwise interacting with a user. Recent approaches to elici...

Darius Braziunas, Craig Boutilier

RECSYS
2009
ACM

105 views, Control Systems

Preference elicitation with subjective features

14 years 7 months ago

Download www.cs.toronto.edu

Utility or preference elicitation is a critical component in many recommender and decision support systems. However, most frameworks for elicitation assume a predefined set of fe...

Craig Boutilier, Kevin Regan, Paolo Viappiani
