Online Learning with Expert Advice and Finite-Horizon Constraints



In this paper, we study a sequential decision-making problem. The objective is to maximize the average reward accumulated over time subject to temporal cost constraints. The novelty of our setup is that the rewards and constraints are controlled by an adversarial opponent. To solve our problem in a practical way, we propose an expert algorithm that guarantees both a vanishing regret and a sublinear number of violated constraints. The quality of this solution is demonstrated on a real-world power management problem. Our results support the hypothesis that online learning with convex cost constraints can be performed successfully in practice.
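To make "vanishing regret" concrete, here is a minimal sketch of a standard exponentially weighted experts learner (often called Hedge). It is not the algorithm proposed in the paper: it ignores the cost constraints entirely, and the function names, the learning rate choice, and the toy reward sequence are all assumptions made for illustration. It only shows how the average gap to the best fixed expert can shrink as the horizon grows.

```python
import math

def hedge(reward_rounds, eta):
    """Exponentially weighted experts on per-round rewards in [0, 1].

    Returns (expected cumulative reward of the learner,
             cumulative reward of the best fixed expert in hindsight).
    """
    n = len(reward_rounds[0])
    log_w = [0.0] * n            # log-weights, kept in log space for stability
    learner_total = 0.0
    expert_totals = [0.0] * n
    for rewards in reward_rounds:
        m = max(log_w)
        w = [math.exp(lw - m) for lw in log_w]
        z = sum(w)
        probs = [wi / z for wi in w]   # distribution over experts this round
        learner_total += sum(p * r for p, r in zip(probs, rewards))
        for i, r in enumerate(rewards):
            expert_totals[i] += r
            log_w[i] += eta * r        # reward experts that did well
    return learner_total, max(expert_totals)

def average_regret(reward_rounds):
    """Average per-round regret with the textbook rate eta = sqrt(8 ln n / T)."""
    T = len(reward_rounds)
    n = len(reward_rounds[0])
    eta = math.sqrt(8.0 * math.log(n) / T)
    learner, best = hedge(reward_rounds, eta)
    return (best - learner) / T

if __name__ == "__main__":
    # Hypothetical toy data: expert 0 always earns 1, expert 1 always earns 0.
    for T in (100, 10000):
        print(T, average_regret([[1.0, 0.0]] * T))
```

With this learning rate, the standard analysis bounds the average regret by sqrt(ln n / (2T)), which tends to zero as T grows. Handling adversarial constraints, as the paper does, requires additional machinery that this sketch does not attempt.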

Branislav Kveton, Jia Yuan Yu, Georgios Theocharous, Shie Mannor


AAAI 2008 | Cost Constraints | Decision Making Problem | Intelligent Agents | Temporal Cost Constraints


» Providing Expert Advice by Analogy for On-Line Help

» Real-time ranking with concept drift using expert advice

» Prediction with Expert Advice by Following the Perturbed Leader for General Weights

» Online Learning in Case of Unbounded Losses Using Follow the Perturbed Leader Algorithm

» Defensive Universal Learning with Experts

» Hannan Consistency in On-Line Learning in Case of Unbounded Losses Under Partial Monitoring

» Weak aggregating algorithm for the distribution-free perishable inventory problem

» Online Prediction with Privacy

Post Info

Added	02 Oct 2010
Updated	02 Oct 2010
Type	Conference
Year	2008
Where	AAAI
Authors	Branislav Kveton, Jia Yuan Yu, Georgios Theocharous, Shie Mannor


Sciweavers
