Sciweavers

659 search results - page 19 / 132
» Tracking the Best Expert
Sort
View
NIPS
2007
13 years 11 months ago
Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion
We consider apprenticeship learning—learning from expert demonstrations—in the setting of large, complex domains. Past work in apprenticeship learning requires that the expert...
J. Zico Kolter, Pieter Abbeel, Andrew Y. Ng
BCSHCI
2008
13 years 11 months ago
HCI: whose job is it anyway?
A panel of experts on human computer interaction (HCI) argues about who is best placed to `own' HCI and the user centred design process, The experts come from a range of priv...
Tom Stewart
JMLR
2010
103views more  JMLR 2010»
13 years 4 months ago
Regret Bounds and Minimax Policies under Partial Monitoring
This work deals with four classical prediction settings, namely full information, bandit, label efficient and bandit label efficient as well as four different notions of regret: p...
Jean-Yves Audibert, Sébastien Bubeck
RIDE
2002
IEEE
14 years 2 months ago
An Architecture for Assembling Agents that Participate in Alternative Heterogeneous Auctions
This paper addresses the issue of developing agents capable of participating in several potentially simultaneous auctions of different kinds (English, First-Price, Vickrey), with ...
Marlon Dumas, Guido Governatori, Arthur H. M. ter ...
COLT
2008
Springer
13 years 11 months ago
Extracting Certainty from Uncertainty: Regret Bounded by Variation in Costs
Prediction from expert advice is a fundamental problem in machine learning. A major pillar of the field is the existence of learning algorithms whose average loss approaches that ...
Elad Hazan, Satyen Kale