Sciweavers

360 search results - page 13 / 72
» Learning Evaluation Functions for Large Acyclic Domains
Sort
View
RAS
2010
117views more  RAS 2010»
13 years 7 months ago
Extending BDI plan selection to incorporate learning from experience
An important drawback to the popular Belief, Desire, and Intentions (BDI) paradigm is that such systems include no element of learning from experience. We describe a novel BDI exe...
Dhirendra Singh, Sebastian Sardiña, Lin Pad...
NIPS
2004
13 years 10 months ago
A Large Deviation Bound for the Area Under the ROC Curve
The area under the ROC curve (AUC) has been advocated as an evaluation criterion for the bipartite ranking problem. We study large deviation properties of the AUC; in particular, ...
Shivani Agarwal, Thore Graepel, Ralf Herbrich, Dan...
ICML
2010
IEEE
13 years 9 months ago
On the Consistency of Ranking Algorithms
We present a theoretical analysis of supervised ranking, providing necessary and sufficient conditions for the asymptotic consistency of algorithms based on minimizing a surrogate...
John Duchi, Lester W. Mackey, Michael I. Jordan
CORR
2010
Springer
138views Education» more  CORR 2010»
13 years 5 months ago
Rules of Thumb for Information Acquisition from Large and Redundant Data
We develop an abstract model of information acquisition from redundant data. We assume a random sampling process from data which contain information with bias and are interested in...
Wolfgang Gatterbauer
CORR
2011
Springer
161views Education» more  CORR 2011»
13 years 11 days ago
Doubly Robust Policy Evaluation and Learning
We study decision making in environments where the reward is only partially observed, but can be modeled as a function of an action and an observed context. This setting, known as...
Miroslav Dudík, John Langford, Lihong Li