Search Sciweavers | Sciweavers

360 search results - page 13 / 72

» Learning Evaluation Functions for Large Acyclic Domains

RAS
2010

117 views

Extending BDI plan selection to incorporate learning from experience

13 years 7 months ago

Download goanna.cs.rmit.edu.au

An important drawback to the popular Belief, Desire, and Intentions (BDI) paradigm is that such systems include no element of learning from experience. We describe a novel BDI exe...

Dhirendra Singh, Sebastian Sardiña, Lin Pad...

NIPS
2004

109 views, Information Technology

A Large Deviation Bound for the Area Under the ROC Curve

13 years 10 months ago

Download books.nips.cc

The area under the ROC curve (AUC) has been advocated as an evaluation criterion for the bipartite ranking problem. We study large deviation properties of the AUC; in particular, ...

Shivani Agarwal, Thore Graepel, Ralf Herbrich, Dan...

ICML
2010
IEEE

259 views, Machine Learning

On the Consistency of Ranking Algorithms

13 years 9 months ago

Download www.cs.berkeley.edu

We present a theoretical analysis of supervised ranking, providing necessary and sufficient conditions for the asymptotic consistency of algorithms based on minimizing a surrogate...

John Duchi, Lester W. Mackey, Michael I. Jordan

CORR
2010
Springer

138 views, Education

Rules of Thumb for Information Acquisition from Large and Redundant Data

13 years 5 months ago

Download www.cs.washington.edu

We develop an abstract model of information acquisition from redundant data. We assume a random sampling process from data which contain information with bias and are interested in...

Wolfgang Gatterbauer

CORR
2011
Springer

161 views, Education

Doubly Robust Policy Evaluation and Learning

13 years 11 days ago

Download www.icml-2011.org

We study decision making in environments where the reward is only partially observed, but can be modeled as a function of an action and an observed context. This setting, known as...

Miroslav Dudík, John Langford, Lihong Li
