Search Sciweavers | Sciweavers


» Optimal Sample Selection for Batch-mode Reinforcement Learni...


NIPS
2001

Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...



ICML
2009
IEEE

Machine Learning

Robust bounds for classification via selective sampling

16 years 7 months ago

Download homes.dsi.unimi.it

We introduce a new algorithm for binary classification in the selective sampling protocol. Our algorithm uses Regularized Least Squares (RLS) as base classifier, and for this reas...

Nicolò Cesa-Bianchi, Claudio Gentile, Franc...



ECIR
2009
Springer

Information Technology

Active Sampling for Rank Learning via Optimizing the Area under the ROC Curve

16 years 3 months ago

Download www.cs.cmu.edu

Abstract. Learning ranking functions is crucial for solving many problems, ranging from document retrieval to building recommendation systems based on an individual user’s prefer...

Pinar Donmez, Jaime G. Carbonell



PKDD
2009
Springer

Data Mining

Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm

15 years 11 months ago

Download www.lri.fr

Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...

Philippe Rolet, Michèle Sebag, Olivier Teyt...



ICML
2003
IEEE

Machine Learning

Q-Decomposition for Reinforcement Learning Agents

16 years 7 months ago

Download www.hpl.hp.com

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...

Stuart J. Russell, Andrew Zimdars

