Search Sciweavers | Sciweavers

176 search results - page 15 / 36

» Optimal Sample Selection for Batch-mode Reinforcement Learni...

ICASSP
2011
IEEE

123 views, Signal Processing

A kernelized maximal-figure-of-merit learning approach based on subspace distance minimization

12 years 11 months ago

Download mirlab.org

We propose a kernelized maximal-figure-of-merit (MFoM) learning approach to efficiently training a nonlinear model using subspace distance minimization. In particular, a fixed,...

Byungki Byun, Chin-Hui Lee

NIPS
1998

164 views, Information Technology

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

13 years 9 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

SMC
2007
IEEE

102 views, Control Systems

An improved immune Q-learning algorithm

14 years 1 month ago

Download web2.uwindsor.ca

Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...

Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...

SDM
2008
SIAM

144 views, Data Mining

Active Learning with Model Selection in Linear Regression

13 years 9 months ago

Download hrstc.org

Optimally designing the location of training input points (active learning) and choosing the best model (model selection) are two important components of supervised learning and h...

Masashi Sugiyama, Neil Rubens

CORR
2010
Springer

125 views, Education

Near-Optimal Bayesian Active Learning with Noisy Observations

13 years 7 months ago

Download www.cs.caltech.edu

We tackle the fundamental problem of Bayesian active learning with noise, where we need to adaptively select from a number of expensive tests in order to identify an unknown hypot...

Daniel Golovin, Andreas Krause, Debajyoti Ray
