Search Sciweavers | Sciweavers

176 search results - page 19 / 36

» Optimal Sample Selection for Batch-mode Reinforcement Learni...

143

click to vote

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 4 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

126

click to vote

PKDD
2009
Springer

148views Data Mining» more PKDD 2009»

Feature Selection by Transfer Learning with Linear Regularized Models

15 years 10 months ago

Download www.info.ucl.ac.be

Abstract. This paper presents a novel feature selection method for classiﬁcation of high dimensional data, such as those produced by microarrays. It includes a partial supervisio...

Thibault Helleputte, Pierre Dupont

claim paper

Read More »

132

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

15 years 4 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

174

click to vote

FOIKS
2008
Springer

358views Information Technology» more FOIKS 2008»

Cost-minimising strategies for data labelling : optimal stopping and active learning

16 years 10 days ago

Download arxiv.org

Supervised learning deals with the inference of a distribution over an output or label space $\CY$ conditioned on points in an observation space $\CX$, given a training dataset $D$...

Christos Dimitrakakis, Christian Savu-Krohn

posted by olethros

Read More »

126

click to vote

COLT
2008
Springer

132views Machine Learning» more COLT 2008»

Teaching Dimensions based on Cooperative Learning

15 years 4 months ago

Download colt2008.cs.helsinki.fi

The problem of how a teacher and a learner can cooperate in the process of learning concepts from examples in order to minimize the required sample size without “coding tricks�...

Sandra Zilles, Steffen Lange, Robert Holte, Martin...

claim paper

Read More »

« Prev « First page 19 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers