Sciweavers

176 search results - page 19 / 36
» Optimal Sample Selection for Batch-mode Reinforcement Learni...
Sort
View
NIPS
2008
13 years 10 months ago
Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms
Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...
John W. Roberts, Russ Tedrake
PKDD
2009
Springer
148views Data Mining» more  PKDD 2009»
14 years 3 months ago
Feature Selection by Transfer Learning with Linear Regularized Models
Abstract. This paper presents a novel feature selection method for classification of high dimensional data, such as those produced by microarrays. It includes a partial supervisio...
Thibault Helleputte, Pierre Dupont
IJCAI
2007
13 years 10 months ago
Using Linear Programming for Bayesian Exploration in Markov Decision Processes
A key problem in reinforcement learning is finding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...
Pablo Samuel Castro, Doina Precup
FOIKS
2008
Springer
14 years 6 months ago
Cost-minimising strategies for data labelling : optimal stopping and active learning
Supervised learning deals with the inference of a distribution over an output or label space $\CY$ conditioned on points in an observation space $\CX$, given a training dataset $D$...
Christos Dimitrakakis, Christian Savu-Krohn
COLT
2008
Springer
13 years 10 months ago
Teaching Dimensions based on Cooperative Learning
The problem of how a teacher and a learner can cooperate in the process of learning concepts from examples in order to minimize the required sample size without “coding tricks...
Sandra Zilles, Steffen Lange, Robert Holte, Martin...