Sciweavers

908 search results - page 18 / 182
» Stochastic Finite Learning
Sort
View
ECML
2005
Springer
14 years 1 months ago
Towards Finite-Sample Convergence of Direct Reinforcement Learning
Abstract. While direct, model-free reinforcement learning often performs better than model-based approaches in practice, only the latter have yet supported theoretical guarantees f...
Shiau Hong Lim, Gerald DeJong
ICML
2010
IEEE
13 years 8 months ago
Finite-Sample Analysis of LSTD
In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...
Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...
ALT
2010
Springer
13 years 9 months ago
Lower Bounds on Learning Random Structures with Statistical Queries
We show that random DNF formulas, random log-depth decision trees and random deterministic finite acceptors cannot be weakly learned with a polynomial number of statistical queries...
Dana Angluin, David Eisenstat, Leonid Kontorovich,...
ICAPR
2005
Springer
14 years 1 months ago
Multi-view EM Algorithm for Finite Mixture Models
In this paper, Multi-View Expectation and Maximization algorithm for finite mixture models is proposed by us to handle realworld learning problems which have natural feature split...
Xing Yi, Yunpeng Xu, Changshui Zhang
ALENEX
2008
133views Algorithms» more  ALENEX 2008»
13 years 9 months ago
Comparing Online Learning Algorithms to Stochastic Approaches for the Multi-Period Newsvendor Problem
The multi-period newsvendor problem describes the dilemma of a newspaper salesman--how many paper should he purchase each day to resell, when he doesn't know the demand? We d...
Shawn O'Neil, Amitabh Chaudhary