Search Sciweavers | Sciweavers

908 search results - page 18 / 182

» Stochastic Finite Learning

155

click to vote

ECML
2005
Springer

95views Machine Learning» more ECML 2005»

Towards Finite-Sample Convergence of Direct Reinforcement Learning

16 years 16 days ago

Download www.cs.uiuc.edu

Abstract. While direct, model-free reinforcement learning often performs better than model-based approaches in practice, only the latter have yet supported theoretical guarantees f...

Shiau Hong Lim, Gerald DeJong

claim paper

Read More »

177

click to vote

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Finite-Sample Analysis of LSTD

15 years 8 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

161

Voted

ALT
2010
Springer

224views Machine Learning» more ALT 2010»

Lower Bounds on Learning Random Structures with Statistical Queries

15 years 8 months ago

Download www.cc.gatech.edu

We show that random DNF formulas, random log-depth decision trees and random deterministic finite acceptors cannot be weakly learned with a polynomial number of statistical queries...

Dana Angluin, David Eisenstat, Leonid Kontorovich,...

claim paper

Read More »

211

click to vote

ICAPR
2005
Springer

113views Pattern Recognition» more ICAPR 2005»

Multi-view EM Algorithm for Finite Mixture Models

16 years 15 days ago

Download www.cs.umass.edu

In this paper, Multi-View Expectation and Maximization algorithm for ﬁnite mixture models is proposed by us to handle realworld learning problems which have natural feature split...

Xing Yi, Yunpeng Xu, Changshui Zhang

claim paper

Read More »

169

click to vote

ALENEX
2008

133views Algorithms» more ALENEX 2008»

Comparing Online Learning Algorithms to Stochastic Approaches for the Multi-Period Newsvendor Problem

15 years 8 months ago

Download www.nd.edu

The multi-period newsvendor problem describes the dilemma of a newspaper salesman--how many paper should he purchase each day to resell, when he doesn't know the demand? We d...

Shawn O'Neil, Amitabh Chaudhary

claim paper

Read More »

« Prev « First page 18 / 182 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers