Search Sciweavers | Sciweavers

908 search results - page 61 / 182

» Stochastic Finite Learning

184

click to vote

ICML
2006
IEEE

137views Machine Learning» more ICML 2006»

Predictive linear-Gaussian models of controlled stochastic dynamical systems

16 years 8 months ago

Download www.rudary.com

We introduce the controlled predictive linearGaussian model (cPLG), a model that uses predictive state to model discrete-time dynamical systems with real-valued observations and v...

Matthew R. Rudary, Satinder P. Singh

claim paper

Read More »

176

click to vote

COLT
2007
Springer

144views Machine Learning» more COLT 2007»

Improved Rates for the Stochastic Continuum-Armed Bandit Problem

16 years 1 months ago

Download www.sztaki.hu

Abstract. Considering one-dimensional continuum-armed bandit problems, we propose an improvement of an algorithm of Kleinberg and a new set of conditions which give rise to improve...

Peter Auer, Ronald Ortner, Csaba Szepesvári

claim paper

Read More »

216

click to vote

CORR
2010
Springer

98views Education» more CORR 2010»

Structure-Aware Stochastic Control for Transmission Scheduling

15 years 6 months ago

Download medianetlab.ee.ucla.edu

In this report, we consider the problem of real-time transmission scheduling over time-varying channels. We first formulate the transmission scheduling problem as a Markov decisio...

Fangwen Fu, Mihaela van der Schaar

claim paper

Read More »

217

click to vote

ACL
2008

123views Computational Linguistics» more ACL 2008»

Semi-Supervised Convex Training for Dependency Parsing

15 years 8 months ago

Download www.aclweb.org

We present a novel semi-supervised training algorithm for learning dependency parsers. By combining a supervised large margin loss with an unsupervised least squares loss, a discr...

Qin Iris Wang, Dale Schuurmans, Dekang Lin

claim paper

Read More »

193

click to vote

NIPS
1993

128views Information Technology» more NIPS 1993»

Convergence of Stochastic Iterative Dynamic Programming Algorithms

15 years 8 months ago

Download www.bitsavers.org

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms,includ...

Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...

claim paper

Read More »

« Prev « First page 61 / 182 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers