Search Sciweavers | Sciweavers

699 search results - page 13 / 140

» Online Dynamic Value System for Machine Learning

200

click to vote

ICML
2005
IEEE

196views Machine Learning» more ICML 2005»

Bayesian sparse sampling for on-line reward optimization

16 years 7 months ago

Download www.cs.ualberta.ca

We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well known exploration vers...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

187

click to vote

ICML
1998
IEEE

179views Machine Learning» more ICML 1998»

Value Function Based Production Scheduling

16 years 7 months ago

Download www.ri.cmu.edu

Production scheduling, the problem of sequentially con guring a factory to meet forecasted demands, is a critical problem throughout the manufacturing industry. The requirement of...

Jeff G. Schneider, Justin A. Boyan, Andrew W. Moor...

claim paper

Read More »

289

click to vote

COLT
2010
Springer

238views Machine Learning» more COLT 2010»

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

15 years 4 months ago

Download www.colt2010.org

We present a new family of subgradient methods that dynamically incorporate knowledge of the geometry of the data observed in earlier iterations to perform more informative gradie...

John Duchi, Elad Hazan, Yoram Singer

claim paper

Read More »

167

click to vote

DEDS
2006

101views more DEDS 2006»

Near-Optimal Online Control of Dynamic Discrete-Event Systems

15 years 6 months ago

Download research.cs.queensu.ca

A class of time-varying discrete-event systems, named dynamic discrete-event systems, is defined. The goal of this paper is to provide a method which is modular and can be applied ...

Lenko Grigorov, Karen Rudie

claim paper

Read More »

159

click to vote

ICML
2003
IEEE

168views Machine Learning» more ICML 2003»

Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning

16 years 7 months ago

Download webee.technion.ac.il

We present a novel Bayesian approach to the problem of value function estimation in continuous state spaces. We define a probabilistic generative model for the value function by i...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

« Prev « First page 13 / 140 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers