Search Sciweavers | Sciweavers

» Using Learning for Approximation in Stochastic Processes

ATAL
2007
Springer

Model-based function approximation in reinforcement learning

Reinforcement learning promises a generic method for adapting agents to arbitrary tasks in arbitrary stochastic environments, but applying it to new real-world problems remains di...

Nicholas K. Jong, Peter Stone

AUTOMATICA
2006

Simulation-based optimization of process control policies for inventory management in supply chains

A simulation-based optimization framework involving simultaneous perturbation stochastic approximation (SPSA) is presented as a means for optimally specifying parameters of intern...

Jay D. Schwartz, Wenlin Wang, Daniel E. Rivera

ICML
2010
IEEE

Rectified Linear Units Improve Restricted Boltzmann Machines

Restricted Boltzmann machines were developed using binary stochastic hidden units. These can be generalized by replacing each binary unit by an infinite number of copies that all ...

Vinod Nair, Geoffrey E. Hinton

NN
2010
Springer

Efficient exploration through active learning for value function approximation in reinforcement learning

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...

Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...

CDC
2010
IEEE

Adaptive bases for Q-learning

We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor
