Search Sciweavers | Sciweavers

43 search results - page 6 / 9

» The O.D.E. Method for Convergence of Stochastic Approximatio...

AI
1998
Springer

177 views, Artificial Intelligence

Model-Based Average Reward Reinforcement Learning

13 years 8 months ago

Download web.engr.oregonstate.edu

Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...

Prasad Tadepalli, DoKyeong Ok

NIPS
1996

106 views, Information Technology

Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems

13 years 10 months ago

Download www.cis.upenn.edu

In cellular telephone systems, an important problem is to dynamically allocate the communication resource channels so as to maximize service in a stochastic caller environment. Th...

Satinder P. Singh, Dimitri P. Bertsekas

JMLR
2006

124 views

Policy Gradient in Continuous Time

13 years 8 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

NIPS
1996

112 views, Information Technology

Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning

13 years 10 months ago

Download www.ri.cmu.edu

Model learning combined with dynamic programming has been shown to be effective for learning control of continuous state dynamic systems. The simplest method assumes the learned mod...

Jeff G. Schneider

ICML
2000
IEEE

169 views, Machine Learning

Rates of Convergence for Variable Resolution Schemes in Optimal Control

14 years 9 months ago

Download sequel.futurs.inria.fr

This paper presents a general method to derive tight rates of convergence for numerical approximations in optimal control when we consider variable resolution grids. We study the ...

Andrew W. Moore, Rémi Munos
