Search Sciweavers | Sciweavers

473 search results - page 63 / 95

» Optimal policy switching algorithms for reinforcement learni...

184

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

15 years 5 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

224

click to vote

CORR
2012
Springer

216views Education» more CORR 2012»

Fractional Moments on Bandit Problems

14 years 1 months ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to ﬁnd profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran

claim paper

Read More »

149

click to vote

DSP
2006

161views Emerging Technology» more DSP 2006»

Adaptive multi-modality sensor scheduling for detection and tracking of smart targets

15 years 5 months ago

Download www-personal.umich.edu

This paper considers the problem of sensor scheduling for the purposes of detection and tracking of "smart" targets. Smart targets are targets that can detect when they ...

Christopher M. Kreucher, Doron Blatt, Alfred O. He...

claim paper

Read More »

152

click to vote

AUSAI
2005
Springer

166views Artificial Intelligence» more AUSAI 2005»

Adaptive Utility-Based Scheduling in Resource-Constrained Systems

15 years 11 months ago

Download labs.oracle.com

This paper addresses the problem of scheduling jobs in soft real-time systems, where the utility of completing each job decreases over time. We present a utility-based framework fo...

David Vengerov

claim paper

Read More »

162

click to vote

NIPS
1993

134views Information Technology» more NIPS 1993»

Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming

15 years 7 months ago

Download www.cs.cmu.edu

Dynamic programming provides a methodology to develop planners and controllers for nonlinear systems. However, general dynamic programming is computationally intractable. We have ...

Christopher G. Atkeson

claim paper

Read More »

« Prev « First page 63 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers