Sciweavers

147 search results - page 6 / 30
» Policy Gradient in Continuous Time
Sort
View
ICML
2006
IEEE
14 years 10 months ago
Probabilistic inference for solving discrete and continuous state Markov Decision Processes
Inference in Markov Decision Processes has recently received interest as a means to infer goals of an observed action, policy recognition, and also as a tool to compute policies. ...
Marc Toussaint, Amos J. Storkey
IANDC
2007
94views more  IANDC 2007»
13 years 9 months ago
Mediating secure information flow policies
In this paper we study secure information flow policies in the sense of Meadows [12] and others for aggregated datasets, collectively. We first present a method for combining di...
Guo-Qiang Zhang
ICCV
2005
IEEE
14 years 3 months ago
Conformal Metrics and True "Gradient Flows" for Curves
We wish to endow the manifold M of smooth curves in lRn with a Riemannian metric that allows us to treat continuous morphs (homotopies) between two curves c0 and c1 as trajectorie...
Anthony J. Yezzi, Andrea Mennucci
ICRA
2010
IEEE
145views Robotics» more  ICRA 2010»
13 years 8 months ago
Reinforcement learning of motor skills in high dimensions: A path integral approach
— Reinforcement learning (RL) is one of the most general approaches to learning control. Its applicability to complex motor systems, however, has been largely impossible so far d...
Evangelos Theodorou, Jonas Buchli, Stefan Schaal
IOR
2008
90views more  IOR 2008»
13 years 9 months ago
Optimal Position-Based Warehouse Ordering in Divergent Two-Echelon Inventory Systems
: A continuous review two-echelon inventory system with a central warehouse and a number of non-identical retailers is considered. The retailers face independent Poisson demand and...
Sven Axsäter, Johan Marklund