Search Sciweavers | Sciweavers

164 search results - page 25 / 33

» Stochastic MINLP optimization using simplicial approximation

NIPS
2001

The Emergence of Multiple Movement Units in the Presence of Noise and Feedback Delay

Download books.nips.cc

Tangential hand velocity profiles of rapid human arm movements often appear as sequences of several bell-shaped acceleration-deceleration phases called submovements or movement un...

Michael Kositsky, Andrew G. Barto

JMLR
2010

A Generalized Path Integral Control Approach to Reinforcement Learning

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

UAI
1998

Flexible Decomposition Algorithms for Weakly Coupled Markov Decision Problems

Download reference.kfupm.edu.sa

This paper presents two new approaches to decomposing and solving large Markov decision problems (MDPs), a partial decoupling method and a complete decoupling method. In these app...

Ronald Parr

CDC
2009
IEEE

Q-learning and Pontryagin's Minimum Principle

Download www.stanford.edu

Q-learning is a technique used to compute an optimal policy for a controlled Markov chain based on observations of the system controlled using a non-optimal policy. It ...

Prashant G. Mehta, Sean P. Meyn

AAAI
1998

Solving Very Large Weakly Coupled Markov Decision Processes

Download www.cs.toronto.edu

We present a technique for computing approximately optimal solutions to stochastic resource allocation problems modeled as Markov decision processes (MDPs). We exploit two key pro...

Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, L...
