Search Sciweavers | Sciweavers

113 search results - page 11 / 23

» Learning Representation and Control in Continuous Markov Dec...

186

click to vote

UAI
2000

133views Artificial Intelligence» more UAI 2000»

PEGASUS: A policy search method for large MDPs and POMDPs

15 years 8 months ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan

claim paper

Read More »

200

click to vote

CORR
2012
Springer

193views Education» more CORR 2012»

A Unifying Framework for Linearly Solvable Control

14 years 3 months ago

Download www.cs.washington.edu

Recent work has led to the development of an elegant theory of Linearly Solvable Markov Decision Processes (LMDPs) and related Path-Integral Control Problems. Traditionally, LMDPs...

Krishnamurthy Dvijotham, Emanuel Todorov

claim paper

Read More »

222

Voted

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 8 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

225

Voted

TIP
2008

169views more TIP 2008»

Weakly Supervised Learning of a Classifier for Unusual Event Detection

15 years 7 months ago

Download hci.iwr.uni-heidelberg.de

In this paper, we present an automatic classification framework combining appearance based features and Hidden Markov Models (HMM) to detect unusual events in image sequences. One...

Mark Jager, Christian Knoll, Fred A. Hamprecht

claim paper

Read More »

196

click to vote

CORR
2010
Springer

106views Education» more CORR 2010»

MDPs with Unawareness

15 years 7 months ago

Download www.cs.cornell.edu

Markov decision processes (MDPs) are widely used for modeling decision-making problems in robotics, automated control, and economics. Traditional MDPs assume that the decision mak...

Joseph Y. Halpern, Nan Rong, Ashutosh Saxena

claim paper

Read More »

« Prev « First page 11 / 23 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers