Search Sciweavers | Sciweavers

113 search results - page 9 / 23

» Learning Representation and Control in Continuous Markov Dec...

click to vote

JMLR
2006

116views more JMLR 2006»

Point-Based Value Iteration for Continuous POMDPs

13 years 9 months ago

Download jmlr.csail.mit.edu

We propose a novel approach to optimize Partially Observable Markov Decisions Processes (POMDPs) defined on continuous spaces. To date, most algorithms for model-based POMDPs are ...

Josep M. Porta, Nikos A. Vlassis, Matthijs T. J. S...

claim paper

Read More »

click to vote

PKDD
2009
Springer

129views Data Mining» more PKDD 2009»

Considering Unseen States as Impossible in Factored Reinforcement Learning

14 years 3 months ago

Download www-desir.lip6.fr

Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...

Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...

claim paper

Read More »

click to vote

UAI
2004

195views Artificial Intelligence» more UAI 2004»

Solving Factored MDPs with Continuous and Discrete Variables

13 years 10 months ago

Download www.cs.pitt.edu

Although many real-world stochastic planning problems are more naturally formulated by hybrid models with both discrete and continuous variables, current state-of-the-art methods ...

Carlos Guestrin, Milos Hauskrecht, Branislav Kveto...

claim paper

Read More »

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

13 years 9 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

click to vote

ICML
2008
IEEE

122views Machine Learning» more ICML 2008»

Reinforcement learning in the presence of rare events

14 years 9 months ago

Download www.ece.mcgill.ca

We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the controlling agent. If these ev...

Jordan Frank, Shie Mannor, Doina Precup

claim paper

Read More »

« Prev « First page 9 / 23 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers