MDP planning | Sciweavers

ATAL
2008
Springer

104views Intelligent Agents» more ATAL 2008»

Expediting RL by using graphical structures

14 years 4 months ago

The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...

Peng Dai, Alexander L. Strehl, Judy Goldsmith

claim paper

Read More »

click to vote

SARA
2007
Springer

152views Artificial Intelligence» more SARA 2007»

Computing and Using Lower and Upper Bounds for Action Elimination in MDP Planning

14 years 8 months ago

Download www.cs.umd.edu

Abstract. We describe a way to improve the performance of MDP planners by modifying them to use lower and upper bounds to eliminate non-optimal actions during their search. First, ...

Ugur Kuter, Jiaqiao Hu

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers