non-deterministic policies

JAIR
2011

144views more JAIR 2011»

Non-Deterministic Policies in Markovian Decision Processes

13 years 10 months ago

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

click to vote

AAAI
2006

86views Intelligent Agents» more AAAI 2006»

Targeting Specific Distributions of Trajectories in MDPs

14 years 4 months ago

Download www.cc.gatech.edu

We define TTD-MDPs, a novel class of Markov decision processes where the traditional goal of an agent is changed from finding an optimal trajectory through a state space to realiz...

David L. Roberts, Mark J. Nelson, Charles Lee Isbe...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers