Search Sciweavers | Sciweavers

102 search results - page 1 / 21

» MDPs with Non-Deterministic Policies

158

click to vote

NIPS
2008

171views Information Technology» more NIPS 2008»

MDPs with Non-Deterministic Policies

15 years 7 months ago

Download www.cs.mcgill.ca

Markov Decision Processes (MDPs) have been extensively studied and used in the context of planning and decision-making, and many methods exist to find the optimal policy for probl...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

162

click to vote

JAIR
2011

144views more JAIR 2011»

Non-Deterministic Policies in Markovian Decision Processes

15 years 26 days ago

Download www.jair.org

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

184

click to vote

AIPS
2011

216views Artificial Intelligence» more AIPS 2011»

Heuristic Search for Generalized Stochastic Shortest Path MDPs

14 years 9 months ago

Download www.cs.washington.edu

Research in efﬁcient methods for solving inﬁnite-horizon MDPs has so far concentrated primarily on discounted MDPs and the more general stochastic shortest path problems (SSPs...

Andrey Kolobov, Mausam, Daniel S. Weld, Hector Gef...

claim paper

Read More »

130

click to vote

NIPS
2003

196views Information Technology» more NIPS 2003»

Approximate Policy Iteration with a Policy Language Bias

15 years 7 months ago

Download www.jair.org

We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual...

Alan Fern, Sung Wook Yoon, Robert Givan

claim paper

Read More »

161

click to vote

AAAI
2010

136views Intelligent Agents» more AAAI 2010»

Robust Policy Computation in Reward-Uncertain MDPs Using Nondominated Policies

15 years 7 months ago

Download www.cs.toronto.edu

The precise specification of reward functions for Markov decision processes (MDPs) is often extremely difficult, motivating research into both reward elicitation and the robust so...

Kevin Regan, Craig Boutilier

claim paper

Read More »

« Prev « First page 1 / 21 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers