Search Sciweavers | Sciweavers


» Mean-Variance Optimization in Markov Decision Processes


AAAI
2010


Robust Policy Computation in Reward-Uncertain MDPs Using Nondominated Policies

15 years 6 months ago

Download www.cs.toronto.edu

The precise specification of reward functions for Markov decision processes (MDPs) is often extremely difficult, motivating research into both reward elicitation and the robust so...

Kevin Regan, Craig Boutilier


CONCUR
2006
Springer


Strategy Improvement for Stochastic Rabin and Streett Games

15 years 9 months ago

Download mtc.epfl.ch

A stochastic graph game is played by two players on a game graph with probabilistic transitions. We consider stochastic graph games with ω-regular winning conditions specified as Ra...

Krishnendu Chatterjee, Thomas A. Henzinger


AAAI
2006


Targeting Specific Distributions of Trajectories in MDPs

15 years 6 months ago

Download www.cc.gatech.edu

We define TTD-MDPs, a novel class of Markov decision processes where the traditional goal of an agent is changed from finding an optimal trajectory through a state space to realiz...

David L. Roberts, Mark J. Nelson, Charles Lee Isbe...


AAAI
2006


Point-based Dynamic Programming for DEC-POMDPs

15 years 6 months ago

Download hal.archives-ouvertes.fr

We introduce point-based dynamic programming (DP) for decentralized partially observable Markov decision processes (DEC-POMDPs), a new discrete DP algorithm for planning strategie...

Daniel Szer, François Charpillet


AIPS
2006


Solving Factored MDPs with Exponential-Family Transition Models

15 years 6 months ago

Download www.cs.pitt.edu

Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...

Branislav Kveton, Milos Hauskrecht
