Search Sciweavers | Sciweavers

107 search results - page 7 / 22

» Approximate Linear Programming for Constrained Partially Obs...

click to vote

ECAI
2010
Springer

227views Artificial Intelligence» more ECAI 2010»

On Finding Compromise Solutions in Multiobjective Markov Decision Processes

13 years 8 months ago

Download www-desir.lip6.fr

A Markov Decision Process (MDP) is a general model for solving planning problems under uncertainty. It has been extended to multiobjective MDP to address multicriteria or multiagen...

Patrice Perny, Paul Weng

claim paper

Read More »

click to vote

AIPS
2003

149views Artificial Intelligence» more AIPS 2003»

Synthesis of Hierarchical Finite-State Controllers for POMDPs

13 years 8 months ago

Download www.aaai.org

We develop a hierarchical approach to planning for partially observable Markov decision processes (POMDPs) in which a policy is represented as a hierarchical ﬁnite-state control...

Eric A. Hansen, Rong Zhou

claim paper

Read More »

click to vote

CORR
2010
Springer

112views Education» more CORR 2010»

Efficient Approximation of Optimal Control for Markov Games

13 years 7 months ago

Download react.cs.uni-sb.de

The success of probabilistic model checking for discrete-time Markov decision processes and continuous-time Markov chains has led to rich academic and industrial applications. The ...

Markus Rabe, Sven Schewe, Lijun Zhang

claim paper

Read More »

click to vote

JAIR
2006

122views more JAIR 2006»

Solving Factored MDPs with Hybrid State and Action Variables

13 years 7 months ago

Download www.jair.org

Efficient representations and solutions for large decision problems with continuous and discrete variables are among the most important challenges faced by the designers of automa...

Branislav Kveton, Milos Hauskrecht, Carlos Guestri...

claim paper

Read More »

click to vote

FOCS
2007
IEEE

157views Theoretical Computer Science» more FOCS 2007»

Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards

14 years 1 months ago

Download www.cis.upenn.edu

We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...

Sudipto Guha, Kamesh Munagala

claim paper

Read More »

« Prev « First page 7 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers