Search Sciweavers | Sciweavers

1176 search results - page 37 / 236

» Sparse reward processes

131

click to vote

ICML
2006
IEEE

144views Machine Learning» more ICML 2006»

Probabilistic inference for solving discrete and continuous state Markov Decision Processes

16 years 4 months ago

Download eprints.pascal-network.org

Inference in Markov Decision Processes has recently received interest as a means to infer goals of an observed action, policy recognition, and also as a tool to compute policies. ...

Marc Toussaint, Amos J. Storkey

claim paper

Read More »

134

Voted

ATAL
2003
Springer

152views Intelligent Agents» more ATAL 2003»

Transition-independent decentralized markov decision processes

15 years 9 months ago

Download anytime.cs.umass.edu

There has been substantial progress with formal models for sequential decision making by individual agents using the Markov decision process (MDP). However, similar treatment of m...

Raphen Becker, Shlomo Zilberstein, Victor R. Lesse...

claim paper

Read More »

131

click to vote

AIPS
2008

148views Artificial Intelligence» more AIPS 2008»

Bounded-Parameter Partially Observable Markov Decision Processes

15 years 6 months ago

Download www.aaai.org

The POMDP is considered as a powerful model for planning under uncertainty. However, it is usually impractical to employ a POMDP with exact parameters to model precisely the real-...

Yaodong Ni, Zhi-Qiang Liu

claim paper

Read More »

127

Voted

AAAI
1997

133views Intelligent Agents» more AAAI 1997»

Incremental Methods for Computing Bounds in Partially Observable Markov Decision Processes

15 years 5 months ago

Download www.cs.pitt.edu

Partially observable Markov decision processes (POMDPs) allow one to model complex dynamic decision or control problems that include both action outcome uncertainty and imperfect ...

Milos Hauskrecht

claim paper

Read More »

115

click to vote

CDC
2009
IEEE

169views Control Systems» more CDC 2009»

Parametric regret in uncertain Markov decision processes

15 years 8 months ago

Download www.cim.mcgill.ca

— We consider decision making in a Markovian setup where the reward parameters are not known in advance. Our performance criterion is the gap between the performance of the best ...

Huan Xu, Shie Mannor

claim paper

Read More »

« Prev « First page 37 / 236 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers