Search Sciweavers | Sciweavers

337 search results - page 6 / 68

» Mean-Variance Optimization in Markov Decision Processes

154

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

15 years 6 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

127

click to vote

ECML
2005
Springer

143views Machine Learning» more ECML 2005»

Active Learning in Partially Observable Markov Decision Processes

15 years 10 months ago

Download www.cs.mcgill.ca

This paper examines the problem of ﬁnding an optimal policy for a Partially Observable Markov Decision Process (POMDP) when the model is not known or is only poorly speciﬁed. W...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

145

click to vote

ICML
2006
IEEE

144views Machine Learning» more ICML 2006»

Probabilistic inference for solving discrete and continuous state Markov Decision Processes

16 years 6 months ago

Download eprints.pascal-network.org

Inference in Markov Decision Processes has recently received interest as a means to infer goals of an observed action, policy recognition, and also as a tool to compute policies. ...

Marc Toussaint, Amos J. Storkey

claim paper

Read More »

130

click to vote

AAAI
2004

117views Intelligent Agents» more AAAI 2004»

Solving Concurrent Markov Decision Processes

15 years 6 months ago

Download www.cs.washington.edu

Typically, Markov decision problems (MDPs) assume a single action is executed per decision epoch, but in the real world one may frequently execute certain actions in parallel. Thi...

Mausam, Daniel S. Weld

claim paper

Read More »

151

click to vote

GLVLSI
2005
IEEE

118views VLSI» more GLVLSI 2005»

A continuous time markov decision process based on-chip buffer allocation methodology

15 years 10 months ago

Download www.ece.sunysb.edu

We have presented an optimal on-chip buﬀer allocation and buﬀer insertion methodology which uses stochastic models of the architecture. This methodology uses ﬁnite buﬀer s...

Sankalp Kallakuri, Nattawut Thepayasuwan, Alex Dob...

claim paper

Read More »

« Prev « First page 6 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers