Sciweavers

91 search results, page 12 of 19
Search query: Percentile Optimization for Markov Decision Processes with P...

AAAI 2012
Kernel-Based Reinforcement Learning on Representative States
Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batchmod...
Branislav Kveton, Georgios Theocharous
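
The abstract above appeals to the standard MDP framework. As a point of reference only, here is a minimal value-iteration sketch for a small finite MDP; this is a generic textbook routine, not the kernel-based method the paper proposes, and the toy transition and reward arrays are invented for illustration.

```python
import numpy as np

def value_iteration(P, R, gamma=0.95, tol=1e-6):
    """P: (A, S, S) transition probabilities, R: (A, S) expected rewards."""
    n_actions, n_states, _ = P.shape
    V = np.zeros(n_states)
    while True:
        # Q(s, a) = R(a, s) + gamma * sum_s' P(a, s, s') * V(s')
        Q = R + gamma * np.einsum("ast,t->as", P, V)
        V_new = Q.max(axis=0)
        if np.abs(V_new - V).max() < tol:
            return V_new, Q.argmax(axis=0)  # optimal values and a greedy policy
        V = V_new

# Tiny 2-state, 2-action example (made-up numbers).
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.5, 0.5], [0.0, 1.0]]])
R = np.array([[1.0, 0.0],
              [0.0, 2.0]])
V, policy = value_iteration(P, R)
```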

PKDD 2010 (Springer)
Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations
Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...
Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...
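
As context for what planners of this kind operate on, the following is a hedged sketch of the standard POMDP belief-update step on a toy model; the transition and observation matrices are illustrative placeholders and are unrelated to the paper's policy-graph factorization.

```python
import numpy as np

def belief_update(b, a, o, T, Z):
    """b: belief over states, T: (A, S, S) transitions, Z: (A, S, O) observation probs."""
    predicted = b @ T[a]                    # predict: P(s' | b, a)
    unnormalised = Z[a][:, o] * predicted   # correct: weight by P(o | s', a)
    return unnormalised / unnormalised.sum()

# Tiny 2-state, 1-action, 2-observation example (made-up numbers).
T = np.array([[[0.7, 0.3], [0.1, 0.9]]])
Z = np.array([[[0.8, 0.2], [0.3, 0.7]]])
b = np.array([0.5, 0.5])
b_next = belief_update(b, a=0, o=1, T=T, Z=Z)
```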

ATAL 2007 (Springer)
Subjective approximate solutions for decentralized POMDPs
Planning for cooperative teams under uncertainty is a crucial problem in multiagent systems. Decentralized partially observable Markov decision processes (DEC-POMDPs) prov...
Anton Chechetka, Katia P. Sycara

HRI 2007 (ACM)
Efficient model learning for dialog management
Intelligent planning algorithms such as the Partially Observable Markov Decision Process (POMDP) have succeeded in dialog management applications [10, 11, 12] because of their rob...
Finale Doshi, Nicholas Roy

AAAI 2006
Compact, Convex Upper Bound Iteration for Approximate POMDP Planning
Partially observable Markov decision processes (POMDPs) are an intuitive and general way to model sequential decision making problems under uncertainty. Unfortunately, even approx...
Tao Wang, Pascal Poupart, Michael H. Bowling, Dale...