Search Sciweavers | Sciweavers

60 search results - page 6 / 12

» Region-based value iteration for partially observable Markov...

click to vote

AAAI
2007

131views Intelligent Agents» more AAAI 2007»

Scaling Up: Solving POMDPs through Value Based Clustering

13 years 9 months ago

Download www.aaai.org

Partially Observable Markov Decision Processes (POMDPs) provide an appropriately rich model for agents operating under partial knowledge of the environment. Since ﬁnding an opti...

Yan Virin, Guy Shani, Solomon Eyal Shimony, Ronen ...

claim paper

Read More »

click to vote

UAI
2000

133views Artificial Intelligence» more UAI 2000»

PEGASUS: A policy search method for large MDPs and POMDPs

13 years 8 months ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan

claim paper

Read More »

click to vote

ICTAI
2010
IEEE

226views Artificial Intelligence» more ICTAI 2010»

A Closer Look at MOMDPs

13 years 5 months ago

Download www.loria.fr

Abstract--The difficulties encountered in sequential decisionmaking problems under uncertainty are often linked to the large size of the state space. Exploiting the structure of th...

Mauricio Araya-López, Vincent Thomas, Olivi...

claim paper

Read More »

click to vote

NIPS
2008

132views Information Technology» more NIPS 2008»

Bayesian Model of Behaviour in Economic Games

13 years 9 months ago

Download www.gatsby.ucl.ac.uk

Classical game theoretic approaches that make strong rationality assumptions have difficulty modeling human behaviour in economic games. We investigate the role of finite levels o...

Debajyoti Ray, Brooks King-Casas, P. Read Montague...

claim paper

Read More »

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

13 years 6 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

« Prev « First page 6 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers