Sciweavers

771 search results - page 63 / 155
» Markov Decision Processes with Arbitrary Reward Processes
CORR 2010 (Springer)
Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence
We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focusing on so-called optimistic strategies. Optimism is usually implemented by carryin...
Sarah Filippi, Olivier Cappé, Aurelien Gari...
PERCOM 2007 (ACM)
Sensor Scheduling for Optimal Observability Using Estimation Entropy
We consider sensor scheduling as the optimal observability problem for partially observable Markov decision processes (POMDPs). This model fits cases where a Markov process ...
Mohammad Rezaeian
ICML 2005 (IEEE)
A causal approach to hierarchical decomposition of factored MDPs
We present Variable Influence Structure Analysis, an algorithm that dynamically performs hierarchical decomposition of factored Markov decision processes. Our algorithm determines...
Anders Jonsson, Andrew G. Barto
NIPS 2004
VDCBPI: an Approximate Scalable Algorithm for Large POMDPs
Existing algorithms for discrete partially observable Markov decision processes can, at best, solve problems of a few thousand states due to two important sources of intractability:...
Pascal Poupart, Craig Boutilier
ICANN 2001 (Springer)
Market-Based Reinforcement Learning in Partially Observable Worlds
Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...
Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber