Search Sciweavers | Sciweavers

262 search results - page 31 / 53

» Bounded-Parameter Partially Observable Markov Decision Proce...

CORR
2010
Springer

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 2 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

MOR
2008

On Near Optimality of the Set of Finite-State Controllers for Average Cost POMDP

15 years 4 months ago

Download www.cs.helsinki.fi

We consider the average cost problem for partially observable Markov decision processes (POMDP) with finite state, observation, and control spaces. We prove that there exists an ε-...

Huizhen Yu, Dimitri P. Bertsekas

PRIMA
2007
Springer

Multiagent Planning with Trembling-Hand Perfect Equilibrium in Multiagent POMDPs

15 years 10 months ago

Download lang.is.kyushu-u.ac.jp

Multiagent Partially Observable Markov Decision Processes are a popular model of multiagent systems with uncertainty. Since the computational cost for finding an optimal joint pol...

Yuichi Yabu, Makoto Yokoo, Atsushi Iwasaki

ICANN
2007
Springer

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 10 months ago

Download www.idsia.ch

This paper presents Recurrent Policy Gradients, a model-free reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

NIPS
2007

Bayes-Adaptive POMDPs

15 years 5 months ago

Download books.nips.cc

Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforcement learning...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...
