Search Sciweavers | Sciweavers

260 search results - page 34 / 52

» Quasi-Deterministic Partially Observable Markov Decision Pro...

217

click to vote

VTC
2008
IEEE

185views Communications» more VTC 2008»

Opportunistic Spectrum Access for Energy-Constrained Cognitive Radios

16 years 12 days ago

Download www1.i2r.a-star.edu.sg

This paper considers a scenario in which a secondary user makes opportunistic use of a channel allocated to some primary network. The primary network operates in a time-slotted ma...

Anh Tuan Hoang, Ying-Chang Liang, David Tung Chong...

claim paper

Read More »

167

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 4 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

149

click to vote

MOR
2008

87views more MOR 2008»

On Near Optimality of the Set of Finite-State Controllers for Average Cost POMDP

15 years 6 months ago

Download www.cs.helsinki.fi

We consider the average cost problem for partially observable Markov decision processes (POMDP) with finite state, observation, and control spaces. We prove that there exists an -...

Huizhen Yu, Dimitri P. Bertsekas

claim paper

Read More »

224

click to vote

NIPS
2007

207views Information Technology» more NIPS 2007»

Bayes-Adaptive POMDPs

15 years 7 months ago

Download books.nips.cc

Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforcement learning...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

150

click to vote

ALT
2006
Springer

111views Machine Learning» more ALT 2006»

Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence

16 years 3 months ago

Download www.idsia.ch

We address the problem of reinforcement learning in which observations may exhibit an arbitrary form of stochastic dependence on past observations and actions. The task for an age...

Daniil Ryabko, Marcus Hutter

claim paper

Read More »

« Prev « First page 34 / 52 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers