Search Sciweavers | Sciweavers

473 search results - page 53 / 95

» Optimal policy switching algorithms for reinforcement learni...

ATAL
2007
Springer

Intelligent Agents

Reinforcement learning in extensive form games with incomplete information: the bargaining case study

16 years 9 days ago

Download home.dei.polimi.it

We consider the problem of finding optimal strategies in infinite extensive form games with incomplete information that are repeatedly played. This problem is still open in lite...

Alessandro Lazaric, Jose Enrique Munoz de Cote, Ni...

BROADNETS
2004
IEEE

Computer Networks

Efficient QoS Provisioning for Adaptive Multimedia in Mobile Communication Networks by Reinforcement Learning

15 years 10 months ago

Download www.ece.ubc.ca

The scarcity and large fluctuations of link bandwidth in wireless networks have motivated the development of adaptive multimedia services in mobile communication networks, where i...

Fei Yu, Vincent W. S. Wong, Victor C. M. Leung

COGSR
2011

Psychological models of human and optimal performance in bandit problems

15 years 1 months ago

Download www.socsci.uci.edu

In bandit problems, a decision-maker must choose between a set of alternatives, each of which has a fixed but unknown rate of reward, to maximize their total number of rewards ov...

Michael D. Lee, Shunan Zhang, Miles Munro, Mark St...

COLT
2004
Springer

Machine Learning

Reinforcement Learning for Average Reward Zero-Sum Games

15 years 11 months ago

Download www.ece.mcgill.ca

Abstract. We consider Reinforcement Learning for average reward zero-sum stochastic games. We present and analyze two algorithms. The first is based on relative Q-learning and the ...

Shie Mannor

ICML
2010
IEEE

Machine Learning

Finite-Sample Analysis of LSTD

15 years 7 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...
