Search Sciweavers | Sciweavers

81 search results - page 11 / 17

» The Optimal Reward Baseline for Gradient-Based Reinforcement...

189

click to vote

ATAL
2009
Springer

137views Intelligent Agents» more ATAL 2009»

Generalized model learning for reinforcement learning in factored domains

16 years 2 months ago

Download userweb.cs.utexas.edu

Improving the sample eﬃciency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-ba...

Todd Hester, Peter Stone

claim paper

Read More »

205

click to vote

AAAI
2000

139views Intelligent Agents» more AAAI 2000»

Localizing Search in Reinforcement Learning

15 years 8 months ago

Download www.cs.colorado.edu

Reinforcement learning (RL) can be impractical for many high dimensional problems because of the computational cost of doing stochastic search in large state spaces. We propose a ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

275

click to vote

NIPS
2007

207views Information Technology» more NIPS 2007»

Bayes-Adaptive POMDPs

15 years 9 months ago

Download books.nips.cc

Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforcement learning...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

218

click to vote

CEC
2005
IEEE

98views Artificial Intelligence» more CEC 2005»

XCS with computed prediction in continuous multistep environments

15 years 9 months ago

Download www.eskimo.com

We apply XCS with computed prediction (XCSF) to tackle multistep reinforcement learning problems involving continuous inputs. In essence we use XCSF as a method of generalized rein...

Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...

claim paper

Read More »

242

click to vote

BROADNETS
2004
IEEE

154views Computer Networks» more BROADNETS 2004»

Efficient QoS Provisioning for Adaptive Multimedia in Mobile Communication Networks by Reinforcement Learning

15 years 11 months ago

Download www.ece.ubc.ca

The scarcity and large fluctuations of link bandwidth in wireless networks have motivated the development of adaptive multimedia services in mobile communication networks, where i...

Fei Yu, Vincent W. S. Wong, Victor C. M. Leung

claim paper

Read More »

« Prev « First page 11 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers