Search Sciweavers | Sciweavers

128 search results - page 8 / 26

» Hierarchically Optimal Average Reward Reinforcement Learning

218

click to vote

BROADNETS
2004
IEEE

154views Computer Networks» more BROADNETS 2004»

Efficient QoS Provisioning for Adaptive Multimedia in Mobile Communication Networks by Reinforcement Learning

15 years 10 months ago

Download www.ece.ubc.ca

The scarcity and large fluctuations of link bandwidth in wireless networks have motivated the development of adaptive multimedia services in mobile communication networks, where i...

Fei Yu, Vincent W. S. Wong, Victor C. M. Leung

claim paper

Read More »

166

click to vote

GECCO
2006
Springer

159views Optimization» more GECCO 2006»

Standard and averaging reinforcement learning in XCS

15 years 10 months ago

Download www.cs.bham.ac.uk

This paper investigates reinforcement learning (RL) in XCS. First, it formally shows that XCS implements a method of generalized RL based on linear approximators, in which the usu...

Pier Luca Lanzi, Daniele Loiacono

claim paper

Read More »

199

click to vote

ECML
2004
Springer

112views Machine Learning» more ECML 2004»

Convergence and Divergence in Standard and Averaging Reinforcement Learning

16 years 9 days ago

Download igitur-archive.library.uu.nl

Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...

Marco Wiering

claim paper

Read More »

187

click to vote

GECCO
2006
Springer

133views Optimization» more GECCO 2006»

On-line evolutionary computation for reinforcement learning in stochastic domains

15 years 10 months ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

claim paper

Read More »

189

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 6 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

« Prev « First page 8 / 26 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers