Search Sciweavers | Sciweavers

128 search results - page 7 / 26

» Hierarchically Optimal Average Reward Reinforcement Learning

137

TSMC
2002

69views more TSMC 2002»

A new learning algorithm for the hierarchical structure learning automata operating in the nonstationary S-model random environm

15 years 6 months ago

Download ir.lib.osaka-kyoiku.ac.jp

An extended algorithm of the relative reward strength algorithm is proposed. It is shown that the proposed algorithm ensures the convergence with probability 1 to the optimal path ...

Norio Baba, Yoshio Mogami

claim paper

Read More »

152

click to vote

PRIMA
2009
Springer

102views Intelligent Agents» more PRIMA 2009»

Recursive Adaptation of Stepsize Parameter for Non-stationary Environments

16 years 1 months ago

Download teamcore.usc.edu

In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...

Itsuki Noda

claim paper

Read More »

178

Voted

ALT
2006
Springer

111views Machine Learning» more ALT 2006»

Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence

16 years 3 months ago

Download www.idsia.ch

We address the problem of reinforcement learning in which observations may exhibit an arbitrary form of stochastic dependence on past observations and actions. The task for an age...

Daniil Ryabko, Marcus Hutter

claim paper

Read More »

165

click to vote

ICAC
2005
IEEE

108views Applied Computing» more ICAC 2005»

Self-Optimizing Architecture for QoS Provisioning in Differentiated Services

16 years 16 days ago

Download csdl2.computer.org

This paper presents a scalable and self-optimizing architecture for Quality-of-Service (QoS) provisioning in the Differentiated Services (DiffServ) framework. The proposed archite...

Daniel Yagan, Chen-Khong Tham

claim paper

Read More »

213

click to vote

GECCO
2006
Springer

198views Optimization» more GECCO 2006»

Reward allotment in an event-driven hybrid learning classifier system for online soccer games

15 years 10 months ago

Download www.cs.bham.ac.uk

This paper describes our study into the concept of using rewards in a classifier system applied to the acquisition of decision-making algorithms for agents in a soccer game. Our a...

Yuji Sato, Yosuke Akatsuka, Takenori Nishizono

claim paper

Read More »

« Prev « First page 7 / 26 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers