Generalized pursuit learning schemes: new families of continuous and discretized learning automata

15 years 6 months ago

Download ce.sharif.edu

The fastest learning automata (LA) algorithms currently available fall in the family of estimator algorithms introduced by Thathachar and Sastry [24]. The pioneering work of these authors was the pursuit algorithm, which pursues only the current estimated optimal action. If this action is not the one with the minimum penalty probability, this algorithm pursues a wrong action. In this paper, we argue that a pursuit scheme that generalizes the traditional pursuit algorithm by pursuing all the actions with higher reward estimates than the chosen action, minimizes the probability of pursuing a wrong action, and is a faster converging scheme. To attest this, we present two new generalized pursuit algorithms (GPAs) and also present a quantitative comparison of their performance against the existing pursuit algorithms. Empirically, the algorithms proposed here are among the fastest reported LA to date.

M. Agache, B. John Oommen

Real-time Traffic

Algorithms | Pursuit Algorithms | TSMC 2002 | Wrong Action |

claim paper

Added	23 Dec 2010
Updated	23 Dec 2010
Type	Journal
Year	2002
Where	TSMC
Authors	M. Agache, B. John Oommen

Sciweavers

Generalized pursuit learning schemes: new families of continuous and discretized learning automata

Algorithms | Pursuit Algorithms | TSMC 2002 | Wrong Action |

Explore & Download

Productivity Tools

Sciweavers