Sciweavers

777 search results - page 13 / 156
» Learning dynamic algorithm portfolios
CIA
2007
Springer
14 years 1 month ago
Multi-agent Learning Dynamics: A Survey
Abstract. In this paper we compare state-of-the-art multi-agent reinforcement learning algorithms in a wide variety of games. We consider two types of algorithms: value iteration a...
H. Jaap van den Herik, Daniel Hennes, Michael Kais...
NIPS
1993
13 years 8 months ago
Convergence of Stochastic Iterative Dynamic Programming Algorithms
Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms, includ...
Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...
IJIT
2004
13 years 8 months ago
Improving the Convergence of the Backpropagation Algorithm Using Local Adaptive Techniques
Since the presentation of the backpropagation algorithm, a vast variety of improvements to the technique for training feedforward neural networks have been proposed. This articl...
Z. Zainuddin, N. Mahat, Y. Abu Hassan
EUSFLAT
2003
13 years 8 months ago
Stability of backpropagation learning rule
Control of real processes requires a different approach to neural network learning. The presented modification of the backpropagation learning algorithm changes the meaning of learning...
Petr Krupanský, Petr Pivoňka, Jiri D...
SARA
2007
Springer
14 years 1 month ago
Active Learning of Dynamic Bayesian Networks in Markov Decision Processes
Several recent techniques for solving Markov decision processes use dynamic Bayesian networks to compactly represent tasks. The dynamic Bayesian network representation may not be g...
Anders Jonsson, Andrew G. Barto