Sciweavers

GECCO
2005
Springer
15 years 10 months ago
Interactive estimation of agent-based financial markets models: modularity and learning
Building upon the interactive inversion method introduced by Ashburn and Bonabeau (2004), we show how to dramatically improve the results by exploiting modularity and by letting t...
M. Ihsan Ecemis, Eric Bonabeau, Trent Ashburn
COLT
2004
Springer
15 years 10 months ago
Reinforcement Learning for Average Reward Zero-Sum Games
We consider Reinforcement Learning for average reward zero-sum stochastic games. We present and analyze two algorithms. The first is based on relative Q-learning and the ...
Shie Mannor
IDEAL
2004
Springer
15 years 10 months ago
DIVACE: Diverse and Accurate Ensemble Learning Algorithm
In order for a neural network ensemble to generalise properly, two factors are considered vital. One is the diversity and the other is the accuracy of the networks that comprise th...
Arjun Chandra, Xin Yao
PRICAI
2004
Springer
15 years 10 months ago
Covisibility-Based Map Learning Method for Mobile Robots
In previous work, we proposed a unique landmark-based map learning method for mobile robots based on “co-visibility” information, i.e., very coarse qualitative information o...
Takehisa Yairi
ICANN
2001
Springer
15 years 9 months ago
Market-Based Reinforcement Learning in Partially Observable Worlds
Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...
Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber