Search Sciweavers | Sciweavers

3381 search results - page 218 / 677

» LEO - DB2's LEarning Optimizer

111

click to vote

GECCO
2005
Springer

97views Optimization» more GECCO 2005»

Interactive estimation of agent-based financial markets models: modularity and learning

15 years 10 months ago

Download www.cs.bham.ac.uk

Building upon the interactive inversion method introduced by Ashburn and Bonabeau (2004), we show how to dramatically improve the results by exploiting modularity and by letting t...

M. Ihsan Ecemis, Eric Bonabeau, Trent Ashburn

claim paper

Read More »

128

click to vote

COLT
2004
Springer

99views Machine Learning» more COLT 2004»

Reinforcement Learning for Average Reward Zero-Sum Games

15 years 10 months ago

Download www.ece.mcgill.ca

Abstract. We consider Reinforcement Learning for average reward zerosum stochastic games. We present and analyze two algorithms. The ﬁrst is based on relative Q-learning and the ...

Shie Mannor

claim paper

Read More »

127

click to vote

IDEAL
2004
Springer

138views Intelligent Agents» more IDEAL 2004»

DIVACE: Diverse and Accurate Ensemble Learning Algorithm

15 years 10 months ago

Download www.cs.bham.ac.uk

In order for a neural network ensemble to generalise properly, two factors are considered vital. One is the diversity and the other is the accuracy of the networks that comprise th...

Arjun Chandra, Xin Yao

claim paper

Read More »

121

click to vote

PRICAI
2004
Springer

162views Artificial Intelligence» more PRICAI 2004»

Covisibility-Based Map Learning Method for Mobile Robots

15 years 10 months ago

Download www.space.rcast.u-tokyo.ac.jp

In previous work, we proposed a unique landmark-based map learning method for mobile robots based on the “co-visibility” information i.e., very coarse qualitative information o...

Takehisa Yairi

claim paper

Read More »

153

click to vote

ICANN
2001
Springer

123views Neural Networks» more ICANN 2001»

Market-Based Reinforcement Learning in Partially Observable Worlds

15 years 9 months ago

Download www.hutter1.net

Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...

Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber

claim paper

Read More »

« Prev « First page 218 / 677 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers