Sciweavers

3381 search results - page 196 / 677
» LEO - DB2's LEarning Optimizer
Sort
View
147
Voted
GECCO
2006
Springer
208views Optimization» more  GECCO 2006»
15 years 8 months ago
Comparing evolutionary and temporal difference methods in a reinforcement learning domain
Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...
Matthew E. Taylor, Shimon Whiteson, Peter Stone
GECCO
2000
Springer
143views Optimization» more  GECCO 2000»
15 years 8 months ago
A Genetic Algorithm for Automatically Designing Modular Reinforcement Learning Agents
Reinforcement learning (RL) is one of the machine learning techniques and has been received much attention as a new self-adaptive controller for various systems. The RL agent auto...
Isao Ono, Tetsuo Nijo, Norihiko Ono
ICASSP
2011
IEEE
14 years 8 months ago
Social norm and long-run learning in peer-to-peer networks
We start by formulating the resource sharing in peer-to-peer (P2P) networks as a random-matching gift-giving game, where self-interested peers aim at maximizing their own long-ter...
Yu Zhang, Mihaela van der Schaar
GECCO
2011
Springer
276views Optimization» more  GECCO 2011»
14 years 8 months ago
Evolution of reward functions for reinforcement learning
The reward functions that drive reinforcement learning systems are generally derived directly from the descriptions of the problems that the systems are being used to solve. In so...
Scott Niekum, Lee Spector, Andrew G. Barto
269
Voted
TKDE
2012
245views Formal Methods» more  TKDE 2012»
13 years 7 months ago
Semi-Supervised Maximum Margin Clustering with Pairwise Constraints
—The pairwise constraints specifying whether a pair of samples should be grouped together or not have been successfully incorporated into the conventional clustering methods such...
Hong Zeng, Yiu-ming Cheung