Search Sciweavers | Sciweavers

3381 search results - page 196 / 677

» LEO - DB2's LEarning Optimizer

147

Voted

GECCO
2006
Springer

208views Optimization» more GECCO 2006»

Comparing evolutionary and temporal difference methods in a reinforcement learning domain

15 years 8 months ago

Download www.cs.bham.ac.uk

Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

154

click to vote

GECCO
2000
Springer

143views Optimization» more GECCO 2000»

A Genetic Algorithm for Automatically Designing Modular Reinforcement Learning Agents

15 years 8 months ago

Download www.cs.bham.ac.uk

Reinforcement learning (RL) is one of the machine learning techniques and has been received much attention as a new self-adaptive controller for various systems. The RL agent auto...

Isao Ono, Tetsuo Nijo, Norihiko Ono

claim paper

Read More »

126

click to vote

ICASSP
2011
IEEE

102views Signal Processing» more ICASSP 2011»

Social norm and long-run learning in peer-to-peer networks

14 years 8 months ago

Download mirlab.org

We start by formulating the resource sharing in peer-to-peer (P2P) networks as a random-matching gift-giving game, where self-interested peers aim at maximizing their own long-ter...

Yu Zhang, Mihaela van der Schaar

claim paper

Read More »

227

click to vote

GECCO
2011
Springer

276views Optimization» more GECCO 2011»

Evolution of reward functions for reinforcement learning

14 years 8 months ago

Download hampshire.edu

The reward functions that drive reinforcement learning systems are generally derived directly from the descriptions of the problems that the systems are being used to solve. In so...

Scott Niekum, Lee Spector, Andrew G. Barto

claim paper

Read More »

269

Voted

TKDE
2012

245views Formal Methods» more TKDE 2012»

Semi-Supervised Maximum Margin Clustering with Pairwise Constraints

13 years 7 months ago

Download www.comp.hkbu.edu.hk

—The pairwise constraints specifying whether a pair of samples should be grouped together or not have been successfully incorporated into the conventional clustering methods such...

Hong Zeng, Yiu-ming Cheung

claim paper

Read More »

« Prev « First page 196 / 677 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers