Search Sciweavers | Sciweavers

148 search results - page 26 / 30

» Reinforcement Learning for P2P Searching

CPAIOR
2010
Springer

141 views, Operations Research

Strong Combination of Ant Colony Optimization with Constraint Programming Optimization

14 years 14 days ago

Download liris.cnrs.fr

We introduce an approach which combines ACO (Ant Colony Optimization) and IBM ILOG CP Optimizer for solving COPs (Combinatorial Optimization Problems). The problem is modeled using...

Madjid Khichane, Patrick Albert, Christine Solnon

ATAL
2008
Springer

146 views, Intelligent Agents

Adaptive Kanerva-based function approximation for multi-agent systems

13 years 9 months ago

Download www.aamas-conference.org

In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving large-scale instanc...

Cheng Wu, Waleed Meleis

GECCO
2006
Springer

177 views, Optimization

Hyper-ellipsoidal conditions in XCS: rotation, linear approximation, and solution structure

13 years 11 months ago

Download www.eskimo.com

The learning classifier system XCS is an iterative rule-learning system that evolves rule structures based on gradient-based prediction and rule quality estimates. Besides classifi...

Martin V. Butz, Pier Luca Lanzi, Stewart W. Wilson

AI
2002
Springer

117 views, Artificial Intelligence

Programming backgammon using self-teaching neural nets

13 years 7 months ago

Download www.math-info.univ-paris5.fr

TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results. Starting from random initial play, TD...

Gerald Tesauro

ML
2002
ACM

133 views, Machine Learning

Finite-time Analysis of the Multiarmed Bandit Problem

13 years 7 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...
