Search Sciweavers | Sciweavers

463 search results - page 48 / 93

» Localizing Search in Reinforcement Learning

184

click to vote

AI
1999
Springer

129views Artificial Intelligence» more AI 1999»

Learning Action Strategies for Planning Domains

15 years 7 months ago

Download www.aiai.ed.ac.uk

There are many different approaches to solving planning problems, one of which is the use of domain specific control knowledge to help guide a domain independent search algorithm. ...

Roni Khardon

claim paper

Read More »

194

click to vote

ATAL
2008
Springer

146views Intelligent Agents» more ATAL 2008»

Adaptive Kanerva-based function approximation for multi-agent systems

15 years 9 months ago

Download www.aamas-conference.org

In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving largescale instanc...

Cheng Wu, Waleed Meleis

claim paper

Read More »

188

click to vote

GECCO
2006
Springer

177views Optimization» more GECCO 2006»

Hyper-ellipsoidal conditions in XCS: rotation, linear approximation, and solution structure

15 years 11 months ago

Download www.eskimo.com

The learning classifier system XCS is an iterative rulelearning system that evolves rule structures based on gradient-based prediction and rule quality estimates. Besides classifi...

Martin V. Butz, Pier Luca Lanzi, Stewart W. Wilson

claim paper

Read More »

188

click to vote

AI
2002
Springer

117views Artificial Intelligence» more AI 2002»

Programming backgammon using self-teaching neural nets

15 years 7 months ago

Download www.math-info.univ-paris5.fr

TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results. Starting from random initial play, TD...

Gerald Tesauro

claim paper

Read More »

170

click to vote

ML
2002
ACM

133views Machine Learning» more ML 2002»

Finite-time Analysis of the Multiarmed Bandit Problem

15 years 6 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

claim paper

Read More »

« Prev « First page 48 / 93 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers