Search Sciweavers | Sciweavers

473 search results - page 57 / 95

» Optimal policy switching algorithms for reinforcement learni...

GECCO
2011
Springer

276 views, Optimization

Evolution of reward functions for reinforcement learning

13 years 15 days ago

Download hampshire.edu

The reward functions that drive reinforcement learning systems are generally derived directly from the descriptions of the problems that the systems are being used to solve. In so...

Scott Niekum, Lee Spector, Andrew G. Barto

KDD
2002
ACM

147 views, Data Mining

Sequential cost-sensitive decision making with reinforcement learning

14 years 9 months ago

Download www.research.ibm.com

Recently, there has been increasing interest in the issues of cost-sensitive learning and decision making in a variety of applications of data mining. A number of approaches have ...

Edwin P. D. Pednault, Naoki Abe, Bianca Zadrozny

ATAL
2007
Springer

122 views, Intelligent Agents

Reducing the complexity of multiagent reinforcement learning

14 years 3 months ago

Download www.damas.ift.ulaval.ca

It is known that the complexity of the reinforcement learning algorithms, such as Q-learning, may be exponential in the number of environment’s states. It was shown, however, th...

Andriy Burkov, Brahim Chaib-draa

ML
1998
ACM

101 views, Machine Learning

Elevator Group Control Using Multiple Reinforcement Learning Agents

13 years 8 months ago

Download www.clear.rice.edu

Recent algorithmic and theoretical advances in reinforcement learning (RL) have attracted widespread interest. RL algorithms have appeared that approximate dynamic programming on an ...

Robert H. Crites, Andrew G. Barto

GECCO
2006
Springer

175 views, Optimization

A computational theory of adaptive behavior based on an evolutionary reinforcement mechanism

14 years 21 days ago

Download www.cs.bham.ac.uk

Two mathematical and two computational theories from the field of human and animal learning are combined to produce a more general theory of adaptive behavior. The cornerstone of ...

J. J. McDowell, Paul L. Soto, Jesse Dallery, Saule...
