Search Sciweavers | Sciweavers

84 search results - page 5 / 17

» Integrating Reinforcement Learning, Bidding and Genetic Algo...

273

click to vote

GECCO
2011
Springer

276views Optimization» more GECCO 2011»

Evolution of reward functions for reinforcement learning

14 years 9 months ago

Download hampshire.edu

The reward functions that drive reinforcement learning systems are generally derived directly from the descriptions of the problems that the systems are being used to solve. In so...

Scott Niekum, Lee Spector, Andrew G. Barto

claim paper

Read More »

124

click to vote

ATAL
2004
Springer

221views Intelligent Agents» more ATAL 2004»

When to Apply the Fifth Commandment: The Effects of Parenting on Genetic and Learning Agents

15 years 11 months ago

Download leibniz.cs.huji.ac.il

This paper explores hybrid agents that use a variety of techniques to improve their performance in an environment over time. We considered, speciﬁcally, geneticlearning-parentin...

Michael Berger, Jeffrey S. Rosenschein

claim paper

Read More »

140

click to vote

GECCO
2008
Springer

136views Optimization» more GECCO 2008»

On the genetic programming of time-series predictors for supply chain management

15 years 7 months ago

Download www.cs.bham.ac.uk

Single and multi-step time-series predictors were evolved for forecasting minimum bidding prices in a simulated supply chain management scenario. Evolved programs were allowed to ...

Alexandros Agapitos, Matthew Dyson, Jenya Kovalchu...

claim paper

Read More »

133

click to vote

AAAI
2007

142views Intelligent Agents» more AAAI 2007»

Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison

15 years 8 months ago

Download staff.science.uva.nl

Reinforcement learning (RL) methods have become popular in recent years because of their ability to solve complex tasks with minimal feedback. Both genetic algorithms (GAs) and te...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

159

click to vote

ICONIP
2007

147views Information Technology» more ICONIP 2007»

Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents

15 years 7 months ago

Download www.nc.irp.oist.jp

The aim of the Cyber Rodent project [1] is to elucidate the origin of our reward and aﬀective systems by building artiﬁcial agents that share the natural biological constraints...

Eiji Uchibe, Kenji Doya

claim paper

Read More »

« Prev « First page 5 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers