Search Sciweavers | Sciweavers

377 search results - page 21 / 76

» Optimizing Production Manufacturing Using Reinforcement Lear...

208

click to vote

LION
2007
Springer

192views Optimization» more LION 2007»

Learning While Optimizing an Unknown Fitness Surface

16 years 26 days ago

Download www.science.unitn.it

This paper is about Reinforcement Learning (RL) applied to online parameter tuning in Stochastic Local Search (SLS) methods. In particular a novel application of RL is considered i...

Roberto Battiti, Mauro Brunato, Paolo Campigotto

claim paper

Read More »

159

Voted

ECML
2004
Springer

139views Machine Learning» more ECML 2004»

Batch Reinforcement Learning with State Importance

16 years 3 days ago

Download www.research.rutgers.edu

Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classiﬁer mapping states to actions....

Lihong Li, Vadim Bulitko, Russell Greiner

claim paper

Read More »

187

click to vote

ICML
2000
IEEE

165views Machine Learning» more ICML 2000»

A Bayesian Framework for Reinforcement Learning

15 years 11 months ago

Download www.ece.uvic.ca

The reinforcement learning problem can be decomposed into two parallel types of inference: (i) estimating the parameters of a model for the underlying process; (ii) determining be...

Malcolm J. A. Strens

claim paper

Read More »

195

Voted

ATAL
2007
Springer

181views Intelligent Agents» more ATAL 2007»

Multiagent reinforcement learning and self-organization in a network of agents

16 years 27 days ago

Download mas.cs.umass.edu

To cope with large scale, agents are usually organized in a network such that an agent interacts only with its immediate neighbors in the network. Reinforcement learning technique...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

158

Voted

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

« Prev « First page 21 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers