Sciweavers

377 search results - page 45 / 76
» Optimizing Production Manufacturing Using Reinforcement Lear...
Sort
View
ICML
2010
IEEE
13 years 6 months ago
Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda
Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...
Carlton Downey, Scott Sanner
ECML
2005
Springer
14 years 2 months ago
Model-Based Online Learning of POMDPs
Abstract. Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...
Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony
ICML
2004
IEEE
14 years 9 months ago
Multi-task feature and kernel selection for SVMs
We compute a common feature selection or kernel selection configuration for multiple support vector machines (SVMs) trained on different yet inter-related datasets. The method is ...
Tony Jebara
WSC
2000
13 years 9 months ago
Simulation optimization of stochastic systems with integer variables by sequential linearization
Discrete-event simulation is widely used to analyse and improve the performance of manufacturing systems. The related optimization problem often includes integer design variables ...
S. J. Abspoel, L. F. P. Etman, J. Vervoort, J. E. ...
GECCO
2008
Springer
148views Optimization» more  GECCO 2008»
13 years 9 months ago
On the effects of node duplication and connection-oriented constructivism in neural XCSF
For artificial entities to achieve high degrees of autonomy they will need to display appropriate adaptability. In this sense adaptability includes representational flexibility gu...
Gerard David Howard, Larry Bull