Sciweavers

377 search results - page 23 / 76
» Optimizing Production Manufacturing Using Reinforcement Lear...
Sort
View
ICDIM
2007
IEEE
14 years 1 months ago
Estimating product lifecycle cost using a hybrid approach
It becomes clear for manufacturing companies that product lifecycle cost (LCC) is as crucial as product quality and functionality in deciding the success of a product in the marke...
Haifeng Liu, Vivekanand Gopalkrishnan, Wee Keong N...
ICML
1995
IEEE
14 years 8 months ago
Residual Algorithms: Reinforcement Learning with Function Approximation
A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...
Leemon C. Baird III
ICML
2007
IEEE
14 years 8 months ago
Conditional random fields for multi-agent reinforcement learning
Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of obser...
Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...
COLT
2004
Springer
14 years 23 days ago
Reinforcement Learning for Average Reward Zero-Sum Games
Abstract. We consider Reinforcement Learning for average reward zerosum stochastic games. We present and analyze two algorithms. The first is based on relative Q-learning and the ...
Shie Mannor
JMLR
2010
148views more  JMLR 2010»
13 years 2 months ago
A Generalized Path Integral Control Approach to Reinforcement Learning
With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...
Evangelos Theodorou, Jonas Buchli, Stefan Schaal