Search Sciweavers | Sciweavers

377 search results - page 23 / 76

» Optimizing Production Manufacturing Using Reinforcement Lear...

172

Voted

ICDIM
2007
IEEE

111views Information Technology» more ICDIM 2007»

Estimating product lifecycle cost using a hybrid approach

16 years 1 months ago

Download www.cais.ntu.edu.sg

It becomes clear for manufacturing companies that product lifecycle cost (LCC) is as crucial as product quality and functionality in deciding the success of a product in the marke...

Haifeng Liu, Vivekanand Gopalkrishnan, Wee Keong N...

claim paper

Read More »

187

Voted

ICML
1995
IEEE

184views Machine Learning» more ICML 1995»

Residual Algorithms: Reinforcement Learning with Function Approximation

16 years 7 months ago

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III

claim paper

Read More »

190

click to vote

ICML
2007
IEEE

172views Machine Learning» more ICML 2007»

Conditional random fields for multi-agent reinforcement learning

16 years 7 months ago

Download www.machinelearning.org

Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of obser...

Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...

claim paper

Read More »

168

Voted

COLT
2004
Springer

99views Machine Learning» more COLT 2004»

Reinforcement Learning for Average Reward Zero-Sum Games

16 years 5 days ago

Download www.ece.mcgill.ca

Abstract. We consider Reinforcement Learning for average reward zerosum stochastic games. We present and analyze two algorithms. The ﬁrst is based on relative Q-learning and the ...

Shie Mannor

claim paper

Read More »

242

Voted

JMLR
2010

148views more JMLR 2010»

A Generalized Path Integral Control Approach to Reinforcement Learning

15 years 1 months ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

« Prev « First page 23 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers