Search Sciweavers | Sciweavers

377 search results - page 7 / 76

» Optimizing Production Manufacturing Using Reinforcement Lear...

288

click to vote

ICAART
2010
INSTICC

509views Intelligent Agents» more ICAART 2010»

Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning

16 years 3 months ago

Download arxiv.org

There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...

Christos Dimitrakakis

posted by olethros

Read More »

169

click to vote

CCGRID
2008
IEEE

127views Distributed And Parallel Com...» more CCGRID 2008»

Grid Differentiated Services: A Reinforcement Learning Approach

16 years 1 months ago

Download hal.inria.fr

—Large scale production grids are a major case for autonomic computing. Following the classical deﬁnition of Kephart, an autonomic computing system should optimize its own beha...

Julien Perez, Cécile Germain-Renaud, Bal&aa...

claim paper

Read More »

247

click to vote

AI
2002
Springer

171views Artificial Intelligence» more AI 2002»

Multiagent learning using a variable learning rate

15 years 6 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

218

click to vote

ISPE
2003

147views Distributed And Parallel Com...» more ISPE 2003»

A collaborative knowledge management system for concurrent design and manufacturing

15 years 8 months ago

Download www.wmstubblefield.com

ABSTRACT: Knowledge systems for scientific and engineering endeavors must be able to insure the accuracy, completeness, and validity of their contents. When designed as such, these...

A. H. Liszka, William A. Stubblefield, Stephen D. ...

claim paper

Read More »

170

click to vote

GECCO
2003
Springer

79views Optimization» more GECCO 2003»

Reinforcement Learning Estimation of Distribution Algorithm

15 years 12 months ago

Download www.iba.t.u-tokyo.ac.jp

Abstract. This paper proposes an algorithm for combinatorial optimizations that uses reinforcement learning and estimation of joint probability distribution of promising solutions ...

Topon Kumar Paul, Hitoshi Iba

claim paper

Read More »

« Prev « First page 7 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers