Search Sciweavers | Sciweavers

508 search results - page 27 / 102

» Learning for stochastic dynamic programming

155

click to vote

COR
2007

133views more COR 2007»

Reverse logistics network design with stochastic lead times

15 years 7 months ago

Download www.ise.ufl.edu

This work is concerned with the efﬁcient design of a reverse logistics network using an extended version of models currently found in the literature. Those traditional, basic mo...

Kris Lieckens, Nico Vandaele

claim paper

Read More »

178

click to vote

ICML
2005
IEEE

159views Machine Learning» more ICML 2005»

Bounded real-time dynamic programming: RTDP with monotone upper bounds and performance guarantees

16 years 7 months ago

Download www.cs.cmu.edu

MDPs are an attractive formalization for planning, but realistic problems often have intractably large state spaces. When we only need a partial policy to get from a fixed start s...

H. Brendan McMahan, Maxim Likhachev, Geoffrey J. G...

claim paper

Read More »

176

click to vote

ICML
1995
IEEE

155views Machine Learning» more ICML 1995»

Stable Function Approximation in Dynamic Programming

16 years 7 months ago

Download www.ri.cmu.edu

The success ofreinforcement learninginpractical problems depends on the ability to combine function approximation with temporal di erence methods such as value iteration. Experime...

Geoffrey J. Gordon

claim paper

Read More »

192

click to vote

SIGDIAL
2010

158views Natural Language Processing» more SIGDIAL 2010»

Sparse Approximate Dynamic Programming for Dialog Management

15 years 4 months ago

Download www.sigdial.org

Spoken dialogue management strategy optimization by means of Reinforcement Learning (RL) is now part of the state of the art. Yet, there is still a clear mismatch between the comp...

Senthilkumar Chandramohan, Matthieu Geist, Olivier...

claim paper

Read More »

175

click to vote

CDC
2009
IEEE

143views Control Systems» more CDC 2009»

Parameter approximate dynamic optimization for PSO systems

15 years 10 months ago

Download www.nt.ntnu.no

— This paper presents a novel swarm approximate dynamic programming method (swarm-ADP) for parameter optimization of PSO systems, from the perspective of optimal control. Based o...

Qi Kang, Lei Wang, Derong Liu, Qidi Wu

claim paper

Read More »

« Prev « First page 27 / 102 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers