Search Sciweavers | Sciweavers

377 search results - page 40 / 76

» Optimizing Production Manufacturing Using Reinforcement Lear...

202

click to vote

JCP
2007

143views more JCP 2007»

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

15 years 6 months ago

Download www.academypublisher.com

Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

claim paper

Read More »

212

click to vote

ICML
2001
IEEE

159views Machine Learning» more ICML 2001»

Direct Policy Search using Paired Statistical Tests

16 years 7 months ago

Download www.autonlab.org

Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...

Malcolm J. A. Strens, Andrew W. Moore

claim paper

Read More »

200

click to vote

GECCO
2004
Springer

100views Optimization» more GECCO 2004»

Transfer of Neuroevolved Controllers in Unstable Domains

16 years 8 days ago

Download www.cs.utexas.edu

In recent years, the evolution of artiﬁcial neural networks or neuroevolution has brought promising results in solving diﬃcult reinforcement learning problems. But, like standa...

Faustino J. Gomez, Risto Miikkulainen

claim paper

Read More »

178

click to vote

JAIR
2011

144views more JAIR 2011»

Non-Deterministic Policies in Markovian Decision Processes

15 years 1 months ago

Download www.jair.org

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

186

click to vote

CORR
2011
Springer

209views Education» more CORR 2011»

Close the Gaps: A Learning-while-Doing Algorithm for a Class of Single-Product Revenue Management Problems

14 years 10 months ago

Download www.stanford.edu

In this work, we consider a retailer selling a single product with limited on-hand inventory over a ﬁnite selling season. Customer demand arrives according to a Poisson process,...

Zizhuo Wang, Shiming Deng, Yinyu Ye

claim paper

Read More »

« Prev « First page 40 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers