Search Sciweavers | Sciweavers

121 search results - page 16 / 25

» Toward Off-Policy Learning Control with Function Approximati...

179

click to vote

GECCO
2007
Springer

235views Optimization» more GECCO 2007»

Expensive optimization, uncertain environment: an EA-based solution

16 years 26 days ago

Download www.cs.bham.ac.uk

Real life optimization problems often require finding optimal solution to complex high dimensional, multimodal problems involving computationally very expensive fitness function e...

Maumita Bhattacharya

claim paper

Read More »

160

click to vote

GECCO
2007
Springer

186views Optimization» more GECCO 2007»

ICSPEA: evolutionary five-axis milling path optimisation

16 years 26 days ago

Download www.cs.bham.ac.uk

ICSPEA is a novel multi-objective evolutionary algorithm which integrates aspects from the powerful variation operators of the Covariance Matrix Adaptation Evolution Strategy (CMA...

Jörn Mehnen, Rajkumar Roy, Petra Kersting, To...

claim paper

Read More »

214

click to vote

JAIR
2002

163views more JAIR 2002»

Efficient Reinforcement Learning Using Recursive Least-Squares Methods

15 years 6 months ago

Download www.jair.org

The recursive least-squares (RLS) algorithm is one of the most well-known algorithms used in adaptive filtering, system identification and adaptive control. Its popularity is main...

Xin Xu, Hangen He, Dewen Hu

claim paper

Read More »

193

Voted

AAAI
1998

181views Intelligent Agents» more AAAI 1998»

Applying Online Search Techniques to Continuous-State Reinforcement Learning

15 years 8 months ago

Download www.autonlab.org

In this paper, we describe methods for e ciently computing better solutions to control problems in continuous state spaces. We provide algorithms that exploit online search to boo...

Scott Davies, Andrew Y. Ng, Andrew W. Moore

claim paper

Read More »

186

Voted

ATAL
2008
Springer

123views Intelligent Agents» more ATAL 2008»

Sigma point policy iteration

15 years 8 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

claim paper

Read More »

« Prev « First page 16 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers