Search Sciweavers | Sciweavers

38 search results - page 3 / 8

» Improving reinforcement learning function approximators via ...

170

click to vote

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 7 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

175

Voted

ICRA
2009
IEEE

259views Robotics» more ICRA 2009»

Constructing action set from basis functions for reinforcement learning of robot control

16 years 1 months ago

Download robotics.aist-nara.ac.jp

Abstract— Continuous action sets are used in many reinforcement learning (RL) applications in robot control since the control input is continuous. However, discrete action sets a...

Akihiko Yamaguchi, Jun Takamatsu, Tsukasa Ogasawar...

claim paper

Read More »

167

Voted

ESANN
2003

152views Neural Networks» more ESANN 2003»

Improving iterative repair strategies for scheduling with the SVM

15 years 8 months ago

Download www2.in.tu-clausthal.de

The resource constraint project scheduling problem (RCPSP) is an NP-hard benchmark problem in scheduling which takes into account the limitation of resources’ availabilities in ...

Kai Gersmann, Barbara Hammer

claim paper

Read More »

192

Voted

IROS
2006
IEEE

190views Robotics» more IROS 2006»

Q-RAN: A Constructive Reinforcement Learning Approach for Robot Behavior Learning

16 years 23 days ago

Download www.aass.oru.se

Abstract— This paper presents a learning system that uses Qlearning with a resource allocating network (RAN) for behavior learning in mobile robotics. The RAN is used as a functi...

Jun Li, Achim J. Lilienthal, Tomás Mart&iac...

claim paper

Read More »

167

Voted

ATAL
2008
Springer

146views Intelligent Agents» more ATAL 2008»

Adaptive Kanerva-based function approximation for multi-agent systems

15 years 8 months ago

Download www.aamas-conference.org

In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving largescale instanc...

Cheng Wu, Waleed Meleis

claim paper

Read More »

« Prev « First page 3 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers