Sciweavers

1234 search results - page 210 / 247
» Multi-criteria Reinforcement Learning
Sort
View
EUROGP
2009
Springer
130views Optimization» more  EUROGP 2009»
14 years 3 months ago
One-Class Genetic Programming
One-class classification naturally only provides one-class of exemplars, the target class, from which to construct the classification model. The one-class approach is constructed...
Robert Curry, Malcolm I. Heywood
BMEI
2008
IEEE
14 years 3 months ago
A Retrospective Comparative Study of Three Data Modelling Techniques in Anticoagulation Therapy
Three types of data modelling technique are applied retrospectively to individual patients’ anticoagulation therapy data to predict their future levels of anticoagulation. The r...
Simon McDonald, Costas S. Xydeas, Plamen P. Angelo...
SASO
2008
IEEE
14 years 3 months ago
Self-Adaptive Dissemination of Data in Dynamic Sensor Networks
The distribution of data in large dynamic wireless sensor networks presents a difficult problem due to node mobility, link failures, and traffic congestion. In this paper, we pr...
David Dorsey, Bjorn Jay Carandang, Moshe Kam, Chri...
ICRA
2007
IEEE
155views Robotics» more  ICRA 2007»
14 years 3 months ago
Value Function Approximation on Non-Linear Manifolds for Robot Motor Control
— The least squares approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...
Masashi Sugiyama, Hirotaka Hachiya, Christopher To...
ICANN
2007
Springer
14 years 3 months ago
Solving Deep Memory POMDPs with Recurrent Policy Gradients
Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...
Daan Wierstra, Alexander Förster, Jan Peters,...