Sciweavers

1512 search results - page 232 / 303
» Qualitative reinforcement learning
Sort
View
BMEI
2008
IEEE
14 years 2 months ago
A Retrospective Comparative Study of Three Data Modelling Techniques in Anticoagulation Therapy
Three types of data modelling technique are applied retrospectively to individual patients’ anticoagulation therapy data to predict their future levels of anticoagulation. The r...
Simon McDonald, Costas S. Xydeas, Plamen P. Angelo...
SASO
2008
IEEE
14 years 2 months ago
Self-Adaptive Dissemination of Data in Dynamic Sensor Networks
The distribution of data in large dynamic wireless sensor networks presents a difficult problem due to node mobility, link failures, and traffic congestion. In this paper, we pr...
David Dorsey, Bjorn Jay Carandang, Moshe Kam, Chri...
ICRA
2007
IEEE
155views Robotics» more  ICRA 2007»
14 years 2 months ago
Value Function Approximation on Non-Linear Manifolds for Robot Motor Control
— The least squares approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...
Masashi Sugiyama, Hirotaka Hachiya, Christopher To...
ICANN
2007
Springer
14 years 2 months ago
Solving Deep Memory POMDPs with Recurrent Policy Gradients
Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...
Daan Wierstra, Alexander Förster, Jan Peters,...
CIMCA
2006
IEEE
14 years 1 months ago
Multi-Agent Coalition Formation for Long-Term Task or Mobile Network
Coalition formation is a process to form a group and solve a problem via cooperation. Because of the rising of network, each computing device can communicate through network. We c...
Hsiu-Hui Lee, Chung-Hsien Chen