Sciweavers

1233 search results - page 180 / 247
» Feudal Reinforcement Learning
Sort
View
ICML
1994
IEEE
13 years 11 months ago
Learning Without State-Estimation in Partially Observable Markovian Decision Processes
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...
Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...
DIS
2009
Springer
14 years 2 months ago
OMFP: An Approach for Online Mass Flow Prediction in CFB Boilers
Abstract. Fuel feeding and inhomogeneity of fuel typically cause process fluctuations in the circulating fluidized bed (CFB) boilers. If control systems fail to compensate the ...
Indre Zliobaite, Jorn Bakker, Mykola Pechenizkiy
EUROGP
2009
Springer
130views Optimization» more  EUROGP 2009»
14 years 2 months ago
One-Class Genetic Programming
One-class classification naturally only provides one-class of exemplars, the target class, from which to construct the classification model. The one-class approach is constructed...
Robert Curry, Malcolm I. Heywood
BMEI
2008
IEEE
14 years 2 months ago
A Retrospective Comparative Study of Three Data Modelling Techniques in Anticoagulation Therapy
Three types of data modelling technique are applied retrospectively to individual patients’ anticoagulation therapy data to predict their future levels of anticoagulation. The r...
Simon McDonald, Costas S. Xydeas, Plamen P. Angelo...
SASO
2008
IEEE
14 years 2 months ago
Self-Adaptive Dissemination of Data in Dynamic Sensor Networks
The distribution of data in large dynamic wireless sensor networks presents a difficult problem due to node mobility, link failures, and traffic congestion. In this paper, we pr...
David Dorsey, Bjorn Jay Carandang, Moshe Kam, Chri...