Sciweavers

175 search results - page 27 / 35
» Forgetting Reinforced Cases
DIS
2009
Springer
14 years 2 months ago
OMFP: An Approach for Online Mass Flow Prediction in CFB Boilers
Abstract. Fuel feeding and inhomogeneity of fuel typically cause process fluctuations in the circulating fluidized bed (CFB) boilers. If control systems fail to compensate the ...
Indre Zliobaite, Jorn Bakker, Mykola Pechenizkiy
ATAL
2004
Springer
14 years 27 days ago
When to Apply the Fifth Commandment: The Effects of Parenting on Genetic and Learning Agents
This paper explores hybrid agents that use a variety of techniques to improve their performance in an environment over time. We considered, specifically, genetic-learning-parentin...
Michael Berger, Jeffrey S. Rosenschein
ICML
2010
IEEE
13 years 8 months ago
Finite-Sample Analysis of LSTD
In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...
Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...
JCM
2006
13 years 7 months ago
A Learning-based Adaptive Routing Tree for Wireless Sensor Networks
One of the most common communication patterns in sensor networks is routing data to a base station, while the base station can be either static or mobile. Even in static cases, a s...
Ying Zhang, Qingfeng Huang
ATAL
2009
Springer
14 years 2 months ago
A mathematical analysis of collective cognitive convergence
Multi-agent systems are an attractive approach to modeling systems of interacting entities, but in some cases mathematical models of these systems can offer complementary benefits...
H. Van Dyke Parunak