We give the first rigorous upper bounds on the error of temporal difference (td) algorithms for policy evaluation as a function of the amount of experience. These upper bounds pr...
As energy-related costs have become a major economical factor for IT infrastructures and data-centers, companies and the research community are being challenged to find better an...
This paper addresses the problem of constructing good action selection policies for agents acting in partially observable environments, a class of problems generally known as Part...
This paper aims to forecast the economic impacts of changing land-use in UK uplands. We assume that farmers adaptively learn and respond to a dynamic economic environment. The main...
Nanlin Jin, Mette Termansen, Klaus Hubacek, Joseph...
Heterogeneous clusters and grid infrastructures are becoming increasingly popular. In these computing infrastructures, machines have different resources, including memory sizes, d...