In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...
Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...
Abstract--This paper tackles the theoretical performance analysis of widely-linear (WL) multiuser receivers for direct-sequence code-division multiple-access (DS-CDMA) systems, as ...
Angela Sara Cacciapuoti, Giacinto Gelli, Luigi Pau...
Many algorithms for independent component analysis (ICA) and blind source separation (BSS) can be considered particular instances of a criterion based on the sum of two terms: C(Y...
Over the past few years, the notion of stability in data clustering has received growing attention as a cluster validation criterion in a sample-based framework. However, recent w...
The finite sample properties of the Fourier estimator of integrated volatility under market microstructure noise are studied. Analytic expressions for the bias and the mean square...