Design and analysis of a multi-dimensional data sampling service for large scale data analysis applications