Running Data Grid applications such as High Energy Nuclear Physics (HENP) and weather modelling experiments involves working with huge data sets, ranging from hundreds of Terabytes to Petabytes in size, that are often distributed across wide area networks. Data replication is a useful technique for reducing latency across the communication networks over which the source data are accessed. However, recent studies have suggested that Data Grid network performance can be enhanced further by combining replication with CPU resource scheduling. As a starting point towards developing a multifaceted optimisation solution for Data Grids, this paper considers the effect of replication and storage parameter settings on Data Grid performance. The results presented are based on Data Grid models developed using the OPNET simulation package.
Ernest Sithole, Gerard P. Parr, Sally I. McClean