This paper presents the results of a performance evaluation of two kinds of MapReduce applications running on FutureGrid: a data-intensive application and a computation-intensive application. For this work, we construct a virtualized cluster system from a set of VM instances. We observe that the overall performance of the data-intensive application is strongly affected by the configuration of the VMs. This result can be used to identify the bottleneck of a MapReduce application running on a virtualized cluster system with various VM instances.
Yunhee Kang, Geoffrey C. Fox