PreDatA - preparatory data analytics on peta-scale machines

Peta-scale scientific applications running on High End Computing (HEC) platforms can generate large volumes of data. For high-performance storage, and to be useful to science end users, such data must be organized in its layout, indexed, sorted, and otherwise manipulated for subsequent data presentation, visualization, and detailed analysis. In addition, scientists want to gain insights into selected data characteristics 'hidden' or 'latent' in the massive datasets while the data is being produced by simulations. PreDatA, short for Preparatory Data Analytics, is an approach for preparing and characterizing data while it is being produced by large-scale simulations running on peta-scale machines. By dedicating additional compute nodes on the peta-scale machine as staging nodes and staging the simulation's output data through these nodes, PreDatA can exploit their computational power to perform selected data manipulations with lower latency than attainable by first m...

Fang Zheng, Hasan Abbasi, Ciprian Docan, Jay F. Lo

Data Characteristics | Distributed And Parallel Computing | IPPS 2010 | Subsequent Data | Subsequent Data Presentation


Added	13 Feb 2011
Updated	13 Feb 2011
Type	Journal
Year	2010
Where	IPPS
Authors	Fang Zheng, Hasan Abbasi, Ciprian Docan, Jay F. Lofstead, Qing Liu, Scott Klasky, Manish Parashar, Norbert Podhorszki, Karsten Schwan, Matthew Wolf
