Models of Parallel Applications with Large Computation and I/O Requirements

Copyright © (2002) by IEEE. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee.

A fundamental understanding of the interplay of computation and I/O activities in parallel applications that manipulate huge amounts of data is critical to achieving good application performance, as well as to identifying principles for the design and management of parallel system software. In this paper we exploit a formal model of the behavior of large-scale parallel applications based on observations regarding the interaction of CPU and I/O activity in a representative set of scientific codes. The model can be applied at various levels of characterization granularity. Results from the model are in good agreement with measurement data from a set of representative I/O intensive codes. A capacity planning example is provided to illustrate one of the potential uses of our methodology in practice. The model is also used for performance prediction via a set of functional forms that effectively forecast the scalability of the computation and I/O components of an application. The complexity of the functional form adopted affects the accuracy of performance prediction.

By: Emilia Rosti, Giuseppe Serazzi, Evgenia Smirni, Mark S. Squillante

Published in: IEEE Transactions on Software Engineering, volume 28, no. 3, pages 286-307, 2002

Please obtain a copy of this paper from your local library. IBM cannot distribute this paper externally.

Questions about this service can be mailed to reports@us.ibm.com.