Performance of NAS Parallel Benchmark LU on IBM SP Systems

In this paper, we describe and compare the performance of the NAS Parallel Benchmark LU on five parallel systems spanning two generations of the IBM Scalable POWERparallel architecture. The SP systems we consider are the SP1-TB0, SP1-TB2, SP2-TN, SP2-TN2, and SP2-WN. We describe highlights of our parallel implementation for distributed-memory systems. We also discuss the optimization steps we used to achieve good performance on individual processors and to reduce communication overhead. (ScalParSys)

By: Vijay K. Naik

Published in: RC20046 in 1995

LIMITED DISTRIBUTION NOTICE:

This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties).

4260.ps.gz

Questions about this service can be mailed to reports@us.ibm.com.