X10 for Productivity and Performance at Scale: A Submission to the 2012 HPC Class II Challenge

We implement all four HPC Class I benchmarks in X10: Global HPL, Global RandomAccess, EP Stream (Triad), and Global FFT. We also implement the Unbalanced Tree Search benchmark (UTS). We show performance results for these benchmarks running on an IBM Power 775 Supercomputer utilizing up to 47,040 Power7 cores. We believe that our UTS implementation demonstrates that X10 can deliver unprecedented productivity and performance at scale for unbalanced workloads.

The X10 tool chain and the benchmark codes are publicly available at http://x10-lang.org.

By: Olivier Tardieu, David Grove, Bard Bloom, David Cunningham, Benjamin Herta, Prabhanjan Kambadur, Vijay A. Saraswat, Avraham Shinnar, Mikio Takeuchi, Mandana Vaziri

Published in: RC25334 in 2012

rc25334.pdf
