Performance Characterization and Micro-Architecture Exploration of a Data Mining Application via Hardware Based Performance Monitoring and Simulation

Internal complexity in modern microprocessors increasingly hides how programs actually execute. Increasing use of techniques that include the use of caches and out-of-order execution to increase performance results in non-deterministic execution. Trying to predict the actual performance of a program is no longer just adding up the execution times of the instructions and the memory references. This paper explores the use of two approaches to examine the execution of a program on a superscalar processor. The first approach utilizes hardware based performance monitoring to examine the execution of a widely used data mining program using a synthetic benchmark program for performance bottlenecks. The second approach explores the use of a micro-architecture simulator to explore the impact of micro-architecture changes on the performance of the same data mining application. Impact of the structure of the source code on the amount of instruction level parallelism is also explored in the micro-architecture simulator by studying the change in performance from a source level change. Results from both approaches are discussed and changes to the micro-architecture are proposed.

By: Mathew S. Thoennes

Published in: RC22223 in 2002

LIMITED DISTRIBUTION NOTICE:

This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties). I have read and understand this notice and am a member of the scientific community outside or inside of IBM seeking a single copy only.

Rc22223.pdf

Questions about this service can be mailed to reports@us.ibm.com .