GAP: A General Approach to Quantitative Diagnosis of Performance Problems

Quantitative performance diagnosis (QPD)provides explanations that quantify the impact of problem causes.An example of such an explanation is Increased web server tra .c accounts for 90%of the increase in LAN utilization,which in turn accounts for 20%of the increase in web response times.This paper describes GAP,a general approach to quantitative performance diagnosis.GAP has two parts:(1) an algorithm for computing quantitative performance diagnoses and (2)a framework for constructing diagnostic techniques that provides the basis for quanti .cations produced by the algorithm.The GAP algorithm makes use of a easurement navigation graph (MNG),a directed acyclic graph whose nodes are measurement variables and whose arcs have weights that quantify the e .ect of child variables (e.g.,LAN utilization)on parent variables (e.g.,response time).Various properties of the algorithm are established, especially that its quanti .cation of explanations can be interpreted as fractional contributions to the performance problem.Arc weights are computed by diagnostic techniques.A framework for developing diagnostic techniques is described that consists of (a)the choice of statistic (e.g.,mean,variance)to aggregate problem values and (b)the estimator of the statistic.The framework is applied to existing diagnostic techniques to assess their e .ectiveness and is used to construct a new diagnostic technique for a performance problems in a production computing systems.It is also used to show that for uniform magnitude performance problems (e.g.,a step),the standard deviation is preferred to the mean if the problem data have a coe .cient of variation no larger than (1 .f )/f ,where f is the fraction of the data containing the performance problem.

By: Joseph L Hellerstein

Published in: RC22682 in 2002

LIMITED DISTRIBUTION NOTICE:

This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties). I have read and understand this notice and am a member of the scientific community outside or inside of IBM seeking a single copy only.

RC22682.pdf

Questions about this service can be mailed to reports@us.ibm.com .