Goal-directed Performance Tuning for Scientific Applications

Author : Tien-Pao Shih
Total Pages : 346
Release : 1996
ISBN-10 : UOM:39015037838441

Book Synopsis: Goal-directed Performance Tuning for Scientific Applications by Tien-Pao Shih

Goal-directed Performance Tuning for Scientific Applications was written by Tien-Pao Shih and released in 1996, with a total of 346 pages. Abstract: "Performance tuning, as carried out by compiler designers and application programmers to close the performance gap between the achievable peak and delivered performance, becomes increasingly important and challenging as the microprocessor speeds and system sizes increase. However, although performance tuning on scientific codes usually deals with relatively small program regions, it is not generally known how to establish a reasonable performance objective and how to efficiently achieve this objective. We suggest a goal-directed approach and develop such an approach for each of three major system performance components: central processor unit (CPU) computation, memory accessing, and communication. For the CPU, we suggest using a machine-application performance model that characterizes workloads on four key function units (memory, floating-point, issue, and a virtual 'dependence unit') to produce an upper bound performance objective, and derive a mechanism to approach this objective. A case study shows an average 1.79x speedup achieved by using this approach for the Livermore Fortran Kernels 1-12 running on the IBM RS/6000. For memory, as compulsory and capacity misses are relatively easy to characterize, we derive a method for building application-specific cache behavior models that report the number of misses for all three types of conflict misses: self, cross, and ping-pong. The method uses averaging concepts to determine the expected number of cache misses instead of attempting to count them exactly in each instance, which provides a more rapid, yet realistic assessment of expected cache behavior.
For each type of conflict miss, we propose a reduction method that uses one or a combination of three techniques based on modifying or exploiting data layout: array padding, initial address adjustment, and access resequencing. A case study using a blocked matrix multiply program as an example shows that the model is within 11% of the simulation results, and that each type of conflict miss can be effectively reduced or completely eliminated. For communication in shared memory parallel systems, we derive an array grouping mechanism and related loop transformations to reduce communication caused by the problematic case of nonconsecutive references to shared arrays and prove several theorems that determine when and where to apply this technique. The experimental results show a 15% reduction in communication, a 40% reduction in data subcache misses, and an 18% reduction in maximum user time for a finite element application on a 56 processor KSR1 parallel computer."
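The abstract names array padding as one way to reduce conflict misses. The sketch below is not the author's cache model; it is a minimal illustration that simulates a direct-mapped cache and counts misses for repeated walks down one column of a row-major array, with and without padding. The function name `count_misses` and all cache parameters (256 lines of 32 bytes, 8-byte elements) are assumptions chosen for the example.

```python
def count_misses(rows, cols, pad, cache_lines=256, line_bytes=32,
                 elem_bytes=8, passes=2):
    """Count misses in a simulated direct-mapped cache.

    Walks down column 0 of a row-major array `passes` times.
    `pad` is the number of extra elements appended to every row.
    All parameters are illustrative assumptions, not values from the source.
    """
    tags = [None] * cache_lines              # block currently held by each cache line
    misses = 0
    row_stride = (cols + pad) * elem_bytes   # bytes between consecutive rows
    for _ in range(passes):
        for i in range(rows):
            addr = i * row_stride            # address of element [i][0], array based at 0
            block = addr // line_bytes       # memory block containing that address
            index = block % cache_lines      # direct-mapped placement
            if tags[index] != block:
                tags[index] = block          # evict whatever was there
                misses += 1
    return misses


if __name__ == "__main__":
    # Row stride of 1024 * 8 bytes equals the cache size, so every row
    # maps to the same cache line and each access evicts the previous one.
    print("unpadded:", count_misses(rows=64, cols=1024, pad=0))
    # Four padding elements shift each row by one cache line, so the
    # 64 rows occupy 64 distinct lines and the second pass hits.
    print("padded:  ", count_misses(rows=64, cols=1024, pad=4))
```

Under these assumed parameters the unpadded walk misses on every access, while the padded walk incurs only the first-pass (compulsory) misses. A real tuning effort would also have to account for associativity, other arrays sharing the cache, and the extra memory that padding consumes.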


Books Related to Goal-directed Performance Tuning for Scientific Applications

Goal-directed Performance Tuning for Scientific Applications
Language: en
Pages: 346
Authors: Tien-Pao Shih
Categories: Cache memory
Type: BOOK - Published: 1996 - Publisher:

Performance Evaluation and Benchmarking with Realistic Applications
Language: en
Pages: 316
Authors: Rudolf Eigenmann
Categories: Business & Economics
Type: BOOK - Published: 2001 - Publisher: MIT Press

The book discusses rationales for creating and updating benchmarks, the use of benchmarks in academic research, benchmarking methodologies, the relation of SPEC
Improving Cache Performance Via Active Management
Language: en
Pages: 270
Authors: Edward S. Tam
Categories:
Type: BOOK - Published: 1999 - Publisher:

Performance Tuning of Scientific Applications
Language: en
Pages: 400
Authors: David H. Bailey
Categories: Computers
Type: BOOK - Published: 2010-11-23 - Publisher: CRC Press

With contributions from some of the most notable experts in the field, Performance Tuning of Scientific Applications presents current research in performance an
Wavefront Cache-friendly Algorithm for Compact Numerical Schemes
Language: en
Pages: 14
Authors: Alex Povitsky
Categories: Algorithms
Type: BOOK - Published: 1999 - Publisher:

Abstract: "Compact numerical schemes provide high-order solution of PDEs with low dissipation and dispersion. Computer implementation of these schemes requires