Optimizing HPC Applications with Intel Cluster Tools: Hunting Petaflops

Alexander Supalov, Andrey Semin, Christopher Dahnken, Michael Klemm


Optimizing HPC Applications with Intel® Cluster Tools takes the reader on a tour of the fast-growing area of high performance computing and the optimization of hybrid programs. These programs typically combine distributed memory and shared memory programming models and use the Message Passing Interface (MPI) and OpenMP for multi-threading to achieve the ultimate goal of high performance at low power consumption on enterprise-class workstations and compute clusters.

The book focuses on optimization for clusters consisting of the Intel® Xeon processor, but the optimization methodologies also apply to the Intel® Xeon Phi™ coprocessor and heterogeneous clusters mixing both architectures. Besides the tutorial and reference content, the authors address and refute many myths and misconceptions surrounding the topic. The text is augmented and enriched by descriptions of real-life situations.

What you’ll learn

  • Practical, hands-on examples show how to make clusters and workstations based on Intel® Xeon processors and Intel® Xeon Phi™ coprocessors "sing" in Linux environments

  • How to master the synergy of Intel® Parallel Studio XE 2015 Cluster Edition, which includes Intel® Composer XE, Intel® MPI Library, Intel® Trace Analyzer and Collector, Intel® VTune™ Amplifier XE, and many other useful tools

  • How to achieve immediate and tangible optimization results while refining your understanding of software design principles

Who this book is for

Software professionals will use this book to design, develop, and optimize their parallel programs on Intel platforms. Students of computer science and engineering will value the book as a comprehensive reader, suitable to many optimization courses offered around the world. The novice reader will enjoy a thorough grounding in the exciting world of parallel computing.

Table of Contents

Foreword by Bronis de Supinski, CTO, Livermore Computing, LLNL

Introduction

Chapter 1: No Time to Read This Book?

Chapter 2: Overview of Platform Architectures

Chapter 3: Top-Down Software Optimization

Chapter 4: Addressing System Bottlenecks

Chapter 5: Addressing Application Bottlenecks: Distributed Memory

Chapter 6: Addressing Application Bottlenecks: Shared Memory

Chapter 7: Addressing Application Bottlenecks: Microarchitecture

Chapter 8: Application Design Considerations
