Buch, Englisch, 244 Seiten, Format (B × H): 156 mm x 234 mm, Gewicht: 454 g
Programming and Applications
Buch, Englisch, 244 Seiten, Format (B × H): 156 mm x 234 mm, Gewicht: 454 g
Reihe: Chapman & Hall/CRC Computational Science
ISBN: 978-1-138-37268-9
Verlag: Taylor & Francis Ltd
High Performance Computing: Programming and Applications presents techniques that address new performance issues in the programming of high performance computing (HPC) applications. Omitting tedious details, the book discusses hardware architecture concepts and programming techniques that are the most pertinent to application developers for achieving high performance. Even though the text concentrates on C and Fortran, the techniques described can be applied to other languages, such as C++ and Java.
Drawing on their experience with chips from AMD and systems, interconnects, and software from Cray Inc., the authors explore the problems that create bottlenecks in attaining good performance. They cover techniques that pertain to each of the three levels of parallelism:
- Message passing between the nodes
- Shared memory parallelism on the nodes or the multiple instruction, multiple data (MIMD) units on the accelerator
- Vectorization on the inner level
After discussing architectural and software challenges, the book outlines a strategy for porting and optimizing an existing application to a large massively parallel processor (MPP) system. With a look toward the future, it also introduces the use of general purpose graphics processing units (GPGPUs) for carrying out HPC computations. A companion website at www.hybridmulticoreoptimization.com contains all the examples from the book, along with updated timing results on the latest released processors.
Zielgruppe
Professional and Professional Practice & Development
Autoren/Hrsg.
Fachgebiete
Weitere Infos & Material
Introduction. Multi-Core Architectures for the Applications Programmer. Compiling for Multi-Core Architectures. Programming for Cache-Based Architectures. Programming for DDE Instructions. Programming for Distributed Memory Clusters. Programming for Multi-Core Distributed Memory Clusters. Using OpenMP and Pthreads across the Cores within the Node. What the Programmer Needs to Do? Message Passing Issues. Performance Analysis. Application Analysis.