Parallel Stencil Computation

Parallel Stencil Computation

High-performance, hybrid MPI+OpenMP implementation of a 2D stencil heat diffusion solver, explicitly optimized for the Leonardo DCGP cluster (Intel Sapphire Rapids).

Parallel ComputingPerformance OptimizationHPC
CMPIOpenMPSLURM

This project implements a numerical simulation of heat diffusion on a 2D plate using a stencil computation method, optimized for the Leonardo DCGP cluster (CINECA).


For a deep dive into the implementation details, compilation flags, and performance analysis, please refer to the technical documentation located in the GitHub repository (`docs/` folder). There you can find the following sections:
  • Parallelization Strategy: Detailed explanation of the domain decomposition, MPI communication, and OpenMP integration.
  • Compilation Guide: In-depth look at the Makefiles, compiler flags, and optimization levels.
  • Leonardo Cluster Specs: Technical specifications of the CINECA Leonardo DCGP partition used for testing.
  • Scalability Analysis: Strong and weak scaling results, including performance graphs and overlap efficiency analysis.
View on GitHub

Collaborators

Solo project