This repository contains the OpenACC NPB3.3 source from https://github.com/khaki3/acc-saturator and the translated OpenMP5 version of these benchmarks, as the main result of this work. The translation was primarily achieved using the Intel Application Migration Tool for OpenACC to OpenMP (Intel Technical Article, GitHub Repository).
- Source code of the OpenACC NPB3.3 implementation, as presented in https://github.com/khaki3/acc-saturator. This work is based on a previous work of Xu et. al. https://github.com/uhhpctools/openacc-npb.
- The Intel Application Migration Tool for OpenACC to OpenMP (Intel Technical Article, GitHub Repository).
- The result of applying the Intel migration tool to each file in the OMP-ACCrepository.
- Translation Process The Intel migration tool was applied to each source file using the following command:
intel-application-migration-tool-for-openacc-to-openmp -overwrite-input -suppress-openacc file.c- Note: The config/make.deffile was manually modified to support the compilation of OpenMP directives, particularly for oneAPI and NVHPC compilers:
# Intel compiler for Intel spir64
ifeq ($(CCC), icx-pvc)
    CC = icx
    CFLAGS = -O3 -fp-model fast -fno-alias -fiopenmp -fopenmp-targets=spir64 -mcmodel=medium -Wl,--no-relax
    CLINK = icx
    CLINKFLAGS = -O3 -fp-model fast -fno-alias -fiopenmp -fopenmp-targets=spir64 -mcmodel=medium -Wl,--no-relax
    UCC = icx
endif
# NVIDIA compiler for NVIDIA A100
ifeq ($(CCC), pgcc-a100)
    CC = pgcc
    CFLAGS = -mp=gpu -gpu=cc80 -Minfo=accel -Msafeptr -O3 -mcmodel=medium -I/usr/src/kernels/4.18.0-553.5.1.el8_10.x86_64/include
    CLINK = $(CC)
    CLINKFLAGS = -mp=gpu -gpu=cc80 -Minfo=accel -Msafeptr -O3 -mcmodel=medium -I/usr/src/kernels/4.18.0-553.5.1.el8_10.x86_64/include
    UCC = pgcc
endif
- The OpenMP5 NPB version after some manual modifications to ensure the compliance to NVIDIA and Intel compilers&GPUs, and ensure the correctness in both systems.
- The directory Manual_Changes_diff contains the documentation of the manual changes (were produced with vimdiff).
To compile and execute a benchmark (for example BT, class C, on NVIDIA A100 GPU, with NVHPC compiler):
cd NPB-OMP5-Offload-Translated-and-Manual-Fixing/BT/BT
CCC=pgcc-a100 make CLASS=C
./bt.C.x