MPI CUDA-aware support

Opening this issue to find a solution for CUDA-aware MPI support. This NVIDIA blog mentions that, depending on the MPI implementation, cudaSetDevice may need to be called before MPI_Init. However, Kokkos::initialize is supposed to be called after MPI_Init (reference), so Kokkos would select the device too late.

Possible solutions:

  • Do the GPU binding manually?
  • Drop CUDA-aware MPI support?
  • Patch only for Omni-Path, and hence for Jean-Zay.

The third option may be the most pragmatic solution for now.
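
For the first option, manual GPU binding could look roughly like the sketch below. It is only an illustration under assumptions: the helper names `local_rank_from_env` and `device_for_rank` are hypothetical, the node-local rank is read from environment variables set by the launcher (Open MPI, MVAPICH2 or Slurm here, other launchers may use different names), and ranks are assumed to be mapped round-robin onto the GPUs of a node. The CUDA, MPI and Kokkos calls appear only in comments so the helpers stay self-contained.

```cpp
#include <cstdlib>

// Returns the node-local rank read from the launcher's environment,
// or -1 if none of the known variables is set to a valid integer.
// Assumption: one of these variables is exported before MPI_Init.
inline int local_rank_from_env() {
    const char* names[] = {"OMPI_COMM_WORLD_LOCAL_RANK",  // Open MPI
                           "MV2_COMM_WORLD_LOCAL_RANK",   // MVAPICH2
                           "SLURM_LOCALID"};              // Slurm (srun)
    for (const char* name : names) {
        const char* value = std::getenv(name);
        if (value == nullptr) {
            continue;
        }
        char* end = nullptr;
        const long rank = std::strtol(value, &end, 10);
        if (end != value && *end == '\0' && rank >= 0) {
            return static_cast<int>(rank);
        }
    }
    return -1;
}

// Round-robin mapping of node-local ranks onto the visible devices.
// Returns -1 when the inputs do not allow a choice.
inline int device_for_rank(int local_rank, int device_count) {
    if (local_rank < 0 || device_count <= 0) {
        return -1;
    }
    return local_rank % device_count;
}

// Intended call order in main (not compiled here):
//
//   int device_count = 0;
//   cudaGetDeviceCount(&device_count);
//   const int device = device_for_rank(local_rank_from_env(), device_count);
//   if (device >= 0) cudaSetDevice(device);   // before MPI_Init
//   MPI_Init(&argc, &argv);
//   Kokkos::initialize(argc, argv);           // after MPI_Init
//   ...
//   Kokkos::finalize();
//   MPI_Finalize();
```

With this approach Kokkos must end up on the same device as the one chosen before MPI_Init, otherwise the binding is inconsistent. How to enforce that (for example through Kokkos' device-id initialization setting) is left open here and would need to be checked against the Kokkos version in use.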

Edited by thomas.padioleau