MPI CUDA-aware support

Opening this issue to find a solution to MPI CUDA-aware support. In this NVIDIA blog it is mentioned the need to call cudaSetDevice before MPI_Init, depending on the implementation. However Kokkos::initialize is supposed to be called after MPI_Init, reference.

Solutions:

  • Should we manually do the GPU binding ?
  • Drop support for MPI CUDA-aware ?
  • Patch only for Omni-Path, hence Jean-Zay

Third option may be the most pragmatic solution for now.

Edited Nov 14, 2019 by thomas.padioleau
Assignee Loading
Time tracking Loading