Optimized primitives for collective multi-GPU communication.
You can load the modules by:
module load modtree/gpu module load nccl