Benchmarking CUDA Communication Primitives on High-Bandwidth Interconnects

This talk is related to work published in ICPE. You can find the paper here.