论文标题
拉格朗日运输模型的改进的多GPU并行化
Improved Multi-GPU parallelization of a Lagrangian Transport Model
论文作者
论文摘要
该报告强调了我们通过支持多个GPU的计算节点来改善GPU并行化的工作。但是,由于OpenACC中对多GPU的默认支持受到限制[6],因此当前的实现允许每个MPI进程仅访问一个GPU。因此,充分利用当前版本中多GPU节点的唯一方法是启动多个流程,从而增加了资源争议。我们研究了只有一个流程卸载到所有可用的GPU设备的好处。
This report highlights our work on improving GPU parallelization by supporting compute nodes with multiple GPUs. However, since the default support for multi-GPUs in OpenACC is limited[6], the current implementation allows each MPI process to access only a single GPU. Thus, the only way to take full advantage of multi-GPU nodes in the current version is to launch multiple processes, which increases resource contention. We investigated the benefits of having only one process offload to all available GPU devices.