Feature #3087

Feature #2816: GPU offload / optimization for update & constraints, buffer ops and multi-GPU communication

Feature #2915: GPU direct communications

Enable GPU peer-to-peer access

Added by Szilárd Páll about 1 month ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
mdrun
Target version:
Difficulty:
uncategorized
Close

Description

For efficient direct GPU communication, peer-to-peer access should be enabled between the GPUs used in the run.

However, this functionality should be implemented so that all (or most) errors are handled explicitly, and the function aborts the run only if an error known to be fatal is detected. Otherwise, since peer access is only a performance concern, the run should continue.

Related: the current working assumption is that, even if peer access is not enabled, direct copy should not be slower than staged copy. As we are not sure of this, we might want to consider disabling GPU direct copy if enabling peer access fails.
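
A minimal sketch of the behaviour described above, using the CUDA runtime API (host code only). This is not GROMACS code: the names PeerAccessResult and tryEnablePeerAccess are hypothetical, and the choice of which errors count as fatal (here, only failing to query or restore the current device) is an assumption made for illustration. All other failures are reported, cleared, and returned as "unavailable" so that the run can continue.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical result type: the caller decides what to do with each outcome.
enum class PeerAccessResult
{
    Enabled,     // peer access is enabled between all device pairs
    Unavailable, // at least one pair has no peer access; the run can continue
    Fatal        // assumed-fatal error; the caller should abort the run
};

// Report a non-fatal problem and clear the CUDA error state so that
// later error checks do not pick up this error again.
static void warnAndClear(const char* what, int dev, int peer, cudaError_t stat)
{
    std::fprintf(stderr, "Note: %s failed for GPU %d -> GPU %d: %s\n",
                 what, dev, peer, cudaGetErrorString(stat));
    cudaGetLastError();
}

// Hypothetical helper: try to enable peer access between every ordered
// pair of the given device IDs.
PeerAccessResult tryEnablePeerAccess(const std::vector<int>& deviceIds)
{
    if (deviceIds.size() < 2)
    {
        return PeerAccessResult::Enabled; // no pairs, nothing to enable
    }

    int originalDevice = 0;
    if (cudaGetDevice(&originalDevice) != cudaSuccess)
    {
        return PeerAccessResult::Fatal; // assumption: treated as fatal
    }

    bool allEnabled = true;
    for (int dev : deviceIds)
    {
        cudaError_t stat = cudaSetDevice(dev);
        if (stat != cudaSuccess)
        {
            warnAndClear("cudaSetDevice", dev, dev, stat);
            allEnabled = false;
            continue;
        }
        for (int peer : deviceIds)
        {
            if (peer == dev)
            {
                continue;
            }
            int canAccess = 0;
            stat = cudaDeviceCanAccessPeer(&canAccess, dev, peer);
            if (stat != cudaSuccess)
            {
                warnAndClear("cudaDeviceCanAccessPeer", dev, peer, stat);
                allEnabled = false;
                continue;
            }
            if (canAccess == 0)
            {
                allEnabled = false; // hardware/topology does not allow it
                continue;
            }
            stat = cudaDeviceEnablePeerAccess(peer, 0); // flags must be 0
            if (stat == cudaErrorPeerAccessAlreadyEnabled)
            {
                cudaGetLastError(); // benign: clear the error and carry on
            }
            else if (stat != cudaSuccess)
            {
                warnAndClear("cudaDeviceEnablePeerAccess", dev, peer, stat);
                allEnabled = false;
            }
        }
    }

    // Assumption: failing to restore the original device is treated as fatal.
    if (cudaSetDevice(originalDevice) != cudaSuccess)
    {
        return PeerAccessResult::Fatal;
    }
    return allEnabled ? PeerAccessResult::Enabled : PeerAccessResult::Unavailable;
}
```

With this shape, the caller would abort only on Fatal. On Unavailable it could either continue with direct copy or, if the concern about direct copy performance without peer access turns out to be justified, fall back to staged copy.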


Related issues

Related to GROMACS - Feature #2890: GPU Halo Exchange (New)
Related to GROMACS - Feature #2891: PME/PP GPU communications (New)

History

#1 Updated by Szilárd Páll about 1 month ago

#2 Updated by Szilárd Páll about 1 month ago
