Project

General

Profile

Task #3157

Feature #2816: GPU offload / optimization for update&constraits, buffer ops and multi-gpu communication

Feature #2891: PME/PP GPU communications

separate PME x receive sync

Added by Szilárd Páll about 1 month ago.

Status:
New
Priority:
Normal
Assignee:
Category:
mdrun
Target version:
-
Difficulty:
uncategorized
Close

Description

As agreed the data dependency sychronization should be implemented on the consumer task's end which is PME spread in the case of PME. PME-only ranks have the receive enqueue wait as soon as MPI returns. Consider assembling a list of events and passed to spread instead.
Consider whether having to receive from multiple PP ranks actually makes is more beneficial to overlap some receive with event wait enqueue.

Associated revisions

Revision c5595a8e (diff)
Added by Alan Gray 30 days ago

GPU Coordinate PME/PP Communications

Extends PmePpCommGpu class to provide PP-side support for coordinate
transfers from either GPU or CPU to PME task, and adds new
PmeCoordinateReceiverGpu class to recieve coordinate data directly to
the GPU on the PME task.

Implements part of #2817
Refs TODOs #3157 #3158 #3159

Change-Id: Iefa2bdfd9813282ad8b07feeb7691f16880e61a2

Also available in: Atom PDF