Project

General

Profile

Bug #3241

Bonded GPU kernel launched in the wrong stream with 1 PP + 1 PME rank

Added by Szilárd Páll about 1 year ago. Updated about 1 year ago.

Status:
Closed
Priority:
Normal
Category:
mdrun
Target version:
Affected version - extra info:
Affected version:
Difficulty:
uncategorized
Close

Description

Due to a confusion od the DOMAINDECOMP(cr) check, when there is only 1 PP and 1 PME rank (and the check evaluates to true), the bonded module gets initialized with a null stream and therefore ends up blocking the overlap of other operations when it gets launched.


Related issues

Related to GROMACS - Task #2395: break up commrecIn Progress

Associated revisions

Revision e8ea4480 (diff)
Added by Szilárd Páll about 1 year ago

Fix the GPU bonded stream with 1 PP + 1 PME rank

WIth 1 PP + 1 PME rank the GpuBonded constructor gets passed the
non-local nonbonded stream which is nullptr and as a result the bonded
kernel launch happens in the default stream blocking concurrent
kernel execution.
This change makes sure that only when there is PP domain decomposition
is the GpuBonded constructor passed the nonlocal stream.

Fixes #3241

Change-Id: I858401b78c620adc3bea176e40e6fa179e583483

History

#1 Updated by Szilárd Páll about 1 year ago

  • Status changed from New to In Progress

#2 Updated by Mark Abraham about 1 year ago

#3 Updated by Szilárd Páll about 1 year ago

  • Status changed from In Progress to Resolved

#4 Updated by Paul Bauer about 1 year ago

  • Status changed from Resolved to Closed

Also available in: Atom PDF