Project

General

Profile

Task #2818

bonded GPU kernel fusion

Added by Szilárd Páll 28 days ago. Updated 26 days ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
mdrun
Target version:
-
Difficulty:
uncategorized
Close

Description

The launch overhead of the bonded kernels often becomes so significant that it outweighs any benefit of GPU offload. This could be mitigated with a few optimizations: most importantly kernel fusion.

Multiple approaches are possible:
  • a simple decomposition of different types of bonded interactions over different blocks
  • a more locality aware-decomposition would however be beneficial: interactions that share the coordinates computed in the same block.

Additionally, update groups could also be implemented in the decomposition (e.g. through block sorting which should not be difficult if coordinate ranges are already the unit of work) to prioritize work on the critical path for staggered update as well as for DD runs.


Related issues

Related to GROMACS - Task #2694: bonded CUDA kernelsClosed

History

#1 Updated by Szilárd Páll 28 days ago

  • Related to Task #2694: bonded CUDA kernels added

#2 Updated by Szilárd Páll 26 days ago

  • Description updated (diff)

Also available in: Atom PDF