Task #2519

Feature #2054: PME on GPU

Task #2453: PME OpenCL porting effort

Improve/remove PME OpenCL kernel barriers

Added by Aleksei Iupinov 5 months ago. Updated 9 days ago.

Status: New
Priority: High
Assignee: -
Category: -
Target version: -
Difficulty: hard

Description

The PME OpenCL kernels currently have additional synchronisation points compared to the CUDA ones.
Some of those barriers should probably depend on the minimal execution width (e.g. subgroup size?).
It might also be that some are not needed at all. The purpose of this issue is to track all of them.
Relaxing any barrier requires rerunning the Ewald unit tests on all supported and relevant platforms.
Hence it is probably beneficial to achieve correctness on Intel GPUs first and only then start changing the barriers.
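
To make the execution-width idea concrete, here is a minimal host-side sketch in C. It is not taken from the GROMACS source: the helper names, the `PME_SPAN_NEEDS_BARRIER` macro and the notion of a "sync span" are all made up for illustration. It also rests on an assumption that would have to be verified per platform, namely that work-items within one execution width (subgroup) do not need a work-group barrier to see each other's local-memory writes. The OpenCL standard does not guarantee this in general, which is why every relaxed barrier still needs the Ewald unit tests rerun.

```c
#include <assert.h>
#include <stdio.h>

/* Hypothetical helper, not a GROMACS function.
 * sync_span:       number of consecutive work-items that exchange data
 *                  through local memory at one synchronisation point.
 * execution_width: minimal number of work-items the device executes
 *                  together (e.g. the subgroup size).
 * Returns 1 if the exchange may cross an execution-width boundary, so
 * the work-group barrier has to stay; returns 0 if the barrier is only
 * a candidate for relaxation (still subject to testing). */
static int barrier_required(int sync_span, int execution_width)
{
    assert(sync_span > 0 && execution_width > 0);
    return sync_span > execution_width;
}

/* Formats a kernel build option carrying that decision, so that kernel
 * source could guard a barrier with #if. The macro name is invented.
 * Returns the number of characters written, as snprintf does. */
static int format_barrier_option(char *buf, size_t size,
                                 int sync_span, int execution_width)
{
    return snprintf(buf, size, "-DPME_SPAN_NEEDS_BARRIER=%d",
                    barrier_required(sync_span, execution_width));
}
```

With a sync span of 32 and an execution width of 16, the helper reports that the barrier must stay; with an execution width of 32 or 64 the barrier becomes a candidate for removal or for a cheaper subgroup-level synchronisation.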


Related issues

Related to GROMACS - Task #2516: Support PME OpenCL execution width < 16 (New)

History

#1 Updated by Aleksei Iupinov 5 months ago

  • Private changed from Yes to No

#2 Updated by Aleksei Iupinov 5 months ago

  • Related to Task #2516: Support PME OpenCL execution width < 16 added

#3 Updated by Gerrit Code Review Bot 9 days ago

Gerrit received a related patchset '1' for Issue #2519.
Uploader: Szilárd Páll
Change-Id: gromacs~master~I4c8f2cff405cd3044bd60b99f01bcdd918dc5d0e
Gerrit URL: https://gerrit.gromacs.org/8512
