change rolling pruning scheduling with GPU update
When GPU update is offloaded the list re-pruning can no longer be done "for free" overlapped with CPU tasks, so we should probably just prune in one batch like we do on the CPU.
It might be worth to re-assess whether there is any inter-step overlap to be had (e.g. with other memset or H2D/D2H tasksk) and whether it is worth trading pruning kernel efficiency loss (due to smaller batch execution) with overlapping with other tasks.
#2 Updated by Szilárd Páll about 1 month ago
- Target version changed from 2020-rc1 to 2020
Bumped to 2020 final (in case if somebody can pick it up). Only affects performance (and possibly by not much).
Note: need to pass the simulationWorklaod to init_forcerec()->Nbnxm::init_nb_verlet()->setupDynamicPairlistPruning() and set the
listParams->numRollingPruningParts = 1.