Task #2792: Improvement of PME gather and spread CUDA kernels
Update regression tests for new kernel flavours
We now have 4 potential code paths through the spread and gather. Once the tuning is done they will be automatically selected.
The current spread of regression tests will not test all the flavors of the spread and gather kernels, so any future changes could potentially break one of the code paths without being picked up by the regression tests.