evaluate the impact of particle order on PME
The DD sorting does have an impact on PME performance, especially on GPUs. In current code this effect can be measured with single rank vs separate PME rank runs.
This impact should be evaluated across a range of input sizes (possibly densities?).