PME gather CUDA kernel failing on Fermi
For inputs larger than ~350k atoms:
Error while launching kernel pme_gather_kernel: invalid configuration argument
Reproduced on C2070 and GTX 580.
Fix PME for large systems with Fermi GPUs
PME spread/gather CUDA kernel scheduling did not account for
compute capability limitations. Realistically this has only
caused it to fail on CC 2.x with input systems larger than
2^18 ~= 262k atoms. This is now fixed for all CUDA architectures.
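
To illustrate the kind of limit involved: on compute capability 2.x devices the x dimension of a launch grid is capped at 65535 blocks, and a 1D launch that exceeds it is rejected with "invalid configuration argument". The following is a minimal host-side sketch, not the actual GROMACS fix, of folding a block count into a 2D grid. The helper `splitBlocksIntoGrid`, the `GridDims` struct, and the value of 4 atoms per block used in the usage note are assumptions made for illustration; the source does not state the kernel's real block layout.

```cpp
#include <cassert>
#include <cstdint>

// Launch-grid extents in blocks; z is left at 1 in this sketch.
struct GridDims
{
    std::int64_t x;
    std::int64_t y;
};

// Maximum gridDim.x on compute capability 2.x devices.
// CC 3.0 and later allow 2^31 - 1 in the x dimension.
constexpr std::int64_t c_maxGridDimXFermi = 65535;

// Hypothetical helper: fold the number of blocks needed for atomCount atoms
// into an x-by-y grid such that x never exceeds maxGridDimX.
// The grid may contain slightly more blocks than needed, so the kernel has to
// compute a linear block index (blockIdx.y * gridDim.x + blockIdx.x) and
// return early when that index is out of range.
// The limit on gridDim.y is not checked here.
GridDims splitBlocksIntoGrid(std::int64_t atomCount, std::int64_t atomsPerBlock, std::int64_t maxGridDimX)
{
    assert(atomCount > 0 && atomsPerBlock > 0 && maxGridDimX > 0);
    // Blocks needed, rounding up.
    const std::int64_t blockCount = (atomCount + atomsPerBlock - 1) / atomsPerBlock;
    // Smallest number of rows that keeps each row within the x limit.
    const std::int64_t y = (blockCount + maxGridDimX - 1) / maxGridDimX;
    // Spread the blocks evenly over those rows, rounding up.
    const std::int64_t x = (blockCount + y - 1) / y;
    return { x, y };
}
```

With the assumed 4 atoms per block, 2^18 = 262144 atoms need 65536 blocks, one more than a 1D grid allows on CC 2.x; `splitBlocksIntoGrid(262144, 4, c_maxGridDimXFermi)` yields a 32768 x 2 grid instead.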
#8 Updated by Szilárd Páll over 2 years ago
Mark Abraham wrote:
NB we have the GMX_DEVELOPER_BUILD cmake configuration that could be used to enable such things. And of course we'd generate the contents of such a test system rather than store the coordinates.
That could indeed be useful.
On a side note, it seems to me that it would be quite appropriate for Google Test (and/or CTest) to support marking subsets of tests as non-compulsory, so that by default only a warning/note is issued when such a test fails; in our controlled CI environment these tests can be made compulsory, with the warnings reported as failures. This way a test that fails due to an out-of-memory error (because the user's browser chews up 1.65 of the 2 GB of GPU memory) won't mark a unit test as failed in the users' hands, but if it terminates with some weird error we may still learn about it. Can this be done with the available features?
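
The policy described above could be sketched roughly as follows. This is plain C++ and deliberately uses no Google Test or CTest API, since whether those tools offer such a feature is exactly the open question; the names `Verdict` and `classifyTestResult` and the `strictMode` flag are hypothetical.

```cpp
// Possible outcomes reported for a single test.
enum class Verdict
{
    Pass,
    Warning,
    Failure
};

// Hypothetical policy: a failing optional test is only a warning by default,
// but becomes a real failure in strict mode (e.g. in a controlled CI
// environment). Compulsory tests always fail hard.
constexpr Verdict classifyTestResult(bool passed, bool isOptional, bool strictMode)
{
    return passed ? Verdict::Pass
                  : (isOptional && !strictMode) ? Verdict::Warning : Verdict::Failure;
}
```

Under this sketch, an optional large-system test that runs out of GPU memory on a user's machine would be reported as `Verdict::Warning`, while the same failure in CI with `strictMode` set would be `Verdict::Failure`.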