performance regression with SSE4.1
eb153 has introduced a performance regression in the non-bonded kernels of about 6-8% on SSE4.1 platforms (see log files attached).
The culprit seems to be the
gmx_invsqrt_pr() function in
include/gmx_simd_math_single.h which if swapped back to the previous SSE4.1 version, gets the performance back to the state previous to eb153.