Thank you, Aaron, for your interest on the topic.
My main previous proposal still stands: remove bundled OpenMP submodule and
use OpenMP provided by the environment [1]. This might lead to performance
degradation in some cases where an old OpenMP library is used or thread
affinity wasn't set prope
Thanks Aaron for the feedback.
> As for your next steps, would you propose that cmake be brought up to parity?
Yes. sse2 in cmake vs sse3 in make is a minor example without high impact.
There are others.
> It seems strange that it causes slowness and if so, it shouldn't be
> recommended for no