https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80038
--- Comment #4 from Richard Biener <rguenth at gcc dot gnu.org> --- It might be that GCC is in error and that the ctor/dtor need to be arranged to run per thread. Who knows... You might want to check a competing Cilk+ implementation for this (I think the Intel compiler has Cilk+ support).