Thanks for replying! I know about @mratsim's Weave and your Malebolgia.
But I am working on a BLAS backend in pure Nim, and for me it is better to use `createThread` directly, even though it has some overhead; the fork-join pattern suits some of the routines I have implemented well. Is there a guide or some code examples you could point me to so I can better understand the behavior of thread variables, or any gotchas I should look out for when using `createThread` directly with ARC/ORC? Most of my multithreading works fine, and I have an occasional allocation (using `allocShared`) for which I save the pointer so I can call `deallocShared` when done.
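
For concreteness, here is a minimal sketch of the kind of fork-join pattern I mean. It is not my actual code: the names `Task`, `worker`, `forkJoinScale`, and `NumThreads` are made up for illustration, and it assumes compiling with `--threads:on` (the default in Nim 2) under ARC/ORC.

```nim
# Illustrative sketch only; all names here are hypothetical.
# Compile with: nim c --threads:on --mm:arc example.nim

const NumThreads = 4

type
  Task = object
    data: ptr UncheckedArray[float64]  # shared buffer, not owned by the task
    lo, hi: int                        # half-open index range [lo, hi)
    alpha: float64

proc worker(t: Task) {.thread.} =
  # Each thread touches only its own slice, so no locking is needed.
  for i in t.lo ..< t.hi:
    t.data[i] = t.data[i] * t.alpha

proc forkJoinScale(n: int; alpha: float64): float64 =
  # Allocate on the shared heap and keep the pointer for deallocShared.
  let buf = cast[ptr UncheckedArray[float64]](allocShared(n * sizeof(float64)))
  for i in 0 ..< n:
    buf[i] = 1.0

  var threads: array[NumThreads, Thread[Task]]
  let chunk = (n + NumThreads - 1) div NumThreads

  # Fork: one thread per contiguous chunk of the buffer.
  for k in 0 ..< NumThreads:
    createThread(threads[k], worker,
                 Task(data: buf,
                      lo: min(k * chunk, n),
                      hi: min((k + 1) * chunk, n),
                      alpha: alpha))

  # Join: wait for every worker before reading the results.
  joinThreads(threads)

  for i in 0 ..< n:
    result += buf[i]
  deallocShared(buf)

when isMainModule:
  echo forkJoinScale(1000, 2.0)
```

The task object carries only plain values and a raw pointer, so nothing managed by ARC/ORC crosses the thread boundary in this sketch.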