> master:
> Run operation 40000000 iterations 12.851414 s, 3112K operations/s, 321ns per coroutine
>
> paolo:
> Run operation 40000000 iterations 11.951720 s, 3346K operations/s, 298ns per coroutine

Nice. :) Can you please try "coroutine: Use __thread … " together, too? I still see 11% time spent in pthread_getspecific, and I get ~10% more indeed if I apply it here (my times are 191/160/145).

Paolo