anirudh2290 commented on issue #13449: significant performance regression in SpMV
URL: https://github.com/apache/incubator-mxnet/issues/13449#issuecomment-443388522

Found the root cause of the issue: since PR #12380, `omp_thread_max_` is mutated in `set_reserve_cores`. This means that each GPU worker lowers `omp_thread_max_` further. With 8 GPU workers, it drops until it reaches 1. After this, the dot operator internally calls `GetRecommendedOMPThreadCount`, which returns `omp_thread_max_`, which is now 1. Thus the dot operator executes on a single thread.

For now, reverting the PR to restore the old behavior is a good option. We should also try to better understand the cause of the segfault that motivated PR #12380, and come up with a different fix.
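
To illustrate the mechanism, here is a minimal C++ sketch. It is a simplified, hypothetical model and not the actual MXNet source: the class names `MutatingOmp` and `NonMutatingOmp`, the two `Simulate...` helpers, and the assumption that worker `w` calls `set_reserve_cores(w)` with a cumulative count are all made up for illustration. Only the names `omp_thread_max_`, `set_reserve_cores` and `GetRecommendedOMPThreadCount` come from the comment above.

```cpp
#include <algorithm>
#include <cassert>

// Hypothetical model of the behavior described above: every call to
// set_reserve_cores permanently lowers the stored maximum.
class MutatingOmp {
 public:
  explicit MutatingOmp(int hw_threads) : omp_thread_max_(hw_threads) {}
  void set_reserve_cores(int cores) {
    // The reduction is applied to the already-reduced value, so it compounds.
    omp_thread_max_ = std::max(omp_thread_max_ - cores, 1);
  }
  int GetRecommendedOMPThreadCount() const { return omp_thread_max_; }

 private:
  int omp_thread_max_;
};

// Hypothetical alternative: keep the maximum fixed and subtract the
// reservation only when a recommendation is requested.
class NonMutatingOmp {
 public:
  explicit NonMutatingOmp(int hw_threads) : omp_thread_max_(hw_threads) {}
  void set_reserve_cores(int cores) { reserve_cores_ = cores; }
  int GetRecommendedOMPThreadCount() const {
    return std::max(omp_thread_max_ - reserve_cores_, 1);
  }

 private:
  int omp_thread_max_;
  int reserve_cores_ = 0;
};

// Assumption: worker w (1-based) reserves a cumulative total of w cores.
int SimulateMutating(int hw_threads, int workers) {
  assert(hw_threads >= 1 && workers >= 0);
  MutatingOmp omp(hw_threads);
  for (int w = 1; w <= workers; ++w) omp.set_reserve_cores(w);
  return omp.GetRecommendedOMPThreadCount();
}

int SimulateNonMutating(int hw_threads, int workers) {
  assert(hw_threads >= 1 && workers >= 0);
  NonMutatingOmp omp(hw_threads);
  for (int w = 1; w <= workers; ++w) omp.set_reserve_cores(w);
  return omp.GetRecommendedOMPThreadCount();
}
```

Under these assumptions, `SimulateMutating(32, 8)` ends at 1 thread, while `SimulateNonMutating(32, 8)` still recommends 24. This is only meant to show why compounding the reduction is harmful; it says nothing about the segfault that PR #12380 was addressing.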
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

With regards,
Apache Git Services
