anirudh2290 commented on issue #13449: significant performance regression in 
SpMV
URL: 
https://github.com/apache/incubator-mxnet/issues/13449#issuecomment-443388522
 
 
   Found the root cause of the issue: After the PR: #12380 , omp_thread_max_ is 
mutated in set_reserve_cores. This means for each gpu worker the 
omp_thread_max_ will keep dropping. For 8 GPU workers, it drops till it it is 
1. After this, the dot operator execution internally calls 
`GetRecommendedOMPThreadCount` which `return omp_thread_max_` which is 1. Thus 
the dot operator executes on a single thread. For now, reverting the PR to the 
old behavior is a good option. We should also try to understand more on cause 
of the segfault which was the reason for the PR #12380 and come up with a 
different fix.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to