bgawrych commented on issue #18244: URL: https://github.com/apache/incubator-mxnet/issues/18244#issuecomment-636716666
@leezu > Thank you for looking into this! Do you have any insights into why this only affects MKL builds? This affects CPU (openBLAS) build as well. CPU build measurement: With ENABLE_TESTCOVERAGE=1 & unset OMP_NUM_THREADS: - test_operator.py::test_lstm_bidirectional – test time: 105.87s and 112.04s With ENABLE_TESTCOVERAGE=1 & OMP_NUM_THREADS=1: - test_operator.py::test_lstm_bidirectional – test time: 89.02s and 91.87s Without ENABLE_TESTCOVERAGE & unset OMP_NUM_THREADS: - test_operator.py::test_lstm_bidirectional – test time: 86.25s and 86.75s Without ENABLE_TESTCOVERAGE & OMP_NUM_THREADS=12: - test_operator.py::test_lstm_bidirectional – test time: 64.68s and 65.76s Last test case shows that openBLAS build have some problems with utilizing all threads. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org