[GitHub] [incubator-mxnet] anko-intel commented on issue #17971: DOT product too slow on CPU and GPU compared to np and pytorch

GitBox Tue, 26 May 2020 20:19:27 -0700


anko-intel commented on issue #17971:
URL: 
https://github.com/apache/incubator-mxnet/issues/17971#issuecomment-633696597

Hi @djaym7
Thank you for your results. I observe some similarity in the results
measured locally on Skylake-X i9-7920X on MxNet 1.6.0 cu102mkl binary. The
only exception is the time for 512x512 tensor on MxNet(?).
MxNet compiled from master branch (on b2144777b - fix (#18313)) uses MKL if
available, and the results are much better. But Mxnet is still worse than NumPy
for smaller tensors.

![image](https://user-images.githubusercontent.com/58251767/82837689-13bda700-9eca-11ea-90af-0fbd15f9e2c5.png)

Additional measurements on the master with MxNet Profiler enabled show that
> 80us is spent between python and time noted by Profiler for dot operation.
It seems to be an already know issue #14883 and #17097 regarding passing
python/C++ barrier. For me it sounds like fixing python-MXNet binding overhead
issue should also fix this issue.

![image](https://user-images.githubusercontent.com/58251767/82837709-20da9600-9eca-11ea-8df1-75f4e8f0be56.png)

Results in table below, neglecting measurement noise, shows that
differences between time measured in python and MKL are almost the same as
between python and MXNet Profiler, so it confirms python <-> C++ API issue.

![image](https://user-images.githubusercontent.com/58251767/82837739-364fc000-9eca-11ea-827c-5202bfcf87b1.png)

In the last table there are results for MxNet when both profiler and MKL
verbose are enabled (adding additional time for both measurements). We can see
here that the difference between python time and profile time is similar to the
results in the previous tables and it is the most significant one.

![image](https://user-images.githubusercontent.com/58251767/82837761-4cf61700-9eca-11ea-8eba-8091f0c48deb.png)

Exact results of my measurements could be find in logs:
[dot_issue_logs.zip](https://github.com/apache/incubator-mxnet/files/4678496/dot_issue_logs.zip)

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [incubator-mxnet] anko-intel commented on issue #17971: DOT product too slow on CPU and GPU compared to np and pytorch

Reply via email to