@jasonyu1996 There is a PR to fix this issue https://github.com/dmlc/nnvm/pull/525 and an associated change on mxnet https://github.com/apache/incubator-mxnet/pull/12479. I know that this issue is blocking everyone else so hopefully this gets through soon and we can get your changes merged too.
Thanks for all of your explanations on the code (especially the code you didnt write)! I hope you dont mind if I ask some more while we wait for the nnvm fix. Also, can you explain the backward pass with the diagonal operator? What exactly is happening there, the diagonal comes in and the original input matrix gets updated? What is the ML meaning in the diag case; what error is there to correct in this calculation? [ Full content available at: https://github.com/apache/incubator-mxnet/pull/12430 ] This message was relayed via gitbox.apache.org for [email protected]
