[GitHub] [incubator-mxnet] eric-haibin-lin commented on issue #18543: Gradient checkpointing in the Gluon interface

2020-07-23 Thread GitBox
eric-haibin-lin commented on issue #18543: URL: https://github.com/apache/incubator-mxnet/issues/18543#issuecomment-663098706 I'm still looking into this. Currently the mirror pass requires some shape/type information which is missing at the point of calling the pass

[GitHub] [incubator-mxnet] gigasquid commented on issue #17783: [RFC] MXNet 2.0 JVM Language development

2020-07-23 Thread GitBox
gigasquid commented on issue #17783: URL: https://github.com/apache/incubator-mxnet/issues/17783#issuecomment-663085890 @saudet @szha - I think we be a good path forward (from the Clojure perspective) This is an automated m

[GitHub] [incubator-mxnet] mxnet-bot commented on pull request #18660: [numpy] fix flaky mixed precision binary error

2020-07-23 Thread GitBox
mxnet-bot commented on pull request #18660: URL: https://github.com/apache/incubator-mxnet/pull/18660#issuecomment-663083306 Jenkins CI successfully triggered : [centos-cpu] This is an automated message from the Apache Git Se

[GitHub] [incubator-mxnet] Yiyan66 commented on pull request #18660: [numpy] fix flaky mixed precision binary error

2020-07-23 Thread GitBox
Yiyan66 commented on pull request #18660: URL: https://github.com/apache/incubator-mxnet/pull/18660#issuecomment-663083249 @mxnet-bot run ci [centos-cpu] This is an automated message from the Apache Git Service. To respond to

[GitHub] [incubator-mxnet] szha commented on issue #18776: MXNet model export to ONNX failed

2020-07-23 Thread GitBox
szha commented on issue #18776: URL: https://github.com/apache/incubator-mxnet/issues/18776#issuecomment-663071856 Thanks for reporting. We are removing that op in the RNN layer in the 2.0. It will need to be registered in 1.x. That operator should be registered as simple concatenation fo

[GitHub] [incubator-mxnet] ptrendx opened a new pull request #18778: [v1.x] Cherry pick #18768

2020-07-23 Thread GitBox
ptrendx opened a new pull request #18778: URL: https://github.com/apache/incubator-mxnet/pull/18778 @szha This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use th

[GitHub] [incubator-mxnet] mxnet-bot commented on pull request #18778: [v1.x] Cherry pick #18768

2020-07-23 Thread GitBox
mxnet-bot commented on pull request #18778: URL: https://github.com/apache/incubator-mxnet/pull/18778#issuecomment-663069406 Hey @ptrendx , Thanks for submitting the PR All tests are already queued to run once. If tests fail, you can trigger one or more tests again with the following co

[GitHub] [incubator-mxnet] josephevans commented on pull request #18752: Fix linalg_potri and linalg_potrf operators for large tensor.

2020-07-23 Thread GitBox
josephevans commented on pull request #18752: URL: https://github.com/apache/incubator-mxnet/pull/18752#issuecomment-663066996 @mxnet-bot run ci [unix-gpu] This is an automated message from the Apache Git Service. To respond

[GitHub] [incubator-mxnet] mxnet-bot commented on pull request #18752: Fix linalg_potri and linalg_potrf operators for large tensor.

2020-07-23 Thread GitBox
mxnet-bot commented on pull request #18752: URL: https://github.com/apache/incubator-mxnet/pull/18752#issuecomment-663067039 Jenkins CI successfully triggered : [unix-gpu] This is an automated message from the Apache Git Serv

[GitHub] [incubator-mxnet] szha commented on issue #18774: unable to install mxnet package in R 4.0.1

2020-07-23 Thread GitBox
szha commented on issue #18774: URL: https://github.com/apache/incubator-mxnet/issues/18774#issuecomment-663065512 @niranjannilekani thanks for reporting. @hetong007 could you help on this? This is an automated message from

[GitHub] [incubator-mxnet] szha commented on issue #17783: [RFC] MXNet 2.0 JVM Language development

2020-07-23 Thread GitBox
szha commented on issue #17783: URL: https://github.com/apache/incubator-mxnet/issues/17783#issuecomment-663064169 @saudet this looks awesome! An 18% improvement in throughput is quite significant for switching the way of integration for a frontend binding. I think we should definitely st

[GitHub] [incubator-mxnet] anko-intel commented on a change in pull request #18777: [v1.7.x] ElementWiseSum fix for oneDNN

2020-07-23 Thread GitBox
anko-intel commented on a change in pull request #18777: URL: https://github.com/apache/incubator-mxnet/pull/18777#discussion_r459504958 ## File path: src/operator/tensor/elemwise_sum.cc ## @@ -118,11 +118,26 @@ void ElementWiseSumComputeExCPU(const nnvm::NodeAttrs& attrs,

[GitHub] [incubator-mxnet] anko-intel commented on a change in pull request #18777: [v1.7.x] ElementWiseSum fix for oneDNN

2020-07-23 Thread GitBox
anko-intel commented on a change in pull request #18777: URL: https://github.com/apache/incubator-mxnet/pull/18777#discussion_r459502487 ## File path: src/operator/tensor/elemwise_sum.cc ## @@ -118,11 +118,26 @@ void ElementWiseSumComputeExCPU(const nnvm::NodeAttrs& attrs,

[GitHub] [incubator-mxnet] mxnet-bot commented on pull request #18777: [v1.7.x] ElementWiseSum fix for oneDNN

2020-07-23 Thread GitBox
mxnet-bot commented on pull request #18777: URL: https://github.com/apache/incubator-mxnet/pull/18777#issuecomment-663042745 Hey @bgawrych , Thanks for submitting the PR All tests are already queued to run once. If tests fail, you can trigger one or more tests again with the following c

[GitHub] [incubator-mxnet] bgawrych opened a new pull request #18777: [v1.7.x] ElementWiseSum fix for oneDNN

2020-07-23 Thread GitBox
bgawrych opened a new pull request #18777: URL: https://github.com/apache/incubator-mxnet/pull/18777 ## Description ## This PR fixes bug which occurs when training gluonCV deeplab with oneDNN support. Original issue: https://github.com/dmlc/gluon-cv/issues/1368 To reproduce:

[GitHub] [incubator-mxnet] saudet commented on issue #17783: [RFC] MXNet 2.0 JVM Language development

2020-07-23 Thread GitBox
saudet commented on issue #17783: URL: https://github.com/apache/incubator-mxnet/issues/17783#issuecomment-662994965 Hi, instead of JNA, I would be happy to provide bindings for the C API and maintain packages based on the JavaCPP Presets here: https://github.com/bytedeco/javacpp-preset

[GitHub] [incubator-mxnet] xizi opened a new issue #18776: MXNet model export to ONNX failed

2020-07-23 Thread GitBox
xizi opened a new issue #18776: URL: https://github.com/apache/incubator-mxnet/issues/18776 export rnn.GRU op failed, error message as follow: AttributeError: No conversion function registered for op type _rnn_param_concat yet. --

[GitHub] [incubator-mxnet] bgawrych commented on pull request #18708: [v1.x Backport] Fix softmax, logsoftmax failed on empty ndarray (#18602)

2020-07-23 Thread GitBox
bgawrych commented on pull request #18708: URL: https://github.com/apache/incubator-mxnet/pull/18708#issuecomment-662950170 @TaoLv @PatricZhao Can you take a look? CC: @Yiyan66 as owner of the fix npx.softmax for 0-sized inputs (#18158) --

[GitHub] [incubator-mxnet] Yiyan66 commented on pull request #18660: [numpy] fix flaky mixed precision binary error

2020-07-23 Thread GitBox
Yiyan66 commented on pull request #18660: URL: https://github.com/apache/incubator-mxnet/pull/18660#issuecomment-662903653 @mxnet-bot run ci [centos-cpu, centos-gpu] This is an automated message from the Apache Git Service. T

[GitHub] [incubator-mxnet] mxnet-bot commented on pull request #18660: [numpy] fix flaky mixed precision binary error

2020-07-23 Thread GitBox
mxnet-bot commented on pull request #18660: URL: https://github.com/apache/incubator-mxnet/pull/18660#issuecomment-662903709 Jenkins CI successfully triggered : [centos-cpu, centos-gpu] This is an automated message from the A

[GitHub] [incubator-mxnet] chinakook commented on issue #18643: ndarray.contrib.boolean_mask can not be hybridize

2020-07-23 Thread GitBox
chinakook commented on issue #18643: URL: https://github.com/apache/incubator-mxnet/issues/18643#issuecomment-662876105 A very big and complex graph with dynamic ops may be fail to delayed infer. You can modularize them and then compose modules. --

[GitHub] [incubator-mxnet] wkcn commented on pull request #18714: [v1.6.x][Bug Fixed] Fix batch norm when grad_req is `add` (#18518)

2020-07-23 Thread GitBox
wkcn commented on pull request #18714: URL: https://github.com/apache/incubator-mxnet/pull/18714#issuecomment-662869997 @ChaiBapchya , This PR looks good to me too. : ) This is an automated message from the Apache Git Service

[GitHub] [incubator-mxnet] chinakook commented on issue #18643: ndarray.contrib.boolean_mask can not be hybridize

2020-07-23 Thread GitBox
chinakook commented on issue #18643: URL: https://github.com/apache/incubator-mxnet/issues/18643#issuecomment-662868075 I also encounter this problem, delayed infer may be fail. This is an automated message from the Apache G

[GitHub] [incubator-mxnet] bgawrych commented on pull request #18708: [v1.x Backport] Fix softmax, logsoftmax failed on empty ndarray (#18602)

2020-07-23 Thread GitBox
bgawrych commented on pull request #18708: URL: https://github.com/apache/incubator-mxnet/pull/18708#issuecomment-662852190 @mxnet-bot run ci [edge, unix-gpu] This is an automated message from the Apache Git Service. To respo

[GitHub] [incubator-mxnet] mxnet-bot commented on pull request #18708: [v1.x Backport] Fix softmax, logsoftmax failed on empty ndarray (#18602)

2020-07-23 Thread GitBox
mxnet-bot commented on pull request #18708: URL: https://github.com/apache/incubator-mxnet/pull/18708#issuecomment-662852213 Jenkins CI successfully triggered : [edge, unix-gpu] This is an automated message from the Apache Gi

[GitHub] [incubator-mxnet] DickJC123 commented on pull request #18424: environment variable handling in unittests

2020-07-23 Thread GitBox
DickJC123 commented on pull request #18424: URL: https://github.com/apache/incubator-mxnet/pull/18424#issuecomment-662849662 @MoisesHer Thanks for your patience- I know you've been waiting for this PR's functionality for your PR. Stay tuned- I'm hopeful you'll be unblocked soon. ---

[GitHub] [incubator-mxnet] DickJC123 commented on pull request #18424: environment variable handling in unittests

2020-07-22 Thread GitBox
DickJC123 commented on pull request #18424: URL: https://github.com/apache/incubator-mxnet/pull/18424#issuecomment-662846028 Sorry I got side-tracked on a different PR and let this sit idle for awhile. At this point, I'm done with the prep of this PR and would be happy to respond to a rev

[GitHub] [incubator-mxnet] DickJC123 opened a new issue #18775: Failures seen in unittest test_gluon_probability{_v1,_v2}.py::test_gluon_cauchy_{v1,}

2020-07-22 Thread GitBox
DickJC123 opened a new issue #18775: URL: https://github.com/apache/incubator-mxnet/issues/18775 ## Description This was observed during the development of https://github.com/apache/incubator-mxnet/pull/18424. I've developed a fix which I've added via commit https://github.com/apache/

[GitHub] [incubator-mxnet] niranjannilekani opened a new issue #18774: unable to install mxnet package in R 4.0.1

2020-07-22 Thread GitBox
niranjannilekani opened a new issue #18774: URL: https://github.com/apache/incubator-mxnet/issues/18774 ## Description Iam unable to install mxnet package in R 4.0.1 ### Error Message package ‘mxnet’ was installed before R 4.0.0: please re-install it ## To Reproduce (I

[GitHub] [incubator-mxnet] leezu commented on issue #18764: Horovod issue with stable PyPi mxnet versions 1.6.0cu102

2020-07-22 Thread GitBox
leezu commented on issue #18764: URL: https://github.com/apache/incubator-mxnet/issues/18764#issuecomment-662839367 That's a separate problem. @eric-haibin-lin mentioned the problem does not apply to 1.x nightly build This

[GitHub] [incubator-mxnet] szha commented on issue #18764: Horovod issue with stable PyPi mxnet versions 1.6.0cu102

2020-07-22 Thread GitBox
szha commented on issue #18764: URL: https://github.com/apache/incubator-mxnet/issues/18764#issuecomment-662832752 It still seems to be the case in 2.0 in #18772 This is an automated message from the Apache Git Service. To

[GitHub] [incubator-mxnet] leezu edited a comment on issue #18772: horovod seg-fault with mxnet pip wheels

2020-07-22 Thread GitBox
leezu edited a comment on issue #18772: URL: https://github.com/apache/incubator-mxnet/issues/18772#issuecomment-662781755 Horovod includes the MXNet C++ headers and based on them interacts with the Engine: https://github.com/horovod/horovod/blob/cf022be959a7c9431a8415729758b26dec

[GitHub] [incubator-mxnet] leezu edited a comment on issue #10988: Flaky test: test_operator_gpu.test_countsketch

2020-07-22 Thread GitBox
leezu edited a comment on issue #10988: URL: https://github.com/apache/incubator-mxnet/issues/10988#issuecomment-662817730 ``` [2020-07-23T03:30:14.821Z] tests/python/gpu/test_numpy_fallback.py::test_np_fallback_decorator PASSED [ 19%] [2020-07-23T03:30:16.176Z] tests/python/gpu/te

[GitHub] [incubator-mxnet] leezu commented on issue #15832: test_operator_gpu.test_ndarray_reshape fails

2020-07-22 Thread GitBox
leezu commented on issue #15832: URL: https://github.com/apache/incubator-mxnet/issues/15832#issuecomment-662817805 https://github.com/apache/incubator-mxnet/issues/10988 This is an automated message from the Apache Git Serv

[GitHub] [incubator-mxnet] leezu commented on issue #10988: Flaky test: test_operator_gpu.test_countsketch

2020-07-22 Thread GitBox
leezu commented on issue #10988: URL: https://github.com/apache/incubator-mxnet/issues/10988#issuecomment-662817730 ``` [2020-07-23T03:30:14.821Z] tests/python/gpu/test_numpy_fallback.py::test_np_fallback_decorator PASSED [ 19%] [2020-07-23T03:30:16.176Z] tests/python/gpu/test_oper

[GitHub] [incubator-mxnet] josephevans commented on pull request #18752: Fix linalg_potri and linalg_potrf operators for large tensor.

2020-07-22 Thread GitBox
josephevans commented on pull request #18752: URL: https://github.com/apache/incubator-mxnet/pull/18752#issuecomment-662817291 @mxnet-bot run ci [website] This is an automated message from the Apache Git Service. To respond t

[GitHub] [incubator-mxnet] mxnet-bot commented on pull request #18752: Fix linalg_potri and linalg_potrf operators for large tensor.

2020-07-22 Thread GitBox
mxnet-bot commented on pull request #18752: URL: https://github.com/apache/incubator-mxnet/pull/18752#issuecomment-662817305 Jenkins CI successfully triggered : [website] This is an automated message from the Apache Git Servi

[GitHub] [incubator-mxnet] ChaiBapchya commented on pull request #18714: [v1.6.x][Bug Fixed] Fix batch norm when grad_req is `add` (#18518)

2020-07-22 Thread GitBox
ChaiBapchya commented on pull request #18714: URL: https://github.com/apache/incubator-mxnet/pull/18714#issuecomment-662811732 @wkcn Appreciate your help with this cherry-pick. Changes look good to me. Can we merge it? This

[GitHub] [incubator-mxnet] wkcn commented on pull request #18714: [v1.6.x][Bug Fixed] Fix batch norm when grad_req is `add` (#18518)

2020-07-22 Thread GitBox
wkcn commented on pull request #18714: URL: https://github.com/apache/incubator-mxnet/pull/18714#issuecomment-662805396 Hi @ChaiBapchya , all tests passed : ) This is an automated message from the Apache Git Service. To respo

[GitHub] [incubator-mxnet] ChaiBapchya commented on pull request #18773: Back port optimization to broadcast_axis to MXNet1.x

2020-07-22 Thread GitBox
ChaiBapchya commented on pull request #18773: URL: https://github.com/apache/incubator-mxnet/pull/18773#issuecomment-662787469 Looks good since it's a cherry-pick. For functionality check, broadcast_axis specific tests in unix-cpu should verify that on CI. For performance check, do y

[GitHub] [incubator-mxnet] ptrendx merged pull request #18742: [v1.x] Cherrypick #18677 #18713

2020-07-22 Thread GitBox
ptrendx merged pull request #18742: URL: https://github.com/apache/incubator-mxnet/pull/18742 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [incubator-mxnet] access2rohit commented on pull request #18773: Back port optimization to broadcast_axis to MXNet1.7.x

2020-07-22 Thread GitBox
access2rohit commented on pull request #18773: URL: https://github.com/apache/incubator-mxnet/pull/18773#issuecomment-662783995 @leezu @sandeep-krishnamurthy can you help merge ? These are cherry-picked from master(already merged there)

[GitHub] [incubator-mxnet] mxnet-bot commented on pull request #18773: Back port optimization to broadcast_axis to MXNet1.7.x

2020-07-22 Thread GitBox
mxnet-bot commented on pull request #18773: URL: https://github.com/apache/incubator-mxnet/pull/18773#issuecomment-662783346 Hey @access2rohit , Thanks for submitting the PR All tests are already queued to run once. If tests fail, you can trigger one or more tests again with the followi

[GitHub] [incubator-mxnet] access2rohit opened a new pull request #18773: Back port optimization to broadcast_axis to MXNet1.7.x

2020-07-22 Thread GitBox
access2rohit opened a new pull request #18773: URL: https://github.com/apache/incubator-mxnet/pull/18773 ## Description ## Back port optimization to broadcast_axis for CPU and GPU to MXNet1.7.x ## Checklist ## ### Essentials ### Please feel free to remove inapplicable items f

[GitHub] [incubator-mxnet] leezu edited a comment on issue #18772: horovod seg-fault with mxnet pip wheels

2020-07-22 Thread GitBox
leezu edited a comment on issue #18772: URL: https://github.com/apache/incubator-mxnet/issues/18772#issuecomment-662781755 Horovod includes the MXNet C++ headers and based on them interacts with the Engine: https://github.com/horovod/horovod/blob/cf022be959a7c9431a8415729758b26dec

[GitHub] [incubator-mxnet] leezu edited a comment on issue #18772: horovod seg-fault with mxnet pip wheels

2020-07-22 Thread GitBox
leezu edited a comment on issue #18772: URL: https://github.com/apache/incubator-mxnet/issues/18772#issuecomment-662781755 Horovod includes the MXNet C++ headers and based on them interacts with the Engine: https://github.com/horovod/horovod/blob/cf022be959a7c9431a8415729758b26dec

[GitHub] [incubator-mxnet] leezu commented on issue #18772: horovod seg-fault with mxnet pip wheels

2020-07-22 Thread GitBox
leezu commented on issue #18772: URL: https://github.com/apache/incubator-mxnet/issues/18772#issuecomment-662781755 Horovod includes the MXNet C++ headers and based on them interacts with the Engine: https://github.com/horovod/horovod/blob/cf022be959a7c9431a8415729758b26dec1a87e5/

[GitHub] [incubator-mxnet] ZheyuYe commented on issue #18766: Horovod support in MXNet 2

2020-07-22 Thread GitBox
ZheyuYe commented on issue #18766: URL: https://github.com/apache/incubator-mxnet/issues/18766#issuecomment-662778617 @eric-haibin-lin Thanks a lot. I'll try this version. This is an automated message from the Apache Git Se

[GitHub] [incubator-mxnet] DickJC123 commented on pull request #18424: environment variable handling in unittests

2020-07-22 Thread GitBox
DickJC123 commented on pull request #18424: URL: https://github.com/apache/incubator-mxnet/pull/18424#issuecomment-662773737 I have another clean-up commit or two on this... This is an automated message from the Apache Git Se

[GitHub] [incubator-mxnet] eric-haibin-lin commented on issue #18766: Horovod support in MXNet 2

2020-07-22 Thread GitBox
eric-haibin-lin commented on issue #18766: URL: https://github.com/apache/incubator-mxnet/issues/18766#issuecomment-662772739 I have a version that seems to work with mxnet built from source here: https://github.com/eric-haibin-lin/horovod/tree/mx2 feel free to try it out if you need i

[GitHub] [incubator-mxnet] eric-haibin-lin commented on issue #18772: horovod seg-fault with mxnet pip wheels

2020-07-22 Thread GitBox
eric-haibin-lin commented on issue #18772: URL: https://github.com/apache/incubator-mxnet/issues/18772#issuecomment-662772210 ``` [1,0]:(gdb) bt [1,0]:#0 0x77419b80 in pthread_mutex_lock () from /lib64/libpthread.so.0 [1,0]:#1 0x7fff68a1b81d in mxnet::engine::Thread

[GitHub] [incubator-mxnet] eric-haibin-lin opened a new issue #18772: horovod seg-fault with mxnet pip wheels

2020-07-22 Thread GitBox
eric-haibin-lin opened a new issue #18772: URL: https://github.com/apache/incubator-mxnet/issues/18772 I am working on a bug fix for mxnet master with my horovod branch: https://github.com/eric-haibin-lin/horovod/tree/mx2 I noticed that the example passes if I use mxnet built from s

[GitHub] [incubator-mxnet] gilbertfrancois edited a comment on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
gilbertfrancois edited a comment on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662755383 Coming back to mxnet: it looks like it is possible to do a forward pass (inference mode) on cpu when the BatchNorm is placed with Dense layers. Because on C

[GitHub] [incubator-mxnet] gilbertfrancois commented on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
gilbertfrancois commented on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662756717 I suspect that the behaviour is corrected when the update of moving_mean and moving_var on GPU is done in the backward pass, like it is on CPU. It will solve the N

[GitHub] [incubator-mxnet] gilbertfrancois edited a comment on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
gilbertfrancois edited a comment on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662755383 Coming back to mxnet: it looks like it is possible to do a forward pass (inference mode) on cpu when the BatchNorm is placed with Dense layers. Because on C

[GitHub] [incubator-mxnet] gilbertfrancois commented on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
gilbertfrancois commented on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662755383 Coming back to mxnet: it looks like it is possible to do a forward pass (inference mode) on cpu when the BatchNorm is placed with Dense layers. But on gpu, it trie

[GitHub] [incubator-mxnet] gilbertfrancois edited a comment on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
gilbertfrancois edited a comment on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662750082 Ok, I see that. But I guess it is the same intended behaviour as pyTorch nn.BatchNorm1d for Dense layers, which takes as input (N, C). The normalization is

[GitHub] [incubator-mxnet] mxnet-bot commented on pull request #18771: CI: Migrate remaining Dockerfiles to docker-compose.yml and remove unused code

2020-07-22 Thread GitBox
mxnet-bot commented on pull request #18771: URL: https://github.com/apache/incubator-mxnet/pull/18771#issuecomment-662753844 Hey @leezu , Thanks for submitting the PR All tests are already queued to run once. If tests fail, you can trigger one or more tests again with the following comm

[GitHub] [incubator-mxnet] leezu opened a new pull request #18771: CI: Migrate remaining Dockerfiles to docker-compose.yml and remove unused code

2020-07-22 Thread GitBox
leezu opened a new pull request #18771: URL: https://github.com/apache/incubator-mxnet/pull/18771 - Delete unused Dockerfiles - Delete unused install/*.sh scripts - Consolidate ubuntu_gpu_tensorrt and ubuntu_gpu - Remove deprecated logic in ci/build.py (no longer needed with do

[GitHub] [incubator-mxnet] gilbertfrancois edited a comment on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
gilbertfrancois edited a comment on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662750082 Ok, I see that. But I guess it is the same intended behaviour as pyTorch nn.BatchNorm1d for Dense layers, which takes as input (N, C). The normalization is

[GitHub] [incubator-mxnet] mxnet-bot commented on pull request #18742: [v1.x] Cherrypick #18677 #18713

2020-07-22 Thread GitBox
mxnet-bot commented on pull request #18742: URL: https://github.com/apache/incubator-mxnet/pull/18742#issuecomment-662752044 Jenkins CI successfully triggered : [unix-gpu] This is an automated message from the Apache Git Serv

[GitHub] [incubator-mxnet] ChaiBapchya commented on pull request #18742: [v1.x] Cherrypick #18677 #18713

2020-07-22 Thread GitBox
ChaiBapchya commented on pull request #18742: URL: https://github.com/apache/incubator-mxnet/pull/18742#issuecomment-662752028 @mxnet-bot run ci [unix-gpu] This is an automated message from the Apache Git Service. To respond

[GitHub] [incubator-mxnet] gilbertfrancois commented on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
gilbertfrancois commented on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662750082 Ok, I see that. But I guess it is the same intended behaviour as pyTorch nn.BatchNorm1d, which takes as input (N, C). The normalization is done over C features. E.

[GitHub] [incubator-mxnet] nabulsi commented on issue #18759: Jetson: Segmentation Fault 11 When Importing MXNET

2020-07-22 Thread GitBox
nabulsi commented on issue #18759: URL: https://github.com/apache/incubator-mxnet/issues/18759#issuecomment-662740615 @mseth10 the wheel is currently enough for me. I can move forward now, but I am worried if in the next few days/weeks I find that I need something more and I will have to

[GitHub] [incubator-mxnet] TristonC edited a comment on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
TristonC edited a comment on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662730136 After nn.Flatten(), the batch norm is actually performed on a 1xCx1x1 Tensor, where C is 9408 for the first batch norm layer in tail, and it is 32 for the second b

[GitHub] [incubator-mxnet] TristonC commented on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
TristonC commented on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662730136 After nn.Flatten(), the batch norm is actually performed on a 1xCx1x1 Tensor, where C is 9408 for the first batch norm layer in tail, and it is 32 for the second batch mo

[GitHub] [incubator-mxnet] gilbertfrancois commented on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
gilbertfrancois commented on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662729925 Hi @TristonC, the project is for training. I adapted the script. It It does now one training step, with forward - backward pass and a validation step. The h

[GitHub] [incubator-mxnet] TristonC removed a comment on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
TristonC removed a comment on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662721253 It looks like there is a bug there for doing batch norm with 1D array, when the batch size is 1. For example, in this case, after flat, there vector size is 9408,

[GitHub] [incubator-mxnet] TristonC commented on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
TristonC commented on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662721253 It looks like there is a bug there for doing batch norm with 1D array, when the batch size is 1. For example, in this case, after flat, there vector size is 9408, which m

[GitHub] [incubator-mxnet] DickJC123 commented on a change in pull request #18424: environment variable handling in unittests

2020-07-22 Thread GitBox
DickJC123 commented on a change in pull request #18424: URL: https://github.com/apache/incubator-mxnet/pull/18424#discussion_r459109387 ## File path: src/c_api/c_api_test.cc ## @@ -106,3 +106,25 @@ int MXRemoveSubgraphPropertyOpNamesV2(const char* prop_name) { } API_END(

[GitHub] [incubator-mxnet] DickJC123 commented on a change in pull request #18424: environment variable handling in unittests

2020-07-22 Thread GitBox
DickJC123 commented on a change in pull request #18424: URL: https://github.com/apache/incubator-mxnet/pull/18424#discussion_r459107133 ## File path: tests/python/gpu/test_kvstore_gpu.py ## @@ -20,8 +20,8 @@ import os import mxnet as mx import numpy as np -import pytest -fro

[GitHub] [incubator-mxnet] mseth10 commented on issue #18759: Jetson: Segmentation Fault 11 When Importing MXNET

2020-07-22 Thread GitBox
mseth10 commented on issue #18759: URL: https://github.com/apache/incubator-mxnet/issues/18759#issuecomment-662716632 @nabulsi that's great news. I have not yet tested the cross compilation script provided on the installation page, and it might need some fixing. Until that is done, is the

[GitHub] [incubator-mxnet] TristonC edited a comment on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
TristonC edited a comment on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662687910 @gilbertfrancois I did a quick test, to answer your question: > I don't understand why y_out from MyNet with BatchNorm on GPU still contains real numbers, given

[GitHub] [incubator-mxnet] TristonC edited a comment on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
TristonC edited a comment on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662687910 @gilbertfrancois I did a quick test, to answer your question: > I don't understand why y_out from MyNet with BatchNorm on GPU still contains real numbers, given

[GitHub] [incubator-mxnet] TristonC edited a comment on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
TristonC edited a comment on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662687910 @gilbertfrancois I did a quick test, to answer your question: > I don't understand why y_out from MyNet with BatchNorm on GPU still contains real numbers, given

[GitHub] [incubator-mxnet-site] aaronmarkham merged pull request #66: sync beta website with the latest mxnet website

2020-07-22 Thread GitBox
aaronmarkham merged pull request #66: URL: https://github.com/apache/incubator-mxnet-site/pull/66 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL a

[GitHub] [incubator-mxnet] TristonC edited a comment on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
TristonC edited a comment on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662687910 @gilbertfrancois I did a quick test, to answer your question: > I don't understand why y_out from MyNet with BatchNorm on GPU still contains real numbers, given

[GitHub] [incubator-mxnet] TristonC commented on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
TristonC commented on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662687910 @gilbertfrancois I did a quick test, to answer your question: > I don't understand why y_out from MyNet with BatchNorm on GPU still contains real numbers, given that th

[GitHub] [incubator-mxnet] TristonC edited a comment on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
TristonC edited a comment on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662687910 @gilbertfrancois I did a quick test, to answer your question: > I don't understand why y_out from MyNet with BatchNorm on GPU still contains real numbers, given

[GitHub] [incubator-mxnet-site] ys2843 opened a new pull request #66: sync beta website with the latest mxnet website

2020-07-22 Thread GitBox
ys2843 opened a new pull request #66: URL: https://github.com/apache/incubator-mxnet-site/pull/66 This PR contains a latest website build based on mxnet master branch. This is to sync beta-stage website with the latest mxnet website. ---

[GitHub] [incubator-mxnet] leezu opened a new issue #18770: NumPy byteorder incompatibility

2020-07-22 Thread GitBox
leezu opened a new issue #18770: URL: https://github.com/apache/incubator-mxnet/issues/18770 MXNet ignores the byte-order of numpy data-types and always uses native byte-order. ``` >>> import mxnet as mx >>> import numpy as np >>> print(mx.np.arange(10, dtype=np.dtype('>> pr

[GitHub] [incubator-mxnet] szha commented on issue #16167: [RFC] Apache MXNet 2.0 Roadmap

2020-07-22 Thread GitBox
szha commented on issue #16167: URL: https://github.com/apache/incubator-mxnet/issues/16167#issuecomment-662620865 @fhieber we are planning to release the first public beta on this somewhere in August. At the moment we are finalizing some API changes and also validating them in GluonNLP.

[GitHub] [incubator-mxnet] szha commented on pull request #18478: [v1.7.x]License checker enhancement

2020-07-22 Thread GitBox
szha commented on pull request #18478: URL: https://github.com/apache/incubator-mxnet/pull/18478#issuecomment-662619908 Yes sounds good. This is an automated message from the Apache Git Service. To respond to the message, ple

[GitHub] [incubator-mxnet-site] ys2843 commented on pull request #65: Test redirect rule based on referer

2020-07-22 Thread GitBox
ys2843 commented on pull request #65: URL: https://github.com/apache/incubator-mxnet-site/pull/65#issuecomment-662605128 > Please add description. Done, @sandeep-krishnamurthy This is an automated message from the Ap

[GitHub] [incubator-mxnet-site] sandeep-krishnamurthy commented on pull request #65: Test redirect rule based on referer

2020-07-22 Thread GitBox
sandeep-krishnamurthy commented on pull request #65: URL: https://github.com/apache/incubator-mxnet-site/pull/65#issuecomment-662596749 Please add description. This is an automated message from the Apache Git Service. To res

[GitHub] [incubator-mxnet] wkcn commented on pull request #18714: [v1.6.x][Bug Fixed] Fix batch norm when grad_req is `add` (#18518)

2020-07-22 Thread GitBox
wkcn commented on pull request #18714: URL: https://github.com/apache/incubator-mxnet/pull/18714#issuecomment-662593292 Hi @ChaiBapchya , I found a compilation error in http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-cpu/detail/PR-18714/3/pipeline

[GitHub] [incubator-mxnet-site] ys2843 opened a new pull request #65: Test redirect rule based on referer

2020-07-22 Thread GitBox
ys2843 opened a new pull request #65: URL: https://github.com/apache/incubator-mxnet-site/pull/65 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL a

[GitHub] [incubator-mxnet] xidulu commented on pull request #18758: [WIP] [Gluon] Add VAE demo

2020-07-22 Thread GitBox
xidulu commented on pull request #18758: URL: https://github.com/apache/incubator-mxnet/pull/18758#issuecomment-662566711 @leezu Thx for pointing that out, I will take a look at it. This is an automated message from the A

[GitHub] [incubator-mxnet] szha edited a comment on pull request #18758: [WIP] [Gluon] Add VAE demo

2020-07-22 Thread GitBox
szha edited a comment on pull request #18758: URL: https://github.com/apache/incubator-mxnet/pull/18758#issuecomment-662183511 ~not yet. I think we should provide the automation in CI first, which may take some time. in the meantime, I think we can check in the jupyter notebook for now wit

[GitHub] [incubator-mxnet] leezu commented on pull request #18758: [WIP] [Gluon] Add VAE demo

2020-07-22 Thread GitBox
leezu commented on pull request #18758: URL: https://github.com/apache/incubator-mxnet/pull/18758#issuecomment-662563255 @xidulu @szha it's supported via https://github.com/apache/incubator-mxnet/blob/243ade93bcb8b7962d1faeb89c98409e3ae0d7a4/docs/python_docs/python/Makefile#L31-L33 --

[GitHub] [incubator-mxnet] TristonC commented on issue #18751: gluon.nn.BatchNorm seems to swap updated values of moving_mean and moving_var on GPU.

2020-07-22 Thread GitBox
TristonC commented on issue #18751: URL: https://github.com/apache/incubator-mxnet/issues/18751#issuecomment-662547038 @gilbertfrancois Is your project for training or inference? In your script, it uses autograd, but it does not do backward(). The reason I asked this, is BatchNorm behave

[GitHub] [incubator-mxnet-site] aaronmarkham merged pull request #64: test redirect rules - set v1.6 as default website

2020-07-22 Thread GitBox
aaronmarkham merged pull request #64: URL: https://github.com/apache/incubator-mxnet-site/pull/64 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL a

[GitHub] [incubator-mxnet] marcoabreu commented on issue #18753: PRs that have passed CI, but have "aged" considerably w.r.t. master, can still be merged.

2020-07-22 Thread GitBox
marcoabreu commented on issue #18753: URL: https://github.com/apache/incubator-mxnet/issues/18753#issuecomment-662538274 Since resolving a merge conflict will always result in a new commit hash, it's inevitable that it's rerun by CI. Merge conflict or not - a PR status should go sta

[GitHub] [incubator-mxnet] DickJC123 commented on issue #18753: PRs that have passed CI, but have "aged" considerably w.r.t. master, can still be merged.

2020-07-22 Thread GitBox
DickJC123 commented on issue #18753: URL: https://github.com/apache/incubator-mxnet/issues/18753#issuecomment-662534016 When my in-development PRs become unmergeable due to other accepted PRs, I merge the current master with my PR and push the result, which generates another CI run. Is i

[GitHub] [incubator-mxnet] fhieber commented on issue #16167: [RFC] Apache MXNet 2.0 Roadmap

2020-07-22 Thread GitBox
fhieber commented on issue #16167: URL: https://github.com/apache/incubator-mxnet/issues/16167#issuecomment-662345601 @szha is there a recent estimate on the timeline for MXNet 2.0? Would you recommend to develop downstream toolkits (e.g. Sockeye) against the master branch now or rather w

[GitHub] [incubator-mxnet] suyz526 edited a comment on issue #18727: module doc missing

2020-07-22 Thread GitBox
suyz526 edited a comment on issue #18727: URL: https://github.com/apache/incubator-mxnet/issues/18727#issuecomment-662315508 Hi, In [https://mxnet.apache.org/api/python/docs/api](https://mxnet.apache.org/api/python/docs/api), almost every page is blank, e.g. [ndarray](https://mxn

[GitHub] [incubator-mxnet] suyz526 commented on issue #18727: module doc missing

2020-07-22 Thread GitBox
suyz526 commented on issue #18727: URL: https://github.com/apache/incubator-mxnet/issues/18727#issuecomment-662315508 Hi, In [https://mxnet.apache.org/api/python/docs/api](https://mxnet.apache.org/api/python/docs/api), almost every page is empty. This 'beta' page works: [

[GitHub] [incubator-mxnet] ciyongch commented on pull request #18478: [v1.7.x]License checker enhancement

2020-07-21 Thread GitBox
ciyongch commented on pull request #18478: URL: https://github.com/apache/incubator-mxnet/pull/18478#issuecomment-662275312 Ok, got it, then let's keep it as is now and try to finalize the fix solution in the next release. Actually how to handle the dual license issue or re-license the t

[GitHub] [incubator-mxnet] ChaiBapchya commented on pull request #18714: [v1.6.x][Bug Fixed] Fix batch norm when grad_req is `add` (#18518)

2020-07-21 Thread GitBox
ChaiBapchya commented on pull request #18714: URL: https://github.com/apache/incubator-mxnet/pull/18714#issuecomment-662272392 @wkcn cherry-picking 1.x into 1.6 doesn't quite resolve CI issues.. any idea? This is an automated

[GitHub] [incubator-mxnet] ChaiBapchya commented on pull request #18742: [v1.x] Cherrypick #18677 #18713

2020-07-21 Thread GitBox
ChaiBapchya commented on pull request #18742: URL: https://github.com/apache/incubator-mxnet/pull/18742#issuecomment-662271682 @mxnet-bot run ci [unix-gpu] This is an automated message from the Apache Git Service. To respond

<    2   3   4   5   6   7   8   9   10   11   >