This is an automated email from the ASF dual-hosted git repository.

taolv pushed a change to branch mkldnn-v1.0
in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git.


    from 5e5fe04  [mkldnn-v1.0] Initiate the transition to MKL-DNN v1.0 (#15706)
     add 8e31dad  [MXNET-978] Add higher order gradient support `tan`, `tanh` 
(#15253)
     add 90f881d  Fix USE_BLAS == openblas check (#15691)
     add 93708eb  Fix url (404 error) (#15683)
     add 0f28f5b  Add -DMXNET_USE_OPENMP to Makefiles so libinfo gets updated 
accordingly (#15498)
     add f0b6d72  update previous flaky naive engine test (#15651)
     add f2ac85a  update julia install doc (#15609)
     add f0c69f5  Add missing default axis value to symbol.squeeze op (#15707)
     add 776d9c4  add deconv in TRT subgraph (#15666)
     add 9d7fc7c  Fix Scala Symbolic API some/Some typo (#15687)
     add 811d516  Add MKLDNN 4c layout to fix gluoncv se_resnext101_64x4d 
(#15692)
     add 42a47b1  Fix _copy_to on MKLDNN backend (#15637)
     add d5fcd98  [DOC] refine autograd docs (#15109)
     add 773f4dc  Fix quantized concat when inputs are mixed int8 and uint8 
(#15693)
     add b3064c5  [MKLDNN]Enhance Quantization APIs and Tutorial (#15448)
     add 90954ec  make TransposeShape infer shape form both sides (#15713)
     add 4b07386  [MXNET-1358] Fit api tutorial (#15353)
     add 8c641b8  remove mshadow submodule
     add 1434b98  import mshadow source tree
     add b07211f  Skip Flaky Test (#15722)
     add cf28b46  Add magic method `abs` to NDArray and Symbol. (#15680)
     add 87425d2  fix boolean_mask for 0-size output (#15731)
     add 24cce9e  prevent TRT_Logger to be destroyed before TRT engine (#14898)
     add 7c26810  fix dropout mask output (#15697)
     add 3255d87  fix broadcast op param (#15714)
     add 3112893  Dynamic Library Loading Support (#15489)
     add 67acbe5  [DOC] Fix many arguments in the doc: reshape_like, 
arange_like, shape_array (#15752)
     add 8c2bef5  Add Gather_nd Scatter_nd to NDArray API category doc (#15689)
     add 51c07e5  [Dependency Update] Dependency update doc (#15045)
     add ce62873  Add quantization support for GluonCV (#15754)
     add 7186123  Revert "Dynamic Library Loading Support" (#15755)
     add aadef2d  Fix flaky test test_global_metric (#15756)
     add 45db8ea  Add matrix determinant operator in linalg (#15007)
     add 07eb482  fix tvm cmake (#15781)
     add a2b11ae  Fix PR #15489 (Dynamic Library Loading Support) (#15760)
     add be49b3b  Numpy-compatible Infra (#15581)
     add 79d8d86  Fix the bug of `MXEnginePushAsyncND` and `MXEnginePushSyncND` 
(#15751)
     add b6972bb  add int8 bn mkldnn implementation and test (#15664)
     add b77f524  [MXNET-1206] Support NDArray indexing with None and Ellipsis 
(#13143)
     add a3babc4  numpy-compatible sum (#15810)
     add 19916af  fix license issues (#15806)
     add bfd3bb8  Refactor LibraryInitializer so it's thread safe. Fixes random 
sporadical concurrency crashes. (#15762)
     add 0eb213d  Fix backward_clip num inputs and type of clip params (#15688)
     add 44a7fca  [Numpy] Numpy compatible slicing (#15798)
     add 614cba3  Making Features as a singleton for improved caching (#15835)
     add 57927a9  Clojure package remove source images (#15828)
     add c3f5eea  Numpy Tensordot and Dot Operator (#15820)
     add bd32de4  fixing problem with existing Singleton Caching (#15868)
     add 795990b  numpy linspace (#15852)
     add 05f3ae1  Large Index Support for Slice (#15593)
     add 67daae7  tvm infra for op attrs (#15854)
     add c81535c  cuDNN support cleanup (#15812)
     add 39bf4e0  Port several np ops to master (#15867)
     add 11ce2a2  Add large tensor support binary arithmetic (#15785)
     add 24a5cf0  Allow operators with multiple outputs in get_atomic_symbol 
(#15740)
     add f32b58e  numpy-compatible split upstream (#15841)
     add b914d0a  Large tensor support for random ops (#15783)
     new ced2bdb  Merge remote-tracking branch 'origin' into mkldnn-v1.0

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 .gitmodules                                        |    3 -
 3rdparty/mshadow                                   |    1 -
 3rdparty/mshadow/.gitignore                        |   21 +
 3rdparty/mshadow/.travis.yml                       |   43 +
 3rdparty/mshadow/CHANGES.md                        |   12 +
 3rdparty/mshadow/CMakeLists.txt                    |    6 +
 3rdparty/mshadow/LICENSE                           |   13 +
 3rdparty/mshadow/README.md                         |   37 +
 3rdparty/mshadow/cmake/Cuda.cmake                  |  324 +++
 3rdparty/mshadow/cmake/Utils.cmake                 |  398 ++++
 3rdparty/mshadow/cmake/mshadow.cmake               |   91 +
 3rdparty/mshadow/cmake/mshadowUtils.cmake          |    2 +
 3rdparty/mshadow/doc/Doxyfile                      | 2358 ++++++++++++++++++++
 3rdparty/mshadow/doc/README.md                     |  321 +++
 3rdparty/mshadow/doc/mkdoc.sh                      |    4 +
 3rdparty/mshadow/guide/.gitignore                  |    3 +
 3rdparty/mshadow/guide/Makefile                    |   37 +
 3rdparty/mshadow/guide/README.md                   |  226 ++
 3rdparty/mshadow/guide/basic.cpp                   |  161 ++
 3rdparty/mshadow/guide/basic_stream.cu             |   35 +
 3rdparty/mshadow/guide/config.mk                   |   39 +
 3rdparty/mshadow/guide/defop.cpp                   |   49 +
 3rdparty/mshadow/guide/exp-template/.gitignore     |    1 +
 3rdparty/mshadow/guide/exp-template/Makefile       |   20 +
 3rdparty/mshadow/guide/exp-template/README.md      |  340 +++
 3rdparty/mshadow/guide/exp-template/exp_lazy.cpp   |   45 +
 .../mshadow/guide/exp-template/exp_template.cpp    |   72 +
 .../mshadow/guide/exp-template/exp_template_op.cpp |   92 +
 3rdparty/mshadow/guide/mshadow-ps/.gitignore       |    4 +
 3rdparty/mshadow/guide/mshadow-ps/2-levels.png     |  Bin 0 -> 59413 bytes
 3rdparty/mshadow/guide/mshadow-ps/Makefile         |   45 +
 3rdparty/mshadow/guide/mshadow-ps/README.md        |  227 ++
 3rdparty/mshadow/guide/mshadow-ps/config.mk        |   40 +
 3rdparty/mshadow/guide/mshadow-ps/dbstr.h          |   35 +
 .../mshadow/guide/mshadow-ps/dist_async_sum-inl.h  |  124 +
 .../mshadow/guide/mshadow-ps/dist_async_sum.cpp    |   11 +
 3rdparty/mshadow/guide/mshadow-ps/local.sh         |   39 +
 3rdparty/mshadow/guide/mshadow-ps/local_sum-inl.h  |  119 +
 3rdparty/mshadow/guide/mshadow-ps/local_sum.cpp    |    4 +
 3rdparty/mshadow/guide/mshadow-ps/local_sum.cu     |    4 +
 3rdparty/mshadow/guide/neuralnet/Makefile          |   38 +
 3rdparty/mshadow/guide/neuralnet/README.md         |   16 +
 3rdparty/mshadow/guide/neuralnet/config.mk         |   35 +
 3rdparty/mshadow/guide/neuralnet/convnet.cu        |  282 +++
 3rdparty/mshadow/guide/neuralnet/nnet.cu           |  202 ++
 3rdparty/mshadow/guide/neuralnet/nnet_ps.cu        |  312 +++
 3rdparty/mshadow/guide/neuralnet/util.h            |   86 +
 3rdparty/mshadow/make/README.md                    |   18 +
 3rdparty/mshadow/make/mshadow.mk                   |  166 ++
 3rdparty/mshadow/mshadow-ps/.gitignore             |    3 +
 3rdparty/mshadow/mshadow-ps/README.md              |    4 +
 3rdparty/mshadow/mshadow-ps/mshadow_ps.h           |  358 +++
 3rdparty/mshadow/mshadow-ps/ps_dist-inl.h          |  126 ++
 3rdparty/mshadow/mshadow-ps/ps_local-inl.h         |  814 +++++++
 3rdparty/mshadow/mshadow-ps/ps_rabit-inl.h         |  113 +
 3rdparty/mshadow/mshadow-ps/thread.h               |  261 +++
 3rdparty/mshadow/mshadow-ps/thread_util.h          |  169 ++
 3rdparty/mshadow/mshadow/README.md                 |    8 +
 3rdparty/mshadow/mshadow/base.h                    | 1110 +++++++++
 3rdparty/mshadow/mshadow/cuda/reduce.cuh           |  120 +
 3rdparty/mshadow/mshadow/cuda/tensor_gpu-inl.cuh   |  828 +++++++
 3rdparty/mshadow/mshadow/dot_engine-inl.h          |  936 ++++++++
 3rdparty/mshadow/mshadow/expr_engine-inl.h         |  482 ++++
 3rdparty/mshadow/mshadow/expr_scalar-inl.h         |  165 ++
 3rdparty/mshadow/mshadow/expression.h              |  416 ++++
 3rdparty/mshadow/mshadow/extension.h               |   41 +
 3rdparty/mshadow/mshadow/extension/broadcast.h     |  165 ++
 .../mshadow/extension/broadcast_with_axis.h        |  258 +++
 3rdparty/mshadow/mshadow/extension/channel_pool.h  |  108 +
 .../mshadow/mshadow/extension/channel_unpool.h     |  137 ++
 3rdparty/mshadow/mshadow/extension/choose.h        |   90 +
 3rdparty/mshadow/mshadow/extension/complex.h       |  525 +++++
 3rdparty/mshadow/mshadow/extension/concat.h        |  194 ++
 3rdparty/mshadow/mshadow/extension/crop.h          |  119 +
 3rdparty/mshadow/mshadow/extension/fill.h          |  103 +
 3rdparty/mshadow/mshadow/extension/flip.h          |  132 ++
 3rdparty/mshadow/mshadow/extension/implicit_gemm.h |  128 ++
 3rdparty/mshadow/mshadow/extension/mask.h          |   97 +
 3rdparty/mshadow/mshadow/extension/mirror.h        |   62 +
 3rdparty/mshadow/mshadow/extension/one_hot.h       |   87 +
 .../mshadow/mshadow/extension/pack_col2patch.h     |  154 ++
 3rdparty/mshadow/mshadow/extension/pad.h           |  111 +
 3rdparty/mshadow/mshadow/extension/range.h         |  118 +
 .../mshadow/mshadow/extension/reduce_with_axis.h   |  136 ++
 3rdparty/mshadow/mshadow/extension/reduceto1d.h    |  104 +
 3rdparty/mshadow/mshadow/extension/reshape.h       |   87 +
 3rdparty/mshadow/mshadow/extension/slice.h         |  156 ++
 3rdparty/mshadow/mshadow/extension/slice_ex.h      |  135 ++
 3rdparty/mshadow/mshadow/extension/spatial_pool.h  |  152 ++
 .../mshadow/mshadow/extension/spatial_unpool.h     |  135 ++
 .../mshadow/extension/spatial_upsampling_nearest.h |   71 +
 3rdparty/mshadow/mshadow/extension/swapaxis.h      |  110 +
 3rdparty/mshadow/mshadow/extension/take.h          |   99 +
 3rdparty/mshadow/mshadow/extension/take_grad.h     |  111 +
 3rdparty/mshadow/mshadow/extension/transpose.h     |  200 ++
 .../mshadow/mshadow/extension/unpack_patch2col.h   |  151 ++
 3rdparty/mshadow/mshadow/half.h                    |  354 +++
 3rdparty/mshadow/mshadow/half2.h                   |  143 ++
 3rdparty/mshadow/mshadow/io.h                      |  137 ++
 3rdparty/mshadow/mshadow/logging.h                 |  234 ++
 3rdparty/mshadow/mshadow/packet-inl.h              |  413 ++++
 3rdparty/mshadow/mshadow/packet/plain-inl.h        |   76 +
 3rdparty/mshadow/mshadow/packet/sse-inl.h          |  147 ++
 3rdparty/mshadow/mshadow/random.h                  |  570 +++++
 3rdparty/mshadow/mshadow/stream_gpu-inl.h          |  214 ++
 3rdparty/mshadow/mshadow/tensor.h                  | 1081 +++++++++
 3rdparty/mshadow/mshadow/tensor_container.h        |  208 ++
 3rdparty/mshadow/mshadow/tensor_cpu-inl.h          |  627 ++++++
 3rdparty/mshadow/mshadow/tensor_gpu-inl.h          |  245 ++
 3rdparty/mshadow/scripts/travis_script.sh          |   19 +
 3rdparty/mshadow/test/Makefile                     |   35 +
 3rdparty/mshadow/test/pairtest.cu                  |  105 +
 3rdparty/mshadow/test/pool.cu                      |   69 +
 3rdparty/mshadow/test/reshape.cu                   |   74 +
 3rdparty/mshadow/test/test.cu                      |   79 +
 3rdparty/mshadow/test/test.h                       |   67 +
 3rdparty/mshadow/test/unpack.cu                    |   85 +
 CMakeLists.txt                                     |   20 +-
 LICENSE                                            |  355 ++-
 Makefile                                           |   10 +-
 benchmark/opperf/rules/default_params.py           |    6 +-
 ci/jenkins/Jenkins_steps.groovy                    |   10 +-
 cmake/BuildTVM.cmake                               |    2 +-
 contrib/clojure-package/.gitignore                 |    1 +
 .../infer/objectdetector/images/marcel.jpg         |  Bin 116161 -> 0 bytes
 .../clojure-package/scripts/get_test_images.sh     |   18 +-
 .../test/org/apache/clojure_mxnet/image_test.clj   |    6 +-
 .../clojure_mxnet/infer/imageclassifier_test.clj   |    5 +-
 .../test/test-images/Pug-Cookie.jpg                |  Bin 104323 -> 0 bytes
 .../clojure-package/test/test-images/kitten.jpg    |  Bin 110969 -> 0 bytes
 contrib/clojure-package/test/test_helper.clj       |   26 +
 contrib/tvmop/compile.py                           |    4 +-
 contrib/tvmop/opdef.py                             |   12 +-
 cpp-package/scripts/OpWrapperGenerator.py          |    1 +
 docs/api/python/autograd/autograd.md               |   75 +-
 docs/api/python/gluon/contrib.md                   |   30 +
 docs/api/python/ndarray/ndarray.md                 |    2 +
 docs/api/python/symbol/linalg.md                   |    2 +
 docs/api/scala/symbol.md                           |    2 +-
 docs/faq/env_var.md                                |   10 +-
 docs/install/index.md                              |   64 +-
 docs/tutorials/gluon/fit_api_tutorial.md           |  271 +++
 docs/tutorials/index.md                            |    5 +-
 docs/tutorials/mkldnn/mkldnn_quantization.md       |  259 +++
 example/README.md                                  |    2 +-
 .../__init__.py => example/lib_api/Makefile        |   16 +-
 example/lib_api/libtest.cc                         |   78 +
 example/lib_api/mylib.cc                           |   37 +
 .../contrib/__init__.py => example/lib_api/test.py |   16 +-
 example/quantization/README.md                     |   95 +-
 example/quantization/imagenet_gen_qsym_mkldnn.py   |  168 +-
 include/mxnet/base.h                               |    4 +-
 include/mxnet/c_api.h                              |  197 +-
 include/mxnet/c_predict_api.h                      |    4 +-
 include/mxnet/lib_api.h                            |   50 +
 include/mxnet/ndarray.h                            |    6 +-
 include/mxnet/tuple.h                              |   15 +
 python/mxnet/__init__.py                           |    6 +
 python/mxnet/_ctypes/ndarray.py                    |   36 +-
 python/mxnet/_ctypes/symbol.py                     |   13 +-
 python/mxnet/_numpy_op_doc.py                      |   54 +
 python/mxnet/autograd.py                           |    3 +
 python/mxnet/base.py                               |  121 +-
 python/mxnet/contrib/amp/lists/symbol.py           |    4 +
 python/mxnet/contrib/quantization.py               |  449 +++-
 python/mxnet/cython/ndarray.pyx                    |   27 +-
 python/mxnet/cython/symbol.pyx                     |   16 +-
 python/mxnet/gluon/block.py                        |   49 +-
 python/mxnet/gluon/contrib/__init__.py             |    2 +
 python/mxnet/gluon/contrib/estimator/__init__.py   |    2 +
 python/mxnet/gluon/contrib/estimator/estimator.py  |   64 +-
 .../mxnet/gluon/contrib/estimator/event_handler.py |   23 +-
 python/mxnet/gluon/data/dataloader.py              |    2 +
 python/mxnet/gluon/parameter.py                    |   52 +-
 python/mxnet/gluon/rnn/rnn_layer.py                |    2 -
 python/mxnet/gluon/utils.py                        |   45 +-
 python/mxnet/image/image.py                        |    2 +-
 python/mxnet/library.py                            |   49 +
 python/mxnet/ndarray/__init__.py                   |    6 +-
 python/mxnet/ndarray/_internal.py                  |   11 +-
 python/mxnet/ndarray/ndarray.py                    | 1218 ++++++----
 .../{gluon/contrib => ndarray/numpy}/__init__.py   |   15 +-
 .../__init__.py => ndarray/numpy/_internal.py}     |    7 +-
 python/mxnet/ndarray/numpy/_op.py                  |  684 ++++++
 .../__init__.py => ndarray/numpy/_register.py}     |   13 +-
 .../__init__.py => ndarray/numpy/linalg.py}        |    9 +-
 .../__init__.py => ndarray/numpy/random.py}        |    8 +-
 .../numpy_extension}/__init__.py                   |   14 +-
 .../__init__.py => ndarray/numpy_extension/_op.py} |   12 +-
 .../numpy_extension/_register.py}                  |   12 +-
 .../numpy_extension/image.py}                      |    7 +-
 python/mxnet/ndarray/register.py                   |  116 +-
 python/mxnet/ndarray/utils.py                      |    7 +
 python/mxnet/{gluon/contrib => numpy}/__init__.py  |   21 +-
 .../contrib/estimator/__init__.py => numpy/_op.py} |    7 +-
 .../contrib/__init__.py => numpy/_register.py}     |   12 +-
 python/mxnet/numpy/arrayprint.py                   |   62 +
 python/mxnet/numpy/function_base.py                |  115 +
 .../test_runtime.py => python/mxnet/numpy/io.py    |   45 +-
 .../estimator/__init__.py => numpy/linalg.py}      |    9 +-
 python/mxnet/numpy/multiarray.py                   | 1855 +++++++++++++++
 .../estimator/__init__.py => numpy/random.py}      |    9 +-
 python/mxnet/numpy/stride_tricks.py                |   56 +
 .../{gluon/contrib/__init__.py => numpy/utils.py}  |   20 +-
 .../{gluon/contrib => numpy_extension}/__init__.py |   21 +-
 .../__init__.py => numpy_extension/_op.py}         |    7 +-
 .../__init__.py => numpy_extension/_register.py}   |   12 +-
 .../__init__.py => numpy_extension/image.py}       |   11 +-
 python/mxnet/numpy_extension/utils.py              |  122 +
 python/mxnet/optimizer/optimizer.py                |   34 +-
 python/mxnet/runtime.py                            |    8 +-
 python/mxnet/symbol/__init__.py                    |    7 +-
 python/mxnet/symbol/_internal.py                   |   10 +-
 .../{gluon/contrib => symbol/numpy}/__init__.py    |   17 +-
 .../__init__.py => symbol/numpy/_internal.py}      |    7 +-
 .../estimator/__init__.py => symbol/numpy/_op.py}  |    7 +-
 .../__init__.py => symbol/numpy/_register.py}      |   13 +-
 python/mxnet/symbol/numpy/_symbol.py               | 1315 +++++++++++
 .../__init__.py => symbol/numpy/linalg.py}         |    9 +-
 .../__init__.py => symbol/numpy/random.py}         |    9 +-
 .../contrib => symbol/numpy_extension}/__init__.py |   14 +-
 .../__init__.py => symbol/numpy_extension/_op.py}  |   12 +-
 .../numpy_extension/_register.py}                  |   13 +-
 .../numpy_extension/image.py}                      |    7 +-
 python/mxnet/symbol/register.py                    |   84 +-
 python/mxnet/symbol/symbol.py                      |  119 +-
 python/mxnet/test_utils.py                         |   62 +-
 python/mxnet/util.py                               |  395 +++-
 src/c_api/c_api.cc                                 |  175 +-
 src/c_api/c_api_common.h                           |   15 +-
 src/c_api/c_api_executor.cc                        |   12 +-
 src/c_api/c_api_ndarray.cc                         |   10 +-
 src/c_api/c_api_profile.cc                         |    2 +-
 src/c_api/c_api_symbolic.cc                        |  228 +-
 src/common/cuda_utils.h                            |   16 +-
 src/common/utils.h                                 |   12 +
 src/engine/threaded_engine_perdevice.cc            |    4 +-
 src/executor/attach_op_resource_pass.cc            |    4 +-
 src/imperative/imperative.cc                       |    8 +-
 src/imperative/imperative_utils.h                  |   41 +-
 src/initialize.cc                                  |  241 +-
 src/initialize.h                                   |  126 ++
 src/io/image_io.cc                                 |    3 +
 src/ndarray/ndarray.cc                             |   24 +-
 src/operator/bilinear_sampler.cu                   |    8 +-
 src/operator/contrib/boolean_mask.cc               |    1 +
 src/operator/contrib/boolean_mask.cu               |   15 +-
 src/operator/contrib/multibox_detection.cc         |    4 +
 src/operator/contrib/multibox_prior.cc             |    3 +
 src/operator/contrib/multibox_target.cc            |    4 +
 src/operator/contrib/tvmop/ufunc.cc                |    4 +-
 src/operator/cudnn_bilinear_sampler-inl.h          |    9 +-
 src/operator/cudnn_spatial_transformer-inl.h       |    9 +-
 src/operator/image/crop.cc                         |    1 +
 src/operator/image/image_random.cc                 |   13 +
 src/operator/image/resize.cc                       |    1 +
 src/operator/leaky_relu.cc                         |    1 +
 src/operator/linalg.h                              |   50 +-
 src/operator/linalg_impl.h                         |  243 +-
 src/operator/nn/activation.cc                      |    1 +
 src/operator/nn/batch_norm-inl.h                   |   52 +-
 src/operator/nn/batch_norm.cc                      |    1 +
 src/operator/nn/batch_norm.cu                      |    8 +-
 src/operator/nn/concat.cc                          |   13 +-
 src/operator/nn/convolution.cc                     |    1 +
 src/operator/nn/convolution.cu                     |   99 +-
 src/operator/nn/cudnn/cudnn_activation-inl.h       |   37 +-
 src/operator/nn/cudnn/cudnn_algoreg-inl.h          |    4 +-
 src/operator/nn/cudnn/cudnn_batch_norm-inl.h       |   46 +-
 src/operator/nn/cudnn/cudnn_batch_norm.cc          |    4 +-
 src/operator/nn/cudnn/cudnn_batch_norm.cu          |   90 -
 src/operator/nn/cudnn/cudnn_convolution-inl.h      |  354 +--
 src/operator/nn/cudnn/cudnn_deconvolution-inl.h    |  246 +-
 src/operator/nn/cudnn/cudnn_pooling-inl.h          |   29 +-
 src/operator/nn/deconvolution.cc                   |    1 +
 src/operator/nn/dropout-inl.h                      |   17 +-
 src/operator/nn/dropout.cc                         |    6 +-
 src/operator/nn/fully_connected.cc                 |    1 +
 src/operator/nn/layer_norm.cc                      |    1 +
 src/operator/nn/mkldnn/mkldnn_base.cc              |    4 +
 src/operator/nn/mkldnn/mkldnn_batch_norm-inl.h     |   29 +-
 src/operator/nn/mkldnn/mkldnn_transpose.cc         |    3 +-
 src/operator/nn/pooling.cc                         |    3 +-
 src/operator/nn/softmax.cc                         |    2 +
 src/operator/numpy/np_broadcast_reduce_op.h        |  278 +++
 src/operator/numpy/np_broadcast_reduce_op_value.cc |  154 ++
 src/operator/numpy/np_broadcast_reduce_op_value.cu |   48 +
 src/operator/numpy/np_dot-inl.h                    |  110 +
 src/operator/numpy/np_dot.cc                       |  150 ++
 src/operator/numpy/np_dot.cu                       |   37 +
 src/operator/numpy/np_elemwise_broadcast_op.cc     |  186 ++
 src/operator/numpy/np_elemwise_broadcast_op.cu     |   82 +
 src/operator/numpy/np_init_op.cc                   |  114 +
 src/operator/numpy/np_init_op.cu                   |   47 +
 src/operator/numpy/np_matrix_op-inl.h              |   65 +
 src/operator/numpy/np_matrix_op.cc                 |  252 +++
 src/operator/numpy/np_matrix_op.cu                 |   41 +
 src/operator/numpy/np_tensordot_op-inl.h           |  688 ++++++
 src/operator/numpy/np_tensordot_op.cc              |  235 ++
 src/operator/numpy/np_tensordot_op.cu              |   42 +
 src/operator/numpy/np_true_divide.cc               |  127 ++
 src/operator/numpy/np_true_divide.cu               |   41 +
 .../mkldnn/mkldnn_quantized_batch_norm.cc          |  123 +
 .../quantization/mkldnn/mkldnn_quantized_concat.cc |   12 +-
 src/operator/quantization/quantize_graph_pass.cc   |   42 +-
 src/operator/quantization/quantized_batch_norm.cc  |  137 ++
 src/operator/quantization/quantized_concat.cc      |   12 +-
 src/operator/quantization/quantized_conv.cu        |    9 +-
 src/operator/quantization/quantized_pooling.cu     |    9 +-
 src/operator/random/sample_op.cc                   |    2 +
 src/operator/random/shuffle_op.cc                  |    1 +
 src/operator/rnn-inl.h                             |  162 +-
 src/operator/rnn.cc                                |    3 +-
 src/operator/roi_pooling.cc                        |    4 +
 src/operator/sequence_mask.cc                      |    3 +
 src/operator/spatial_transformer-inl.h             |    2 +-
 src/operator/spatial_transformer.cu                |    8 +-
 src/operator/special_functions-inl.h               |   19 +
 src/operator/subgraph/tensorrt/nnvm_to_onnx-inl.h  |   19 +-
 src/operator/subgraph/tensorrt/nnvm_to_onnx.cc     |   46 +-
 src/operator/subgraph/tensorrt/onnx_to_tensorrt.cc |   35 +-
 src/operator/subgraph/tensorrt/onnx_to_tensorrt.h  |   66 +-
 src/operator/subgraph/tensorrt/tensorrt-inl.h      |   27 +-
 src/operator/subgraph/tensorrt/tensorrt.cc         |    4 +-
 src/operator/swapaxis-inl.h                        |   42 +-
 src/operator/swapaxis.cc                           |    2 +-
 src/operator/tensor/broadcast_reduce_op.h          |  183 +-
 src/operator/tensor/broadcast_reduce_op_index.cc   |    1 +
 src/operator/tensor/dot.cc                         |    1 +
 src/operator/tensor/elemwise_binary_broadcast_op.h |    1 +
 .../tensor/elemwise_binary_broadcast_op_logic.cc   |    6 +
 src/operator/tensor/elemwise_binary_op.h           |   16 +-
 .../tensor/elemwise_binary_scalar_op_extended.cc   |    3 +-
 .../tensor/elemwise_binary_scalar_op_logic.cc      |    6 +
 src/operator/tensor/elemwise_unary_op.h            |    6 +-
 src/operator/tensor/elemwise_unary_op_basic.cc     |    9 +-
 src/operator/tensor/elemwise_unary_op_trig.cc      |   57 +-
 src/operator/tensor/indexing_op.cc                 |    2 +
 src/operator/tensor/init_op.cc                     |    4 +-
 src/operator/tensor/la_op-inl.h                    |  136 +-
 src/operator/tensor/la_op.cc                       |  166 +-
 src/operator/tensor/la_op.cu                       |   12 +
 src/operator/tensor/la_op.h                        |  180 ++
 src/operator/tensor/matrix_op-inl.h                |  171 +-
 src/operator/tensor/matrix_op.cc                   |   15 +-
 src/operator/tensor/ordering_op.cc                 |    1 +
 src/operator/tensor/slice-inl.h                    |    6 +-
 src/profiler/profiler.h                            |   15 +-
 tests/cpp/engine/threaded_engine_test.cc           |  117 +-
 tests/nightly/test_large_array.py                  |  195 +-
 .../test_large_vector.py}                          |   69 +-
 tests/python/gpu/test_operator_gpu.py              |    3 +
 tests/python/mkl/test_quantization_mkldnn.py       |    5 +-
 tests/python/mkl/test_subgraph.py                  |   11 +
 tests/python/quantization/test_quantization.py     |  136 ++
 tests/python/tensorrt/test_tensorrt_deconv.py      |   63 +
 tests/python/unittest/test_dgl_graph.py            |    2 +-
 tests/python/unittest/test_gluon.py                |   24 +-
 tests/python/unittest/test_gluon_estimator.py      |    7 +-
 tests/python/unittest/test_higher_order_grad.py    |   40 +-
 tests/python/unittest/test_library_loading.py      |   48 +
 tests/python/unittest/test_metric.py               |   18 +-
 tests/python/unittest/test_ndarray.py              |  131 +-
 tests/python/unittest/test_numpy_ndarray.py        |  672 ++++++
 tests/python/unittest/test_numpy_op.py             |  834 +++++++
 tests/python/unittest/test_operator.py             |  114 +-
 tests/python/unittest/test_profiler.py             |   87 +-
 tests/python/unittest/test_runtime.py              |    9 +
 tests/python/unittest/test_symbol.py               |   27 +-
 tests/python/unittest/test_thread_local.py         |   36 +
 tests/tutorials/test_tutorials.py                  |    6 +
 tools/dependencies/README.md                       |  298 +++
 372 files changed, 38271 insertions(+), 2805 deletions(-)
 delete mode 160000 3rdparty/mshadow
 create mode 100644 3rdparty/mshadow/.gitignore
 create mode 100644 3rdparty/mshadow/.travis.yml
 create mode 100644 3rdparty/mshadow/CHANGES.md
 create mode 100644 3rdparty/mshadow/CMakeLists.txt
 create mode 100644 3rdparty/mshadow/LICENSE
 create mode 100644 3rdparty/mshadow/README.md
 create mode 100644 3rdparty/mshadow/cmake/Cuda.cmake
 create mode 100644 3rdparty/mshadow/cmake/Utils.cmake
 create mode 100644 3rdparty/mshadow/cmake/mshadow.cmake
 create mode 100644 3rdparty/mshadow/cmake/mshadowUtils.cmake
 create mode 100644 3rdparty/mshadow/doc/Doxyfile
 create mode 100644 3rdparty/mshadow/doc/README.md
 create mode 100755 3rdparty/mshadow/doc/mkdoc.sh
 create mode 100644 3rdparty/mshadow/guide/.gitignore
 create mode 100644 3rdparty/mshadow/guide/Makefile
 create mode 100644 3rdparty/mshadow/guide/README.md
 create mode 100644 3rdparty/mshadow/guide/basic.cpp
 create mode 100644 3rdparty/mshadow/guide/basic_stream.cu
 create mode 100644 3rdparty/mshadow/guide/config.mk
 create mode 100644 3rdparty/mshadow/guide/defop.cpp
 create mode 100644 3rdparty/mshadow/guide/exp-template/.gitignore
 create mode 100644 3rdparty/mshadow/guide/exp-template/Makefile
 create mode 100644 3rdparty/mshadow/guide/exp-template/README.md
 create mode 100644 3rdparty/mshadow/guide/exp-template/exp_lazy.cpp
 create mode 100644 3rdparty/mshadow/guide/exp-template/exp_template.cpp
 create mode 100644 3rdparty/mshadow/guide/exp-template/exp_template_op.cpp
 create mode 100644 3rdparty/mshadow/guide/mshadow-ps/.gitignore
 create mode 100644 3rdparty/mshadow/guide/mshadow-ps/2-levels.png
 create mode 100644 3rdparty/mshadow/guide/mshadow-ps/Makefile
 create mode 100644 3rdparty/mshadow/guide/mshadow-ps/README.md
 create mode 100644 3rdparty/mshadow/guide/mshadow-ps/config.mk
 create mode 100644 3rdparty/mshadow/guide/mshadow-ps/dbstr.h
 create mode 100644 3rdparty/mshadow/guide/mshadow-ps/dist_async_sum-inl.h
 create mode 100644 3rdparty/mshadow/guide/mshadow-ps/dist_async_sum.cpp
 create mode 100755 3rdparty/mshadow/guide/mshadow-ps/local.sh
 create mode 100644 3rdparty/mshadow/guide/mshadow-ps/local_sum-inl.h
 create mode 100644 3rdparty/mshadow/guide/mshadow-ps/local_sum.cpp
 create mode 100644 3rdparty/mshadow/guide/mshadow-ps/local_sum.cu
 create mode 100644 3rdparty/mshadow/guide/neuralnet/Makefile
 create mode 100644 3rdparty/mshadow/guide/neuralnet/README.md
 create mode 100644 3rdparty/mshadow/guide/neuralnet/config.mk
 create mode 100644 3rdparty/mshadow/guide/neuralnet/convnet.cu
 create mode 100644 3rdparty/mshadow/guide/neuralnet/nnet.cu
 create mode 100644 3rdparty/mshadow/guide/neuralnet/nnet_ps.cu
 create mode 100644 3rdparty/mshadow/guide/neuralnet/util.h
 create mode 100644 3rdparty/mshadow/make/README.md
 create mode 100644 3rdparty/mshadow/make/mshadow.mk
 create mode 100644 3rdparty/mshadow/mshadow-ps/.gitignore
 create mode 100644 3rdparty/mshadow/mshadow-ps/README.md
 create mode 100644 3rdparty/mshadow/mshadow-ps/mshadow_ps.h
 create mode 100644 3rdparty/mshadow/mshadow-ps/ps_dist-inl.h
 create mode 100644 3rdparty/mshadow/mshadow-ps/ps_local-inl.h
 create mode 100644 3rdparty/mshadow/mshadow-ps/ps_rabit-inl.h
 create mode 100644 3rdparty/mshadow/mshadow-ps/thread.h
 create mode 100644 3rdparty/mshadow/mshadow-ps/thread_util.h
 create mode 100644 3rdparty/mshadow/mshadow/README.md
 create mode 100755 3rdparty/mshadow/mshadow/base.h
 create mode 100644 3rdparty/mshadow/mshadow/cuda/reduce.cuh
 create mode 100755 3rdparty/mshadow/mshadow/cuda/tensor_gpu-inl.cuh
 create mode 100644 3rdparty/mshadow/mshadow/dot_engine-inl.h
 create mode 100644 3rdparty/mshadow/mshadow/expr_engine-inl.h
 create mode 100644 3rdparty/mshadow/mshadow/expr_scalar-inl.h
 create mode 100644 3rdparty/mshadow/mshadow/expression.h
 create mode 100644 3rdparty/mshadow/mshadow/extension.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/broadcast.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/broadcast_with_axis.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/channel_pool.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/channel_unpool.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/choose.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/complex.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/concat.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/crop.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/fill.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/flip.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/implicit_gemm.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/mask.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/mirror.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/one_hot.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/pack_col2patch.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/pad.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/range.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/reduce_with_axis.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/reduceto1d.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/reshape.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/slice.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/slice_ex.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/spatial_pool.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/spatial_unpool.h
 create mode 100644 
3rdparty/mshadow/mshadow/extension/spatial_upsampling_nearest.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/swapaxis.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/take.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/take_grad.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/transpose.h
 create mode 100644 3rdparty/mshadow/mshadow/extension/unpack_patch2col.h
 create mode 100644 3rdparty/mshadow/mshadow/half.h
 create mode 100755 3rdparty/mshadow/mshadow/half2.h
 create mode 100644 3rdparty/mshadow/mshadow/io.h
 create mode 100644 3rdparty/mshadow/mshadow/logging.h
 create mode 100644 3rdparty/mshadow/mshadow/packet-inl.h
 create mode 100644 3rdparty/mshadow/mshadow/packet/plain-inl.h
 create mode 100644 3rdparty/mshadow/mshadow/packet/sse-inl.h
 create mode 100644 3rdparty/mshadow/mshadow/random.h
 create mode 100644 3rdparty/mshadow/mshadow/stream_gpu-inl.h
 create mode 100755 3rdparty/mshadow/mshadow/tensor.h
 create mode 100644 3rdparty/mshadow/mshadow/tensor_container.h
 create mode 100755 3rdparty/mshadow/mshadow/tensor_cpu-inl.h
 create mode 100755 3rdparty/mshadow/mshadow/tensor_gpu-inl.h
 create mode 100755 3rdparty/mshadow/scripts/travis_script.sh
 create mode 100644 3rdparty/mshadow/test/Makefile
 create mode 100644 3rdparty/mshadow/test/pairtest.cu
 create mode 100644 3rdparty/mshadow/test/pool.cu
 create mode 100644 3rdparty/mshadow/test/reshape.cu
 create mode 100644 3rdparty/mshadow/test/test.cu
 create mode 100644 3rdparty/mshadow/test/test.h
 create mode 100644 3rdparty/mshadow/test/unpack.cu
 delete mode 100644 
contrib/clojure-package/examples/infer/objectdetector/images/marcel.jpg
 copy python/mxnet/gluon/contrib/__init__.py => 
contrib/clojure-package/scripts/get_test_images.sh (66%)
 mode change 100644 => 100755
 delete mode 100644 contrib/clojure-package/test/test-images/Pug-Cookie.jpg
 delete mode 100644 contrib/clojure-package/test/test-images/kitten.jpg
 create mode 100644 contrib/clojure-package/test/test_helper.clj
 create mode 100644 docs/tutorials/gluon/fit_api_tutorial.md
 create mode 100644 docs/tutorials/mkldnn/mkldnn_quantization.md
 copy python/mxnet/gluon/contrib/__init__.py => example/lib_api/Makefile (77%)
 create mode 100644 example/lib_api/libtest.cc
 create mode 100644 example/lib_api/mylib.cc
 copy python/mxnet/gluon/contrib/__init__.py => example/lib_api/test.py (74%)
 create mode 100644 include/mxnet/lib_api.h
 create mode 100644 python/mxnet/_numpy_op_doc.py
 create mode 100644 python/mxnet/library.py
 copy python/mxnet/{gluon/contrib => ndarray/numpy}/__init__.py (78%)
 copy python/mxnet/{gluon/contrib/estimator/__init__.py => 
ndarray/numpy/_internal.py} (87%)
 create mode 100644 python/mxnet/ndarray/numpy/_op.py
 copy python/mxnet/{gluon/contrib/__init__.py => ndarray/numpy/_register.py} 
(64%)
 copy python/mxnet/{gluon/contrib/estimator/__init__.py => 
ndarray/numpy/linalg.py} (86%)
 copy python/mxnet/{gluon/contrib/estimator/__init__.py => 
ndarray/numpy/random.py} (86%)
 copy python/mxnet/{gluon/contrib => ndarray/numpy_extension}/__init__.py (79%)
 copy python/mxnet/{gluon/contrib/__init__.py => 
ndarray/numpy_extension/_op.py} (85%)
 copy python/mxnet/{gluon/contrib/__init__.py => 
ndarray/numpy_extension/_register.py} (73%)
 copy python/mxnet/{gluon/contrib/estimator/__init__.py => 
ndarray/numpy_extension/image.py} (87%)
 copy python/mxnet/{gluon/contrib => numpy}/__init__.py (56%)
 copy python/mxnet/{gluon/contrib/estimator/__init__.py => numpy/_op.py} (87%)
 copy python/mxnet/{gluon/contrib/__init__.py => numpy/_register.py} (69%)
 create mode 100644 python/mxnet/numpy/arrayprint.py
 create mode 100644 python/mxnet/numpy/function_base.py
 copy tests/python/unittest/test_runtime.py => python/mxnet/numpy/io.py (50%)
 copy python/mxnet/{gluon/contrib/estimator/__init__.py => numpy/linalg.py} 
(87%)
 create mode 100644 python/mxnet/numpy/multiarray.py
 copy python/mxnet/{gluon/contrib/estimator/__init__.py => numpy/random.py} 
(87%)
 create mode 100644 python/mxnet/numpy/stride_tricks.py
 copy python/mxnet/{gluon/contrib/__init__.py => numpy/utils.py} (70%)
 copy python/mxnet/{gluon/contrib => numpy_extension}/__init__.py (61%)
 copy python/mxnet/{gluon/contrib/estimator/__init__.py => 
numpy_extension/_op.py} (87%)
 copy python/mxnet/{gluon/contrib/__init__.py => numpy_extension/_register.py} 
(68%)
 copy python/mxnet/{gluon/contrib/__init__.py => numpy_extension/image.py} (85%)
 create mode 100644 python/mxnet/numpy_extension/utils.py
 copy python/mxnet/{gluon/contrib => symbol/numpy}/__init__.py (70%)
 copy python/mxnet/{gluon/contrib/estimator/__init__.py => 
symbol/numpy/_internal.py} (87%)
 copy python/mxnet/{gluon/contrib/estimator/__init__.py => symbol/numpy/_op.py} 
(87%)
 copy python/mxnet/{gluon/contrib/__init__.py => symbol/numpy/_register.py} 
(65%)
 create mode 100644 python/mxnet/symbol/numpy/_symbol.py
 copy python/mxnet/{gluon/contrib/estimator/__init__.py => 
symbol/numpy/linalg.py} (86%)
 copy python/mxnet/{gluon/contrib/estimator/__init__.py => 
symbol/numpy/random.py} (86%)
 copy python/mxnet/{gluon/contrib => symbol/numpy_extension}/__init__.py (79%)
 copy python/mxnet/{gluon/contrib/__init__.py => symbol/numpy_extension/_op.py} 
(85%)
 copy python/mxnet/{gluon/contrib/__init__.py => 
symbol/numpy_extension/_register.py} (73%)
 copy python/mxnet/{gluon/contrib/estimator/__init__.py => 
symbol/numpy_extension/image.py} (87%)
 create mode 100644 src/initialize.h
 delete mode 100644 src/operator/nn/cudnn/cudnn_batch_norm.cu
 create mode 100644 src/operator/numpy/np_broadcast_reduce_op.h
 create mode 100644 src/operator/numpy/np_broadcast_reduce_op_value.cc
 create mode 100644 src/operator/numpy/np_broadcast_reduce_op_value.cu
 create mode 100644 src/operator/numpy/np_dot-inl.h
 create mode 100644 src/operator/numpy/np_dot.cc
 create mode 100644 src/operator/numpy/np_dot.cu
 create mode 100644 src/operator/numpy/np_elemwise_broadcast_op.cc
 create mode 100644 src/operator/numpy/np_elemwise_broadcast_op.cu
 create mode 100644 src/operator/numpy/np_init_op.cc
 create mode 100644 src/operator/numpy/np_init_op.cu
 create mode 100644 src/operator/numpy/np_matrix_op-inl.h
 create mode 100644 src/operator/numpy/np_matrix_op.cc
 create mode 100644 src/operator/numpy/np_matrix_op.cu
 create mode 100644 src/operator/numpy/np_tensordot_op-inl.h
 create mode 100644 src/operator/numpy/np_tensordot_op.cc
 create mode 100644 src/operator/numpy/np_tensordot_op.cu
 create mode 100644 src/operator/numpy/np_true_divide.cc
 create mode 100644 src/operator/numpy/np_true_divide.cu
 create mode 100644 
src/operator/quantization/mkldnn/mkldnn_quantized_batch_norm.cc
 create mode 100644 src/operator/quantization/quantized_batch_norm.cc
 copy tests/{python/mkl/test_quantization_mkldnn.py => 
nightly/test_large_vector.py} (67%)
 create mode 100644 tests/python/tensorrt/test_tensorrt_deconv.py
 create mode 100644 tests/python/unittest/test_library_loading.py
 create mode 100644 tests/python/unittest/test_numpy_ndarray.py
 create mode 100644 tests/python/unittest/test_numpy_op.py

Reply via email to