[incubator-mxnet] branch benchmark updated (c0560fc -> 579b9dd)

2019-11-06 Thread apeforest
This is an automated email from the ASF dual-hosted git repository.

apeforest pushed a change to branch benchmark
in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git.


from c0560fc  add log message and TODO
 add 77beeb6  add cutlass as 3rdparty dependency
 add 6728de8  add cutlass to compilation flags
 add 543098b  remove all cutlass stuff
 add 9b5ee9a  add better error message and description and remove cutlass 
from compilation flags
 add 7ef32ee  change credit for the approach since the code have changed
 add 3115fb1  fix typos
 add 67f5aa9  correct another typo
 add 0485ed2  Add all the cuda/cublas helper functions
 add 7f5194d  remove tests using kAddTo
 add d4ffa4e  only use cublasStridedBatchedGemm if CUDA >= 9.1
 add e88fa4b  add equivalent mxnet code in description of mha ops
 add e76c38a  remove a wrong copy-paste
 add 0b25025  add _contrib for namespace and add GPU only on description
 add f8bd1cb  add warning in bwd_ignore_zero_init description, also test 
with fp32
 add 68ded77  add error return if bwd_ignore_zero_init is used without 
MXNET_EXEC_ENABLE_ADDTO
 add a93ad94  remove std::move for clang
 add 95cb2fd  remove bwd_ignore_zero_init flag
 add cec1ab2  remove bwd_ignore_zero_init in test_operator_gpu.py
 add 344f3fd  fix typo
 add 579b9dd  fix another typo

No new revisions were added by this update.

Summary of changes:
 src/common/cuda_utils.h|  74 +
 src/operator/contrib/transformer-inl.h |   9 +
 src/operator/contrib/transformer.cc| 270 
 src/operator/contrib/transformer.cu| 554 +
 tests/python/gpu/test_operator_gpu.py  | 314 +++
 5 files changed, 1221 insertions(+)



[incubator-mxnet] branch benchmark updated (c0560fc -> 579b9dd)

2019-11-06 Thread apeforest
This is an automated email from the ASF dual-hosted git repository.

apeforest pushed a change to branch benchmark
in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git.


from c0560fc  add log message and TODO
 add 77beeb6  add cutlass as 3rdparty dependency
 add 6728de8  add cutlass to compilation flags
 add 543098b  remove all cutlass stuff
 add 9b5ee9a  add better error message and description and remove cutlass 
from compilation flags
 add 7ef32ee  change credit for the approach since the code have changed
 add 3115fb1  fix typos
 add 67f5aa9  correct another typo
 add 0485ed2  Add all the cuda/cublas helper functions
 add 7f5194d  remove tests using kAddTo
 add d4ffa4e  only use cublasStridedBatchedGemm if CUDA >= 9.1
 add e88fa4b  add equivalent mxnet code in description of mha ops
 add e76c38a  remove a wrong copy-paste
 add 0b25025  add _contrib for namespace and add GPU only on description
 add f8bd1cb  add warning in bwd_ignore_zero_init description, also test 
with fp32
 add 68ded77  add error return if bwd_ignore_zero_init is used without 
MXNET_EXEC_ENABLE_ADDTO
 add a93ad94  remove std::move for clang
 add 95cb2fd  remove bwd_ignore_zero_init flag
 add cec1ab2  remove bwd_ignore_zero_init in test_operator_gpu.py
 add 344f3fd  fix typo
 add 579b9dd  fix another typo

No new revisions were added by this update.

Summary of changes:
 src/common/cuda_utils.h|  74 +
 src/operator/contrib/transformer-inl.h |   9 +
 src/operator/contrib/transformer.cc| 270 
 src/operator/contrib/transformer.cu| 554 +
 tests/python/gpu/test_operator_gpu.py  | 314 +++
 5 files changed, 1221 insertions(+)



[incubator-mxnet] branch benchmark updated (c0560fc -> 579b9dd)

2019-11-06 Thread apeforest
This is an automated email from the ASF dual-hosted git repository.

apeforest pushed a change to branch benchmark
in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git.


from c0560fc  add log message and TODO
 add 77beeb6  add cutlass as 3rdparty dependency
 add 6728de8  add cutlass to compilation flags
 add 543098b  remove all cutlass stuff
 add 9b5ee9a  add better error message and description and remove cutlass 
from compilation flags
 add 7ef32ee  change credit for the approach since the code have changed
 add 3115fb1  fix typos
 add 67f5aa9  correct another typo
 add 0485ed2  Add all the cuda/cublas helper functions
 add 7f5194d  remove tests using kAddTo
 add d4ffa4e  only use cublasStridedBatchedGemm if CUDA >= 9.1
 add e88fa4b  add equivalent mxnet code in description of mha ops
 add e76c38a  remove a wrong copy-paste
 add 0b25025  add _contrib for namespace and add GPU only on description
 add f8bd1cb  add warning in bwd_ignore_zero_init description, also test 
with fp32
 add 68ded77  add error return if bwd_ignore_zero_init is used without 
MXNET_EXEC_ENABLE_ADDTO
 add a93ad94  remove std::move for clang
 add 95cb2fd  remove bwd_ignore_zero_init flag
 add cec1ab2  remove bwd_ignore_zero_init in test_operator_gpu.py
 add 344f3fd  fix typo
 add 579b9dd  fix another typo

No new revisions were added by this update.

Summary of changes:
 src/common/cuda_utils.h|  74 +
 src/operator/contrib/transformer-inl.h |   9 +
 src/operator/contrib/transformer.cc| 270 
 src/operator/contrib/transformer.cu| 554 +
 tests/python/gpu/test_operator_gpu.py  | 314 +++
 5 files changed, 1221 insertions(+)



[incubator-mxnet] branch benchmark updated (c0560fc -> 579b9dd)

2019-11-06 Thread apeforest
This is an automated email from the ASF dual-hosted git repository.

apeforest pushed a change to branch benchmark
in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git.


from c0560fc  add log message and TODO
 add 77beeb6  add cutlass as 3rdparty dependency
 add 6728de8  add cutlass to compilation flags
 add 543098b  remove all cutlass stuff
 add 9b5ee9a  add better error message and description and remove cutlass 
from compilation flags
 add 7ef32ee  change credit for the approach since the code have changed
 add 3115fb1  fix typos
 add 67f5aa9  correct another typo
 add 0485ed2  Add all the cuda/cublas helper functions
 add 7f5194d  remove tests using kAddTo
 add d4ffa4e  only use cublasStridedBatchedGemm if CUDA >= 9.1
 add e88fa4b  add equivalent mxnet code in description of mha ops
 add e76c38a  remove a wrong copy-paste
 add 0b25025  add _contrib for namespace and add GPU only on description
 add f8bd1cb  add warning in bwd_ignore_zero_init description, also test 
with fp32
 add 68ded77  add error return if bwd_ignore_zero_init is used without 
MXNET_EXEC_ENABLE_ADDTO
 add a93ad94  remove std::move for clang
 add 95cb2fd  remove bwd_ignore_zero_init flag
 add cec1ab2  remove bwd_ignore_zero_init in test_operator_gpu.py
 add 344f3fd  fix typo
 add 579b9dd  fix another typo

No new revisions were added by this update.

Summary of changes:
 src/common/cuda_utils.h|  74 +
 src/operator/contrib/transformer-inl.h |   9 +
 src/operator/contrib/transformer.cc| 270 
 src/operator/contrib/transformer.cu| 554 +
 tests/python/gpu/test_operator_gpu.py  | 314 +++
 5 files changed, 1221 insertions(+)



[incubator-mxnet] branch benchmark updated (c0560fc -> 579b9dd)

2019-11-06 Thread apeforest
This is an automated email from the ASF dual-hosted git repository.

apeforest pushed a change to branch benchmark
in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git.


from c0560fc  add log message and TODO
 add 77beeb6  add cutlass as 3rdparty dependency
 add 6728de8  add cutlass to compilation flags
 add 543098b  remove all cutlass stuff
 add 9b5ee9a  add better error message and description and remove cutlass 
from compilation flags
 add 7ef32ee  change credit for the approach since the code have changed
 add 3115fb1  fix typos
 add 67f5aa9  correct another typo
 add 0485ed2  Add all the cuda/cublas helper functions
 add 7f5194d  remove tests using kAddTo
 add d4ffa4e  only use cublasStridedBatchedGemm if CUDA >= 9.1
 add e88fa4b  add equivalent mxnet code in description of mha ops
 add e76c38a  remove a wrong copy-paste
 add 0b25025  add _contrib for namespace and add GPU only on description
 add f8bd1cb  add warning in bwd_ignore_zero_init description, also test 
with fp32
 add 68ded77  add error return if bwd_ignore_zero_init is used without 
MXNET_EXEC_ENABLE_ADDTO
 add a93ad94  remove std::move for clang
 add 95cb2fd  remove bwd_ignore_zero_init flag
 add cec1ab2  remove bwd_ignore_zero_init in test_operator_gpu.py
 add 344f3fd  fix typo
 add 579b9dd  fix another typo

No new revisions were added by this update.

Summary of changes:
 src/common/cuda_utils.h|  74 +
 src/operator/contrib/transformer-inl.h |   9 +
 src/operator/contrib/transformer.cc| 270 
 src/operator/contrib/transformer.cu| 554 +
 tests/python/gpu/test_operator_gpu.py  | 314 +++
 5 files changed, 1221 insertions(+)