DickJC123 opened a new pull request #20876: URL: https://github.com/apache/incubator-mxnet/pull/20876
## Description ## @josephevans is in the process of adding a g5 instance to the CI, for MXNet testing on A100. This PR will first enable the CI on the g5 instance, which will expose the need for some test tolerance adjustments, since A100 uses reduced-mantissa-width TF32 calculations by default on float32 datasets. I will then add the fixing commits to this PR to get a clean CI before merging. See the related PR https://github.com/apache/incubator-mxnet-ci/pull/43 [not yet merged] ## Checklist ## ### Essentials ### - [ X] PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc) - [ ] Changes are complete (i.e. I finished coding on this PR) - [ ] All changes have test coverage - [ ] Code is well-documented ### Changes ### - [ ] Feature1, tests, (and when applicable, API doc) - [ ] Feature2, tests, (and when applicable, API doc) ## Comments ## - If this change is a backward incompatible change, why must this change be made. - Interesting edge cases to note here -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
