[PATCH] D60279: [CUDA] Implemented _[bi]mma* builtins.

2019-04-25 Thread Artem Belevich via Phabricator via cfe-commits
This revision was automatically updated to reflect the committed changes. Closed by commit rC359248: [CUDA] Implemented _[bi]mma* builtins. (authored by tra, committed by ). Herald added a subscriber: kristina. Herald added a project: clang. Changed prior to commit:

[PATCH] D60279: [CUDA] Implemented _[bi]mma* builtins.

2019-04-08 Thread Artem Belevich via Phabricator via cfe-commits
tra updated this revision to Diff 194226. tra added a comment. - Converted class to struct+function as Tim suggested. CHANGES SINCE LAST ACTION https://reviews.llvm.org/D60279/new/ https://reviews.llvm.org/D60279 Files: clang/include/clang/Basic/BuiltinsNVPTX.def

[PATCH] D60279: [CUDA] Implemented _[bi]mma* builtins.

2019-04-05 Thread Tim Shen via Phabricator via cfe-commits
timshen added inline comments. Comment at: clang/lib/CodeGen/CGBuiltin.cpp:12884 +// Helper classes for mapping MMA builtins to particular LLVM intrinsic variant. +class NVPTXMmaLdstInfo { +public: How about having a simple struct and a function? ``` struct

[PATCH] D60279: [CUDA] Implemented _[bi]mma* builtins.

2019-04-04 Thread Artem Belevich via Phabricator via cfe-commits
tra updated this revision to Diff 193809. tra added a comment. - Added PTX64 to the list of builtins' constraints. CHANGES SINCE LAST ACTION https://reviews.llvm.org/D60279/new/ https://reviews.llvm.org/D60279 Files: clang/include/clang/Basic/BuiltinsNVPTX.def

[PATCH] D60279: [CUDA] Implemented _[bi]mma* builtins.

2019-04-04 Thread Artem Belevich via Phabricator via cfe-commits
tra updated this revision to Diff 193796. tra added a comment. - Fixed minor issues with parameters of the new builtins: - __imma*_st_c_i32 builtins have 'const int * src' - __bmma_m8n8k128_mma_xor_popc_b1 does not have 'satf' argument. CHANGES SINCE LAST ACTION

[PATCH] D60279: [CUDA] Implemented _[bi]mma* builtins.

2019-04-04 Thread Artem Belevich via Phabricator via cfe-commits
tra updated this revision to Diff 193774. tra edited the summary of this revision. tra added a comment. Cleaned up mma test generation. CHANGES SINCE LAST ACTION https://reviews.llvm.org/D60279/new/ https://reviews.llvm.org/D60279 Files: clang/include/clang/Basic/BuiltinsNVPTX.def

[PATCH] D60279: [CUDA] Implemented _[bi]mma* builtins.

2019-04-04 Thread Artem Belevich via Phabricator via cfe-commits
tra created this revision. tra added reviewers: timshen, jlebar. Herald added subscribers: llvm-commits, bixia, hiraditya, jholewinski. Herald added a project: LLVM. These builtins provide access to the new integer and sub-integer variants of MMA (matrix multiply-accumulate) instructions provided