wrongtest opened a new pull request #6062: URL: https://github.com/apache/incubator-tvm/pull/6062
Hi there, this PR is a minor modification to CombineParallelDense pass, refer to https://discuss.tvm.ai/t/yet-another-dense-op-combine-strategy/7126. The changes are: - Add option "to_batch" (default to True) to control whether combine dense ops into `batch_matmul` or `dense`. - Add implementation to combine dense ops into one large `dense` instead of `batch_matmul`, which take almost same logic with that of `CombineParallelConv2D` pass. - Test cases for combine various shapes of elem-wise op followed. - The new strategy can combine even ops of different output dims and may take better performance in circumstances where flat matmul operation is faster than equivalent batch_matmul operation. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
