wrongtest opened a new pull request #6062:
URL: https://github.com/apache/incubator-tvm/pull/6062


   Hi there, this PR makes a minor modification to the CombineParallelDense pass; 
see https://discuss.tvm.ai/t/yet-another-dense-op-combine-strategy/7126 for background. 
The changes are:
   
   -  Add an option `to_batch` (default True) that controls whether dense ops 
are combined into a `batch_matmul` or into a single `dense`. 
   
   - Add an implementation that combines dense ops into one large `dense` instead 
of a `batch_matmul`, following almost the same logic as the 
`CombineParallelConv2D` pass.
   
   - Add test cases for combining dense ops followed by element-wise ops of 
various shapes.
   
   - The new strategy can combine even ops with different output dims, and may 
perform better in circumstances where a flat matmul operation is faster 
than the equivalent `batch_matmul` operation.
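
   To illustrate the difference between the two strategies listed above, here is 
a minimal NumPy sketch. It is not the pass implementation: the helper `dense`, 
the shapes, and the variable names are assumptions made for illustration, and 
the only convention taken from TVM is that a dense weight has shape 
`(units, in_dim)` so the output is `x @ w.T`.

```python
import numpy as np

def dense(x, w):
    # Hypothetical helper mimicking the dense convention:
    # weight is (units, in_dim), output is (batch, units).
    return x @ w.T

rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))      # shared input of the parallel branches
w1 = rng.standard_normal((16, 8))
w2 = rng.standard_normal((16, 8))
w3 = rng.standard_normal((32, 8))    # different output dim from w1 and w2
b1, b2, b3 = (rng.standard_normal(w.shape[0]) for w in (w1, w2, w3))

# Reference: each branch computed separately, followed by a bias add.
ref = [dense(x, w) + b for w, b in ((w1, b1), (w2, b2), (w3, b3))]

# Batched strategy: stack the branches along a new leading axis.
# This only works when all weights share one shape, so w3 is left out.
xs = np.stack([x, x])                             # (2, 4, 8)
ws = np.stack([w1, w2])                           # (2, 16, 8)
bs = np.stack([b1, b2])[:, None, :]               # (2, 1, 16)
batched = np.matmul(xs, ws.transpose(0, 2, 1)) + bs   # (2, 4, 16)

# Flat strategy: concatenate weights (and biases) along the output axis,
# run one large dense, then slice the result back into branches.
w_cat = np.concatenate([w1, w2, w3], axis=0)      # (64, 8)
b_cat = np.concatenate([b1, b2, b3])              # (64,)
flat = dense(x, w_cat) + b_cat                    # (4, 64)
y1, y2, y3 = np.split(flat, [16, 32], axis=1)
```

   In this sketch the flat form handles the third branch even though its output 
dim differs, while the batched form needs every branch to have the same shape. 
Which form is faster depends on the backend and would have to be measured.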
   



