qingchanghan opened a new pull request, #13879:
URL: https://github.com/apache/tvm/pull/13879

   **Description**
   
   This PR adds a test script to combine CUTLASS and Ansor, which can use 
TensorCore kernels by CUTLASS and keep op fusion and automatic tuning of Ansor.
   
   **Modifications**
   
   1. Add a test script to show how to combine CUTLASS and Ansor.
   2. Add the `other_targets` parameter for Ansor's `extract_tasks` function, 
to pass cutlass target to `call_all_topi_funcs`.
   
   **Performance**
   
   * Bert-large
   * A10, CUDA 11.8
   * FP16
   * Input shape: (8, 128)
   
   Latency(ms)
   
   | Ansor (n=3000)   | CUTLASS+TOPI | CUTLASS+Ansor (n=3000) | Meta-Schedule 
(n=3000) |
   | ------- | ------------ | ------------- | ------------- |
   | 55.8870 | 20.2297      | 17.2543       | 19.2774       |


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to