comaniac opened a new pull request #6584:
URL: https://github.com/apache/incubator-tvm/pull/6584


   As I'm trying to use Ansor to tune the operators generated from 
`te.gradient`, I fixed some issues in this PR so that now Ansor can tune the 
backward ops (regardless the performance). Detail change list:
   
   - [Multiple Output] Create a schedule with all outputs instead of the first 
output.
   - [Python API Printing] Change `tvm.thread_axis` to `te.thread_axis`.
   - [Python API Printing] Add prefix to all iterators to avoid name conflict; 
otherwise we may refer to the wrong iterator with the same name hint (e.g., 
`ax0`). For example:
   
   ```python
   ax0, ... = tuple(pad_temp_data_grad.op.axis) + ...
   ...
   ax0, ... = tuple(pad_temp_shared.op.axis) + ...
   s[pad_temp_data_grad].split(ax0, factor=4)
   ```
   where `ax0` in `split` should refer to the `ax0` from `pad_temp_data_grad`, 
but it has been overridden by `pad_temp_shared` after `cache_read`. This PR 
improves `CleanName` by providing an optional prefix so that we can 
differentiate those iterators by their stages.
   
   cc @merrymercy @jcf94 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to