ArmageddonKnight commented on pull request #8678:
URL: https://github.com/apache/tvm/pull/8678#issuecomment-894610419


   @junrushao1994 Thanks for letting me know. I looked into this issue. The 
problem is caused by `threadIdx.x` being assigned to `iv->thread_tag` rather 
than `iv->var->name_hint`, so the extractor was unable to correctly extract 
the number of threads per block. To address this, I extended the extractor to 
check `thread_tag` as well. In addition, if the number of threads per block 
is extracted as 1, `__launch_bounds__` will NOT be printed. 
   
   I ran the test case again and it now passes locally.
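
For readers following along, the rule described above can be sketched roughly as below. This is a simplified, self-contained illustration, not the actual code in the PR: `IterVarLike`, `IsThreadIdx`, `ThreadsPerBlock`, and `KernelSignature` are hypothetical names, and the struct only mimics the two fields mentioned above (`iv->var->name_hint` and `iv->thread_tag`) plus an assumed constant extent.

```cpp
#include <cassert>
#include <cstdint>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical stand-in for an iteration variable. It only models the
// two places where "threadIdx.x" may be recorded, plus a constant extent.
struct IterVarLike {
  std::string name_hint;   // stand-in for iv->var->name_hint
  std::string thread_tag;  // stand-in for iv->thread_tag
  int64_t extent;          // assumed to be a known constant in this sketch
};

// True if the string names one of the CUDA thread-index dimensions.
bool IsThreadIdx(const std::string& s) {
  return s == "threadIdx.x" || s == "threadIdx.y" || s == "threadIdx.z";
}

// Multiply the extents of all thread-index axes. An axis counts if either
// its name hint or its thread tag identifies it, and it is counted once.
int64_t ThreadsPerBlock(const std::vector<IterVarLike>& ivs) {
  int64_t threads = 1;
  for (const auto& iv : ivs) {
    if (IsThreadIdx(iv.name_hint) || IsThreadIdx(iv.thread_tag)) {
      threads *= iv.extent;
    }
  }
  return threads;
}

// Build a simplified kernel signature (parameter list omitted).
// __launch_bounds__ is skipped when the extracted thread count is 1.
std::string KernelSignature(const std::string& name,
                            const std::vector<IterVarLike>& ivs) {
  std::ostringstream os;
  os << "extern \"C\" __global__ void ";
  const int64_t threads = ThreadsPerBlock(ivs);
  if (threads > 1) {
    os << "__launch_bounds__(" << threads << ") ";
  }
  os << name << "()";
  return os.str();
}
```

In this sketch, an axis whose `thread_tag` is `threadIdx.x` with extent 128 contributes to the thread count even if its `name_hint` is something else, so the signature carries `__launch_bounds__(128)`. If the extracted count is 1, the attribute is left out.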



