ArmageddonKnight commented on pull request #8678: URL: https://github.com/apache/tvm/pull/8678#issuecomment-894610419
@junrushao1994 Thanks for letting me know. I had a look into this issue. The problem is caused by assigning `threadIdx.x` to `iv->thread_tag` rather than `iv->var->name_hint` (and therefore, the extractor is unable to correctly extract the number of threads per block). To address this issue, I extend the extractor to cover `thread_tag`'s as well. At the same time, if the number of threads per block is extracted as 1, then `__launch_bounds__` will NOT be printed. I ran the test case again and it works locally. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
