mbs-octoml opened a new pull request, #11923: URL: https://github.com/apache/tvm/pull/11923
The TensorRT BYOC integration needs to 'undo' partitionings in some situations. Add an InlineCompilerFunctions pass to make that robust. In particular, it must undo both the 'partitioning' (ie separating out the "Compiler" function) and any 'compositing' (ie separating out small sub-graphs as "Composite" functions). Fix misspelled nn.bias_add while there. Note that the current implementation is broken but untested in CI. I have all the tests fixed in a follow-up PR. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
