psrivas2 commented on PR #14465: URL: https://github.com/apache/tvm/pull/14465#issuecomment-1494910975
> The original intention was to compile all generated files in parallel (via NVCC `-t` flag), but I forgot to actually do it. Have you tested that? I expect that would be faster than this solution.

Could you elaborate on what the `-t` flag does and how we would use it? The loop [here](https://github.com/apache/tvm/pull/14465/files#diff-9b184ba90f566eaeb8c34e4032b221378e91b04b382f44a6b74786ca09537044L258-L265) processes annotated functions sequentially, so I think we would still have to parallelize that.

I did parallelize this [loop](https://github.com/apache/tvm/pull/14465/files#diff-9b184ba90f566eaeb8c34e4032b221378e91b04b382f44a6b74786ca09537044L258-L265) to compile the generated C source modules in parallel, but that was not faster than compiling a single file. The difference between the two was not huge, but compiling a single source module was somewhat faster (~50 seconds for a single source module vs. ~70 seconds for multiple C source modules compiled in parallel).
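
For context, here is a minimal sketch of the general shape of compiling several generated source files in parallel. It is not the code from the PR: the `nvcc_command` and `compile_all` helpers, the file names, and the worker count are all hypothetical, and the compile step is passed in as a callable so the sketch does not depend on any particular toolchain call.

```python
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def nvcc_command(src, out_dir):
    """Build a command line that compiles one source file to an object file.

    Hypothetical helper for illustration; real builds would add include
    paths, architecture flags, and so on.
    """
    obj = str(Path(out_dir) / (Path(src).stem + ".o"))
    return ["nvcc", "-c", src, "-o", obj]


def compile_all(sources, compile_one, max_workers=4):
    """Apply compile_one to every source concurrently.

    Results are returned in the same order as the inputs. Threads are
    sufficient here because each task would mostly wait on an external
    compiler process.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(compile_one, sources))
```

A caller could pass something like `lambda src: subprocess.run(nvcc_command(src, "build"), check=True)` as `compile_one`. Whether this beats compiling one combined file depends on per-file overhead such as repeatedly parsing the same headers, which is consistent with the timings reported above.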
