driazati opened a new pull request, #11463: URL: https://github.com/apache/tvm/pull/11463
More shard rebalancing, while using more GPU shards in experiments reduced CI runtime, in prod these ended up eating all the available GPU capacity and causing queuing delays such that overall runtime was basically the same, but with the added cost of the per-node setup * the extra nodes. Thanks for contributing to TVM! Please refer to guideline https://tvm.apache.org/docs/contribute/ for useful information and tips. After the pull request is submitted, please request code reviews from [Reviewers](https://github.com/apache/incubator-tvm/blob/master/CONTRIBUTORS.md#reviewers) by @ them in the pull request thread. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
