jcf94 commented on pull request #6275: URL: https://github.com/apache/incubator-tvm/pull/6275#issuecomment-674629884
> Thanks @jcf94 please look into the CI issue. Also please do a benchmark to see if the new implementation will/will not affect perf, since it is only fine for larger functions and we might still need pool for very fine grained parallelism Had some simple benchmark on the two implementations, the current one even works better in large loop size(since each threads' workload is pre-defined by the partitioner). Currently Ansor's requirement on the `parallel_for` is just used on some thread independent tasks, this simple implementation is enough to work well. I'm not sure if some tasks with fine grained parallelism will benefit on a pool implementation. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
