jcf94 commented on pull request #6275:
URL: https://github.com/apache/incubator-tvm/pull/6275#issuecomment-674629884


   > Thanks @jcf94 please look into the CI issue. Also please do a benchmark to 
see if the new implementation will/will not affect perf, since it is only fine 
for larger functions and we might still need pool for very fine grained 
parallelism
   
   Had some simple benchmark on the two implementations, the current one even 
works better in large loop size(since each threads' workload is pre-defined by 
the partitioner).
   Currently Ansor's requirement on the `parallel_for` is just used on some 
thread independent tasks, this simple implementation is enough to work well. 
I'm not sure if some tasks with fine grained parallelism will benefit on a pool 
implementation.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to