jcf94 edited a comment on pull request #6529:
URL: https://github.com/apache/incubator-tvm/pull/6529#issuecomment-697100009


   > We probably should only benchmark the sample initial population part and 
compare the actual speedup with idea speedup.
   > We then can know how effective the parallelization is.
   
   Good suggestion, from the tuning log I can see it reduced from an average of 
2~3 seconds:
   ```
   A CUDA test from the master branch
   ...
   Sample Initial Population       #s: 456 fail_ct: 2048   Time elapsed: 2.60
   ```
   to an average of 0.05~0.5 seconds:
   ```
   The same test from this branch(This even tried more populations)
   ...
   Sample Initial Population       #s: 648 fail_ct: 3448   Time elapsed: 0.17
   ```
   on my server. cc @FrozenGene 
   
   And I'm thinking about maybe the Evolutionary Search part can also benefit 
from parallel_for in the same way? @merrymercy @comaniac 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to