FrozenGene opened a new pull request #4747: [ThreadPool] Solve ARM BIG.LITTLE 
heterogeneous multicores
URL: https://github.com/apache/incubator-tvm/pull/4747
 
 
   Follow up #4344 . Previous PR solves OpenCV + TVM slow performance and 
AutoTVM Python CPU affinity problem. However, previous PR has two problem:
   
   - doesn't solve ARM  BIG.LITTLE heterogeneous multicores.
      For example, we have 2xA72 + 4xA53, previous pr will add all cpu to CPU 
SET even we call `config_thread_pool` to restrict TVM runs on 4 little cores. 
So, TVM_MASTER_THREAD could run A72 big core. This is not we want.
      Solution: we should restrict master in the little cpus or big cpus 
according to users's setting.
   - Not all Linux variant OS has `pthread_atfork`. For example, Alibaba's 
AliOS doesn't have this api.
      Solution: unified android and linux's way. We will call 
`SetFullCpuAffinity` if we don't have `TVM_BIND_MASTER_THREAD`.
   
   @tqchen @yidawang @vinx13 @ajtulloch Could you help to review it? Thanks.
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to