mbrookhart opened a new pull request #6533:
URL: https://github.com/apache/incubator-tvm/pull/6533


   I was unable to get scatter working with te schedules on CUDA because of the 
two loops that update values in place, so I resorted to using ir_builder directly.
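
   For readers unfamiliar with the op, here is a minimal pure-Python sketch of 
1-D scatter written as two in-place loops. It is only an illustration of the 
semantics, not the kernel in this PR: the name `scatter_1d` is hypothetical, and 
it assumes non-negative, in-range indices and a single axis.

```python
def scatter_1d(data, indices, updates):
    """Illustrative 1-D scatter: start from a copy of data, then set
    out[indices[i]] = updates[i] for each i, in order."""
    n = len(data)
    out = [0] * n

    # Loop 1: copy the input into the output buffer.
    for i in range(n):
        out[i] = data[i]

    # Loop 2: overwrite selected slots of the same buffer in place.
    # With duplicate indices the last write wins, so the result depends
    # on the order in which the iterations run.
    for i in range(len(indices)):
        out[indices[i]] = updates[i]

    return out


# Index 1 appears twice; the later update (30.0) replaces the earlier one.
result = scatter_1d([0.0, 0.0, 0.0, 0.0], [1, 3, 1], [10.0, 20.0, 30.0])
print(result)  # [0.0, 30.0, 0.0, 20.0]
```

   The second loop writes into the buffer that the first loop filled, and 
duplicate indices make the result order-dependent. Both properties make the 
update hard to parallelize naively.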
   
   Attempts to better parallelize the algorithm have produced some strange 
behavior. If I can get a correct, faster implementation working, I'll submit 
another PR.
   
   Thanks to @tkonolige for very helpful discussions. @notoraptor, this may be 
useful for your scatter_add work.
   
   cc @zhiics 



