mbrookhart opened a new pull request #6533: URL: https://github.com/apache/incubator-tvm/pull/6533
I was unable to get scatter working with te schedules on cuda due to the two loops that update values in place, so I resorted to using ir_builder directly. Attempts to better parallelize the algorithm have created some strange behavior. If I can get a correct and faster implementation to work, I'll submit another PR. Thanks to @tkonolige for very helpful discussions. @notoraptor, this may be useful for your scatter_add work. cc @zhiics ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
