chrishkchris commented on issue #535: SINGA-490 Optimize performance of
stochastic gradient descent (SGD)
URL: https://github.com/apache/incubator-singa/pull/535#issuecomment-532944885
Finally, I test the distributed training in AWS p2.x8large, after adding the
Sync() in the SGD loop of re
chrishkchris commented on issue #535: SINGA-490 Optimize performance of
stochastic gradient descent (SGD)
URL: https://github.com/apache/incubator-singa/pull/535#issuecomment-532723250
Next, I further improve the time performance by using in-place elementwise
multiply in many functions suc