Github user yinxusen commented on the pull request:
https://github.com/apache/spark/pull/166#issuecomment-38296681
I use the new method to enlarge local update. Test on SVM and
LogisticRegression looks as good as the first version, without the worry of
OOM. This method can get better result in shorter time, especially when the
dataset is too large to cache in memory.
I think this method is much more like the method provided
[here](http://arxiv.org/pdf/1209.2191.pdf) in section 3. I'm not mentioned that
it is a better way, but the original `GradientDescent` is somewhat like an
elephant pulling a small carriage.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---