GitHub user ankurdave opened a pull request:
https://github.com/apache/spark/pull/4273
[SPARK-5484] Checkpoint every 25 iterations in Pregel
Pregel-based iterative algorithms with more than ~50 iterations begin to
slow down and eventually fail with a StackOverflowError due to Spark's lack of
support for long lineage chains.
This PR makes Pregel checkpoint the graph every 25 iterations, which
truncates the lineage, provided a checkpoint directory has been set.
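
The behaviour described above could be sketched roughly as follows. This is
an illustration only, not the code in this PR: the object name
PeriodicCheckpoint, the helpers shouldCheckpoint and maybeCheckpoint, and
the choice to skip iteration 0 are assumptions made for the example. The
Spark calls it relies on (Graph.checkpoint and SparkContext.getCheckpointDir)
are existing APIs.

```scala
import org.apache.spark.graphx.Graph

// Hypothetical helper, not taken from the PR itself.
object PeriodicCheckpoint {
  // Interval taken from the PR title; how the PR stores it is not shown here.
  val CheckpointInterval = 25

  // Pure predicate: checkpoint only when a checkpoint directory is set and
  // the iteration is a positive multiple of the interval (skipping 0 is an
  // assumption of this sketch).
  def shouldCheckpoint(iteration: Int, checkpointDirSet: Boolean): Boolean =
    checkpointDirSet && iteration > 0 && iteration % CheckpointInterval == 0

  // Marks the graph for checkpointing when the predicate holds. Spark writes
  // the checkpoint the next time the graph's RDDs are materialized, so this
  // should be called before an action runs on the graph.
  def maybeCheckpoint[VD, ED](graph: Graph[VD, ED], iteration: Int): Unit = {
    val sc = graph.vertices.sparkContext
    if (shouldCheckpoint(iteration, sc.getCheckpointDir.isDefined)) {
      graph.checkpoint()
    }
  }
}
```

With a helper like this, a caller would set a checkpoint directory once via
sc.setCheckpointDir (with a path of its choosing) and then invoke
maybeCheckpoint(g, i) inside the iteration loop. If no directory is set, the
call is a no-op and the lineage keeps growing as before.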
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/ankurdave/spark SPARK-5484
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/4273.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #4273
----
commit 48364e64388355450c04605898eb443953e1a06e
Author: Ankur Dave <[email protected]>
Date: 2015-01-29T19:25:36Z
Checkpoint every 25 iterations in Pregel
----