GitHub user ankurdave opened a pull request:

    https://github.com/apache/spark/pull/4273

    [SPARK-5484] Checkpoint every 25 iterations in Pregel

    Pregel-based iterative algorithms with more than ~50 iterations begin to
    slow down and eventually fail with a StackOverflowError due to Spark's
    lack of support for long lineage chains.
    
    This PR causes Pregel to checkpoint the graph every 25 iterations if the
    checkpoint directory is set.
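
To illustrate why periodic checkpointing bounds the lineage length, here is a minimal toy model in plain Python. It is not Spark's API and not the code in this PR: the `Dataset` class, `run_iterations`, and the `checkpoint_interval` parameter are hypothetical names, and the sketch assumes lineage behaves like a simple chain of parent references that a checkpoint cuts.

```python
# Toy model of lineage growth in an iterative job.
# Assumption: each iteration derives a new dataset from the previous one,
# and a checkpoint materializes the data and drops the link to its parents.
# All names here are hypothetical; none of this is Spark's API.

class Dataset:
    def __init__(self, parent=None):
        self.parent = parent  # the dataset this one was derived from

    def lineage_depth(self):
        # Count the ancestors that would have to be walked to recompute.
        depth, node = 0, self
        while node.parent is not None:
            depth += 1
            node = node.parent
        return depth

    def transform(self):
        # One iteration step: a new dataset that depends on this one.
        return Dataset(parent=self)

    def checkpoint(self):
        # Model of a checkpoint: the data is saved, so ancestors are dropped.
        self.parent = None


def run_iterations(num_iterations, checkpoint_interval=None):
    """Return the longest lineage chain seen during the run.

    checkpoint_interval=None models the case where no checkpoint
    directory is set, so no checkpoints are taken.
    """
    data = Dataset()
    max_depth = 0
    for i in range(1, num_iterations + 1):
        data = data.transform()
        max_depth = max(max_depth, data.lineage_depth())
        if checkpoint_interval and i % checkpoint_interval == 0:
            data.checkpoint()
    return max_depth


if __name__ == "__main__":
    print(run_iterations(100))       # no checkpointing
    print(run_iterations(100, 25))   # checkpoint every 25 iterations
```

In this model the chain grows by one link per iteration when no checkpoints are taken, while an interval of 25 keeps it from ever exceeding 25 links, however many iterations run.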

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/ankurdave/spark SPARK-5484

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/4273.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #4273
    
----
commit 48364e64388355450c04605898eb443953e1a06e
Author: Ankur Dave <[email protected]>
Date:   2015-01-29T19:25:36Z

    Checkpoint every 25 iterations in Pregel

----


