GitHub user zsxwing opened a pull request:

    https://github.com/apache/spark/pull/9428

    Pr7021

    Took over #7021 and fixed the following potential issues in the previous PR:
    
    1. Make sure all data is checkpointed even if the Iterator is not drained.
    2. Make sure all partitions are checkpointed even if some partitions are
       not touched.
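
To illustrate the first point, here is a minimal sketch of an iterator wrapper that writes each element as it is consumed and can flush the remainder when the consumer stops early. It is not the code from this pull request: the constructor parameters `sink` and `closeSink` and the method `complete()` are hypothetical names chosen for illustration, and the sketch assumes some caller (for example a task-completion callback) invokes `complete()` once the task ends.

```scala
// Hypothetical sketch, not the CheckpointingIterator from the pull request.
// `sink` stands in for whatever writes one record of checkpoint data;
// `closeSink` stands in for finalizing the checkpoint output.
final class CheckpointingIterator[T](
    underlying: Iterator[T],
    sink: T => Unit,
    closeSink: () => Unit) extends Iterator[T] {

  private var completed = false

  override def hasNext: Boolean = {
    val more = !completed && underlying.hasNext
    // Reaching the end of the data finalizes the checkpoint output.
    if (!more) complete()
    more
  }

  override def next(): T = {
    if (!hasNext) throw new NoSuchElementException("next on empty iterator")
    val item = underlying.next()
    sink(item) // write each element as the consumer pulls it
    item
  }

  /** Writes any elements the consumer never pulled, then closes the sink. Idempotent. */
  def complete(): Unit = {
    if (!completed) {
      completed = true
      try underlying.foreach(sink) finally closeSink()
    }
  }
}

object CheckpointingIteratorDemo {
  def main(args: Array[String]): Unit = {
    val written = scala.collection.mutable.ArrayBuffer.empty[Int]
    val it = new CheckpointingIterator[Int](
      Iterator(1, 2, 3, 4), x => { written += x; () }, () => ())
    it.next()     // the consumer reads one element and stops
    it.complete() // the remaining elements are still written
    println(written.mkString(", "))
  }
}
```

The sketch only addresses an iterator that is not drained. A partition whose iterator is never created at all (the second point) cannot be handled by a wrapper like this and needs a separate mechanism.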

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/zsxwing/spark pr7021

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/9428.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #9428
    
----
commit d8635168a9f01e3be2b53a27cc5918a1a0ed1612
Author: Liang-Chi Hsieh <[email protected]>
Date:   2015-06-25T16:50:45Z

    Add CheckpointingIterator to optimize checkpointing.

commit 1a3055ea6ca67fcb23c1188b7c3344c726b054b3
Author: Liang-Chi Hsieh <[email protected]>
Date:   2015-06-25T17:10:52Z

    Fix scala style.

commit 3c5b203fd2b85f4110795a1fc6ca3e289ca0d837
Author: Liang-Chi Hsieh <[email protected]>
Date:   2015-06-26T09:49:01Z

    Write checkpoint data to disk if it is at the end of iterator.

commit a829a7d563d1a12e7d99cc1aecf87e264a016f7b
Author: Liang-Chi Hsieh <[email protected]>
Date:   2015-07-03T16:15:19Z

    Merge remote-tracking branch 'upstream/master' into optimize_checkpoint
    
    Conflicts:
        core/src/main/scala/org/apache/spark/rdd/RDDCheckpointData.scala

commit 2f43ff3c6d1a4a428e5cbe8f4a4e4347274fc95c
Author: Liang-Chi Hsieh <[email protected]>
Date:   2015-07-03T16:23:16Z

    Fix scala style.

commit 5c42503f84d496c4db8694f3bc78b9886b597f78
Author: zsxwing <[email protected]>
Date:   2015-11-02T22:37:28Z

    Merge remote-tracking branch 'origin/master' into pr7021

commit 647162fd67d745108ee4816c13d38b71bc71cd59
Author: zsxwing <[email protected]>
Date:   2015-11-03T01:33:22Z

    Fix the corner cases in CheckpointingIterator

----

