[
https://issues.apache.org/jira/browse/SPARK-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13981819#comment-13981819
]
Nan Zhu commented on SPARK-1299:
--------------------------------
addressed in https://github.com/apache/spark/pull/186
> making comments of RDD.doCheckpoint consistent with its usage
> -------------------------------------------------------------
>
> Key: SPARK-1299
> URL: https://issues.apache.org/jira/browse/SPARK-1299
> Project: Spark
> Issue Type: Improvement
> Components: Spark Core
> Affects Versions: 1.0.0
> Reporter: Nan Zhu
> Assignee: Nan Zhu
> Priority: Trivial
> Fix For: 1.0.0
>
>
> Another trivial thing I happened to notice: the comment on this function
> says
> /**
>  * Performs the checkpointing of this RDD by saving this. It is called by the DAGScheduler
>  * after a job using this RDD has completed (therefore the RDD has been materialized and
>  * potentially stored in memory). doCheckpoint() is called recursively on the parent RDDs.
>  */
> but this function is actually called in SparkContext.runJob.
> We can either change the comment or move the call into DAGScheduler. I personally
> prefer the latter: the call behaves like an automatic checkpoint, so it is better
> placed in a non-user-facing component.
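
The call path described in the issue can be sketched with a toy model. This is only an illustration, not Spark's source: ToyRDD, ToyScheduler, and ToyContext are hypothetical stand-ins for RDD, DAGScheduler, and SparkContext, and the rule that a marked RDD is saved while an unmarked one forwards the call to its parents is an assumption made for the sketch.

```scala
import scala.collection.mutable.ArrayBuffer

// Hypothetical stand-in for an RDD; not Spark's real class.
final class ToyRDD(val name: String,
                   val parents: Seq[ToyRDD],
                   val markedForCheckpoint: Boolean) {
  private var doCheckpointCalled = false

  // Saves this RDD if it was marked for checkpointing; otherwise
  // forwards the call to its parents. Runs at most once per RDD.
  def doCheckpoint(log: ArrayBuffer[String]): Unit = {
    if (!doCheckpointCalled) {
      doCheckpointCalled = true
      if (markedForCheckpoint) log += s"checkpoint $name"
      else parents.foreach(_.doCheckpoint(log))
    }
  }
}

// Hypothetical stand-in for the scheduler: it only runs the job.
object ToyScheduler {
  def runJob(rdd: ToyRDD, log: ArrayBuffer[String]): Unit =
    log += s"run ${rdd.name}"
}

// Hypothetical stand-in for the user-facing context. As the issue
// reports, the checkpoint call sits here, after the job has finished,
// rather than inside the scheduler.
object ToyContext {
  def runJob(rdd: ToyRDD, log: ArrayBuffer[String]): Unit = {
    ToyScheduler.runJob(rdd, log)
    rdd.doCheckpoint(log)
  }
}

object Demo {
  def main(args: Array[String]): Unit = {
    val log = ArrayBuffer.empty[String]
    val base = new ToyRDD("base", Nil, markedForCheckpoint = true)
    val mapped = new ToyRDD("mapped", Seq(base), markedForCheckpoint = false)
    ToyContext.runJob(mapped, log)
    log.foreach(println)
  }
}
```

In this sketch, the second option from the issue would amount to moving the rdd.doCheckpoint(log) line from ToyContext.runJob into ToyScheduler.runJob.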