Mingkang Li created SPARK-48928:
-----------------------------------

             Summary: Log Warning for Calling .unpersist() on Locally 
Checkpointed RDDs
                 Key: SPARK-48928
                 URL: https://issues.apache.org/jira/browse/SPARK-48928
             Project: Spark
          Issue Type: Improvement
          Components: Spark Core
    Affects Versions: 4.0.0
            Reporter: Mingkang Li


*Summary:* 
This change proposes to log a warning message when the {{.unpersist()}} method 
is called on RDDs that have been locally checkpointed in Apache Spark. This 
aims to inform users about the potential risks of unpersisting such RDDs 
without altering the existing behavior of the method.

*Background:*
 Local checkpointing in Spark truncates the lineage of an RDD, meaning that the 
RDD cannot be recomputed from its source. If an RDD that has been locally 
checkpointed is unpersisted, it loses its data and cannot be regenerated. This 
can lead to job failures if subsequent actions or transformations are attempted 
on the unpersisted RDD.

*Proposed Change:* 
To mitigate this issue, a warning message will be logged whenever 
{{.unpersist()}} is called on a locally checkpointed RDD. This approach 
maintains the current functionality while alerting users to the potential 
consequences of their actions. This change is intended to be non-disruptive and 
is a step towards better user awareness and debugging.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to