Github user dhruve commented on a diff in the pull request:
https://github.com/apache/spark/pull/15370#discussion_r82181554
--- Diff:
core/src/main/scala/org/apache/spark/rdd/ReliableCheckpointRDD.scala ---
@@ -83,8 +84,9 @@ private[spark] class ReliableCheckpointRDD[T: ClassTag](
* Return the locations of the checkpoint file associated with the given
partition.
*/
protected override def getPreferredLocations(split: Partition):
Seq[String] = {
- val status = fs.getFileStatus(
- new Path(checkpointPath,
ReliableCheckpointRDD.checkpointFileName(split.index)))
+ val path = new Path(checkpointPath,
ReliableCheckpointRDD.checkpointFileName(split.index))
--- End diff --
This is to determine the actual path to the part file. And if we plan to
not parse the old format, then we can hardcode it to the new format, because if
we keep it flexible - we will have to deal with figuring out what formatting
was used or else we would be dealing with `FNF` exceptions.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]