Github user szhem commented on a diff in the pull request:
https://github.com/apache/spark/pull/19294#discussion_r140654204
--- Diff:
core/src/main/scala/org/apache/spark/internal/io/HadoopMapReduceCommitProtocol.scala
---
@@ -57,6 +57,11 @@ class HadoopMapReduceCommitProtocol(jobId: String, path:
String)
*/
private def absPathStagingDir: Path = new Path(path, "_temporary-" +
jobId)
+ /**
+ * Checks whether there are files to be committed to an absolute output
location.
+ */
+ private def hasAbsPathFiles: Boolean = addedAbsPathFiles != null &&
addedAbsPathFiles.nonEmpty
--- End diff --
Good catch, thank you!
According to the `FileCommitProtocol`, `addedAbsPathFiles` is always null
on driver, so we will not be able to commit or remove these files.
Replaced it with
private def hasAbsPathFiles: Boolean = path != null
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]