HeartSaVioR opened a new pull request #26639: [SPARK-29999][SS] Handle 
FileStreamSink metadata correctly for empty partition
URL: https://github.com/apache/spark/pull/26639
 
 
   ### What changes were proposed in this pull request?
   
   This patch checks that each task's output file exists when the task is 
committed, so that creating SinkFileStatus does not throw 
FileNotFoundException. The check is newly required for the DSv2 implementation 
of FileStreamSink, which now creates the output file lazily (as an 
improvement).
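
The idea can be illustrated with a minimal, self-contained sketch. It is not Spark's code: `LazyTaskWriter`, `FileStatusSketch`, and `run_tasks` are hypothetical names used only to show why a lazily created output file needs an existence check at commit time.

```python
import os
import tempfile
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class FileStatusSketch:
    """Hypothetical stand-in for the per-file metadata the sink records."""
    path: str
    size: int


class LazyTaskWriter:
    """Hypothetical task writer that creates its file only on the first row."""

    def __init__(self, path: str) -> None:
        self.path = path
        self._handle = None

    def write(self, row: str) -> None:
        # Lazy creation: the file appears only once there is data to write.
        if self._handle is None:
            self._handle = open(self.path, "w", newline="\n")
        self._handle.write(row + "\n")

    def commit(self) -> Optional[FileStatusSketch]:
        if self._handle is not None:
            self._handle.close()
        # An empty partition never created the file, so check before
        # reading its status instead of letting the lookup fail.
        if not os.path.exists(self.path):
            return None
        return FileStatusSketch(self.path, os.path.getsize(self.path))


def run_tasks(out_dir: str, partitions: List[List[str]]) -> List[FileStatusSketch]:
    """Write one file per non-empty partition and collect the statuses."""
    statuses = []
    for i, rows in enumerate(partitions):
        writer = LazyTaskWriter(os.path.join(out_dir, f"part-{i:05d}"))
        for row in rows:
            writer.write(row)
        status = writer.commit()
        if status is not None:
            statuses.append(status)
    return statuses


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as out_dir:
        # The middle partition is empty and contributes no status entry.
        for status in run_tasks(out_dir, [["a", "b"], [], ["c"]]):
            print(os.path.basename(status.path), status.size)
```

Without the `os.path.exists` check, `os.path.getsize` would raise `FileNotFoundError` for the empty partition. In this sketch that stands in for the FileNotFoundException described above.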
   
   ### Why are the changes needed?
   
   Without this patch, FileStreamSink throws FileNotFoundException when writing 
an empty partition.
   
   ### Does this PR introduce any user-facing change?
   
   No.
   
   ### How was this patch tested?
   
   Added a unit test.
