zsxwing commented on pull request #35994:
URL: https://github.com/apache/spark/pull/35994#issuecomment-1082453714


   > It is very easy to be affected by network blips and causes the streaming 
query failed.
   
   Is it a unique issue of HDFS? If so, I’m surprised that HDFS client cannot 
survive from transient errors. Is Spark the right layer to fix this issue?
   
   In addition, will the same issue happen after `open`? For example, when 
reading the file content? Do we need to worry about other places as well? This 
is not the only place that touches FileSystem in the driver.
   
   Could you also add a unit test to verify the retry code? For example, you 
can use a fake file system to simulate the errors from `open`.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to