steveloughran edited a comment on pull request #35569:
URL: https://github.com/apache/spark/pull/35569#issuecomment-1049106381


   * alluxio shouldn't be complaining that the file isn't there. delete(path) 
must return true as the requirement "path is not present when we return" is met.
   * removing checks before delete() saves one round trip when working with 
object stores.
   * valid point about namenode lock overheads, but not something i personally 
worry too much about. a lock of some form may be needed for the exists probe 
too, and you've now got two RPCs. if the situation was that most times you did 
the call the path you wanted to delete wasn't there then maybe it could be 
justified, otherwise it adds 1 call per operation.
   
   overall then, -1 to this, though it does sound like alluxio is overreacting.
   
   >  why doesn't fs.delete check the existence on the server side, I think the 
following ideas might be related.
   
   it doesn't consider deleting a nonexistent path to be an error. after you 
finish the call, the path you passed in isn't there, which is the outcome it is 
trying to offer.  
   
   > Similarly, for users of FileSystem, maybe some FileSystems do check the 
existence before deleting like Alluxio, but, IMHO, we can't ask all the 
FileSystem to do the same, it's better to do the check on users' side.
   
   why?
    
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to