Spaarsh commented on issue #13896:
URL: https://github.com/apache/datafusion/issues/13896#issuecomment-2561873904

   > > @Spaarsh Good catch. I'm not sure what the best approach be since 
deleting is a destructive operation. We can either prevent the creation of a 
bad file in the first place (e.g., by verifying its MD5 hash after downloading) 
and keep the "file already exists" error, or prompt the user for an overwrite 
if a file exists. I don't know how to do either in bash but I guess you could 
add it to the PR if it's a simple fix. Otherwise another issue would be great 👍
   > 
   > @Spaarsh do you plan to work on @alihan-synnada suggestions?
   
   Sure! I do have a different approach though. Before the file is not entirely 
downloaded, it can be named something similar to imdb.tmp.tgz. If any error 
occurs, we purge the file before exiting. If no errors are encountered, change 
the filename to the intended one. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to