thomasfrederikhoeck commented on issue #5882:
URL: https://github.com/apache/arrow-rs/issues/5882#issuecomment-2288624368

   > @Xuanwo It might be network related, but I have a feeling that it is related 
to how `object_store` or `delta-rs` handles throughput lower than that 
within an Azure data center (some connections going stale while waiting for 
something else).
   > 
   > * Running from local laptop in Europe
   > * Azure region is Westeurope
   > * I just benchmarked with `azcopy bench 
"https://ACCOUNT.blob.core.windows.net/CONTAINER?SAS" --file-count 20 
--size-per-file 10000M`. So 20 files of 10 GB each, and here I get a throughput of 
145 Mb/s. It runs through with no failures.
   > 
   > The benchmark took 1+ hours with no failure while the delta-rs call fails 
within a few minutes.
   
   @tustvold The weird thing is that I can run some rather large data 
operations (taking 1+ hours) with `azcopy bench` without seeing any dropped 
connections or anything like that. 
   
   I can maybe add: before [this 
PR](https://github.com/pola-rs/polars/pull/15083/files) in polars we sometimes 
saw similar issues, but I'm very far from knowledgeable on networking.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org
