Re: [I] Object Store: S3 IP address selection is biased [arrow-rs]

via GitHub Tue, 11 Feb 2025 11:55:49 -0800


reidkaufmann commented on issue #7117:
URL: https://github.com/apache/arrow-rs/issues/7117#issuecomment-2651921814


   FYI: I noticed (when trying to determine the best way to filter a wireshark 
trace for S3 traffic) that the pool of addresses Amazon provides (even over a 
short period of time, i.e a minute) is much bigger than the list in a single 
DNS response.
   
   > this trades cost (via number of requests)
   
   Are you suggesting racing every S3 request or just ones that take "too long" 
(the S3 dock linked seems to be recommending aggressive timeouts).  I'm just 
curious because racing all reads has a bandwidth cost.  If we figure out how to 
saturate our network with S3 traffic ([amazon docs say it's 
possible](https://docs.aws.amazon.com/AmazonS3/latest/userguide/optimizing-performance.html#maximize)),
 there could be a significant impact (reflected in latency) to overall system 
performance in cache warming/thrashing scenarios, so we might want to use the 
technique judiciously if we start approaching total network bandwidth.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Re: [I] Object Store: S3 IP address selection is biased [arrow-rs]

Reply via email to