himanshug commented on issue #8038: Making optimal usage of multiple segment 
cache locations
URL: https://github.com/apache/incubator-druid/pull/8038#issuecomment-510627210
 
 
   it doesn't hurt to make the strategy configurable , however I think 
"Least-Bytes-Used" should be default instead of "Round-Robin" . 
   
   >  Let us consider we have 3 locations. If a location (location 1) loads a 
segment of size 1GB, the subsequent calls to load segments of lesser sizes will 
be distributed between locations 2 and 3 until both of them reach/cross 1 GB. 
This repeats every time a particular location loads a bigger size segment. This 
might not have optimal write throughput in such a scenario. However, I'm not 
sure how much of a problem is this.
   write happens in 1 or very few threads so write throughput is not impacted 
and on the contrary it improves read throughput due to similar utilization 
space in each location which has significantly higher concurrency. 
   
   many times users add new segment locations(after the node has been in use 
for a while and already has some data) and restart the node, with "Round Robin" 
newly added location will likely stay underutilized . Round-Robin wouldn't 
solve #7641 in that case.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to