[GitHub] [incubator-druid] benedictb opened a new issue #8639: Loading segments on demand

GitBox Mon, 07 Oct 2019 14:52:38 -0700

benedictb opened a new issue #8639: Loading segments on demand
URL: https://github.com/apache/incubator-druid/issues/8639
 
 
   I'd like to use druid as follows:
   
   - One tier of historicals would be responsible for the "current" data (< 1 
week old), keep all of their assigned segments in the segment cache, and be 
able to service queries quickly.
   - Another tier of historicals would be responsible for "backdated" data (> 1 
week old). These historicals would each have a larger slice of segments to keep 
track of, and do not keep all of the segments in the segment cache. These older 
segments would stay compressed in deep storage most of the time, and if a query 
needs backdated data, it will pull the data from deep storage to service the 
query. 
   
   Queries that target backdated data would certainly take a large performance 
hit, but since these are rare, it is acceptable. Additionally, once a backdated 
segment is loaded, it would stay in the "backdated" segment cache until 
evicted. 
   
   Is a workflow like this currently possible, that is, on-demand loading of 
deep storage segments for querying? And if not, how hard would it be to 
implement something like this?


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [incubator-druid] benedictb opened a new issue #8639: Loading segments on demand

Reply via email to