benedictb opened a new issue #8639: Loading segments on demand URL: https://github.com/apache/incubator-druid/issues/8639 I'd like to use druid as follows: - One tier of historicals would be responsible for the "current" data (< 1 week old), keep all of their assigned segments in the segment cache, and be able to service queries quickly. - Another tier of historicals would be responsible for "backdated" data (> 1 week old). These historicals would each have a larger slice of segments to keep track of, and do not keep all of the segments in the segment cache. These older segments would stay compressed in deep storage most of the time, and if a query needs backdated data, it will pull the data from deep storage to service the query. Queries that target backdated data would certainly take a large performance hit, but since these are rare, it is acceptable. Additionally, once a backdated segment is loaded, it would stay in the "backdated" segment cache until evicted. Is a workflow like this currently possible, that is, on-demand loading of deep storage segments for querying? And if not, how hard would it be to implement something like this?
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
