mistercrunch commented on issue #6066: Sorting rows when rollup is disabled URL: https://github.com/apache/incubator-druid/issues/6066#issuecomment-410313066 > Btw, this line of thinking is most effective if your queryGranularity is coarse, since Druid always sorts time first and then your other dimensions In light of what you described, my intuition tells me that this might be a bad default in most cases (expect when `queryGranularity == segmentGranularity`). Can you think of reasons why `__time` this would be a hard requirement, or could this be changed? Of course we can't just change it as it would have complex implications on query perf, but maybe a new ingestion spec flag `sortyBy` that would allow putting `__time` elsewhere might help a lot in getting better compression. I see Druid does RLE encoding of bitmaps (in at least some cases), and I guess LZ4 does something that's effectively a bit like RLE encoding on column data. It seems like there's an opportunity to give people powerful options to get better compression / perf here.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
