mistercrunch commented on issue #6066: Sorting rows when rollup is disabled
URL: 
https://github.com/apache/incubator-druid/issues/6066#issuecomment-410313066
 
 
   > Btw, this line of thinking is most effective if your queryGranularity is 
coarse, since Druid always sorts time first and then your other dimensions
   
   In light of what you described, my intuition tells me that this might be a 
bad default in most cases (expect when `queryGranularity == 
segmentGranularity`). Can you think of reasons why `__time` this would be a 
hard requirement, or could this be changed? Of course we can't just change it 
as it would have complex implications on query perf, but maybe a new ingestion 
spec flag `sortyBy` that would allow putting `__time` elsewhere might help a 
lot in getting better compression.
   
   I see Druid does RLE encoding of bitmaps (in at least some cases), and I 
guess LZ4 does something that's effectively a bit like RLE encoding on column 
data. It seems like there's an opportunity to give people powerful options to 
get better compression / perf here.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to