niketh opened a new pull request #5734: Multiple dimension partitioning spec
URL: https://github.com/apache/incubator-druid/pull/5734
 
 
   @gianm  @b-slim  Currently Druid supports only single dimension 
partitioning. This PR extends that and allows for multiple dimension 
partitioning.
   
   A new MultipleDimensionShardSpec has been introduced. The current 
DeterminePartitionsJob has been modified to use MultipleDimensionShardSpec.
   Also to effectively return domain, the MultipleDimensionShardSpec keeps 
track of Max/Min for every dimension.
   
   We have seen MultipleDimension partitioning to be extremely helpful when 
queries are always filtered on a particular set of dimensions - There are far 
lesser segments to scan. 
   
   Before introducing the MultipleDimensionShardSpec, Druid broker would (ask 
historicals) scan all segments for a particular time range, now if the 
filtering key is present in the dimensions of the shardspec, it greatly reduces 
segments to be scanned.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@druid.apache.org
For additional commands, e-mail: commits-h...@druid.apache.org

Reply via email to