vikramsinghchandel commented on issue #10210:
URL: https://github.com/apache/druid/issues/10210#issuecomment-664217155


   > @vikramsinghchandel thanks for the details! There was a bug in native 
batch task with rollup (#9861) when `partitionDimensions` are not set which was 
fixed in 0.19. Could you try again but with explicit `partitionDimensions` (you 
can specify all dimensions in there)? Note that `timestamp` column will not be 
included in hash partitioning with explicit `partitionDimensions` whereas it 
will be when they are implicit. Could you compare the rollup ratio of Hadoop 
and native task with explicit `partitionDimensions`?
   > 
   > Regarding performance, if you still saw difference in performance even 
with the same rollup ratio, I'm wondering how many actual tasks were running 
for hadoop and native. Could you double check they used the same number of 
worker tasks for each map and reduce phases?
   > 
   > BTW, when I tested Indexer last time, it wasn't as good as middleManager 
in terms of performance because of its limits on the global memory usage and 
concurrent segment persists/merges. Perhaps you could see different result with 
middleManagers.
   
   Testing this today will share the results.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to