leerho commented on issue #7187: Improve topN algorithm
URL: https://github.com/apache/incubator-druid/issues/7187#issuecomment-471798796
 
 
   Ok, this is what I've been suspecting.  
   
   If I understand what has been said here (and I may still be missing 
something), then with respect to the TopN operations:
   
   - In general, any time you limit results before an aggregation step, you 
can introduce errors. The size of the resulting error is unknown because it 
depends on the data, and there is no warning. This holds whether the limiting 
is performed on the node or on the broker.  
   
   - Also, the `max(k, 1000)` step may be wasteful when there is no 
aggregation, and may not provide enough data when there is aggregation. 
Again, there will be no warning.
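
To make the first point concrete, here is a minimal sketch with toy data. It is not Druid's implementation: the function names, the two-node layout, and the `per_node_limit` parameter (standing in for the `max(k, 1000)` step) are all assumptions made for illustration. Each node keeps only its largest partial counts before the merge, so a value that is never the local leader but is the largest in total gets dropped silently.

```python
from collections import Counter

def node_top(counts, limit):
    """Keep only the `limit` largest partial counts on one node (hypothetical helper)."""
    return dict(Counter(counts).most_common(limit))

def merged_top_k(nodes, k, per_node_limit):
    """Limit on every node first, then merge (sum) and take the top k."""
    merged = Counter()
    for counts in nodes:
        merged.update(node_top(counts, per_node_limit))  # update() adds counts
    return merged.most_common(k)

def exact_top_k(nodes, k):
    """Merge the full partial counts from every node, then take the top k."""
    merged = Counter()
    for counts in nodes:
        merged.update(counts)
    return merged.most_common(k)

# Toy data: "c" is only third on each node, but largest once summed.
node_a = {"a": 10, "b": 9, "c": 8}
node_b = {"d": 10, "e": 9, "c": 8}
nodes = [node_a, node_b]

approx = merged_top_k(nodes, k=1, per_node_limit=2)   # "c" is cut on both nodes
exact = exact_top_k(nodes, k=1)                       # [("c", 16)]
wider = merged_top_k(nodes, k=1, per_node_limit=3)    # limit happens to be large enough

print("limited:", approx)
print("exact:  ", exact)
print("wider:  ", wider)
```

In this toy case the limited query reports a winner with a total of 10 while the true winner has 16, and nothing in the limited result signals the discrepancy. Raising the per-node limit to 3 repairs this particular data set, but the limit that suffices depends on the data, which is the concern raised above.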
   
    
   
   
