leerho commented on issue #7187: Improve topN algorithm URL: https://github.com/apache/incubator-druid/issues/7187#issuecomment-471798796 Ok, this is what I've been suspecting. If I understand what has been said here (and I still may be missing something), then WRT the TopN operations: - In general, anytime you do limiting prior to an aggregation step you can produce errors, and the size of the resulting error will be unknown since it is data sensitive, and there will be no warning. This will be true whether it is performed on the node or on the broker. - Also, the `max(k, 1000)` step may be wasteful in the absence of aggregation, and may not provide enough data in the presence of aggregation. And, again, there will be no warning.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
