davecromberge opened a new pull request, #12042: URL: https://github.com/apache/pinot/pull/12042
Introduces additional parameters to the DistinctCountThetaSketch aggregation function that give the end-user more control over how sketches are merged. The defaults are selected to ensure that the behaviour remains unchanged over the current implementation. Furthermore, an accumulator custom object is added to ensure that pairwise union operations are avoided as much as possible. Instead, sketches can be aggregated and merged when a threshold is met. This PR is a `performance` enhancement and can be tagged/labelled as such. `release-notes`: - add configuration options for DistinctCountThetaSketchAggregationFunction - respect ordering for existing Theta sketches to use "early-stop" optimisations for unions. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
