praveenc7 opened a new pull request, #14001:
URL: https://github.com/apache/pinot/pull/14001
## Summary
For GroupBy queries, the default size of the GroupByResultHolder is set to
10K, which can lead to inefficient resource usage in cases where fewer group-by
keys are expected, such as in queries with highly selective filters.
`select column1, sum(column2) from testTable where column1 in ("123") group
by column1 limit 20000
`
## Description
This update dynamically adjusts the initial capacity of the
GroupByResultHolder based on the filter predicates for such queries. By
aligning the result holder size with the filter, we aim to optimize resource
allocation and improve performance for filtered group-by queries.
## Testing
TODO: Functional tests
Performance evaluation is also required to assess the trade-offs of the
introduced overhead vs. resource optimization.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]