[ 
https://issues.apache.org/jira/browse/CALCITE-6763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17910796#comment-17910796
 ] 

Julian Hyde commented on CALCITE-6763:
--------------------------------------

Did you see the following comment?
{code}
// TODO: Use a partially-ordered set data structure, so we are not scanning
// through all tiles.
{code}
I think you should implement that suggestion. Also add a test that demonstrates 
the improvement.

> Optimize logic to select the tiles with the fewest rows 
> --------------------------------------------------------
>
>                 Key: CALCITE-6763
>                 URL: https://issues.apache.org/jira/browse/CALCITE-6763
>             Project: Calcite
>          Issue Type: Improvement
>          Components: core
>            Reporter: xiaochen.zhou
>            Priority: Minor
>              Labels: pull-request-available
>
> Our current logic using a PriorityQueue to select the tile with the fewest 
> rows when multiple tiles are available. However, there are several potential 
> issues:
> 1. The PriorityQueue stores all satisfiable tiles. When the number of tiles 
> is large, maintaining the heap structure during element insertion has a time 
> complexity of O(log n), which also increases memory usage.
> 2. The initial size of the PriorityQueue is difficult to estimate and is 
> currently set to 1, causing frequent resizing of the PriorityQueue.
> We can optimize the code by keeping only a single bestCandidate tile to 
> improve performance.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to