Zoltán Borók-Nagy created IMPALA-11238:
------------------------------------------

             Summary: Avoid the need for COMPUTE STAST for Iceberg tables
                 Key: IMPALA-11238
                 URL: https://issues.apache.org/jira/browse/IMPALA-11238
             Project: IMPALA
          Issue Type: Improvement
          Components: Frontend
            Reporter: Zoltán Borók-Nagy


We still need to issue COMPUTE STATS for Iceberg tables to do proper planning.
The main reason for it that Iceberg metadata lacks NDV information about 
columns at the table level.

There are plans in Iceberg to store HyperLogLog arrays for data files, so once 
we have that we could use that information.

Until that maybe we could use some heuristics from Iceberg metadata when there 
is no precise NDV available.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to