Zoltán Borók-Nagy created IMPALA-15004:
------------------------------------------

             Summary: Puffin stats writer for Iceberg tables
                 Key: IMPALA-15004
                 URL: https://issues.apache.org/jira/browse/IMPALA-15004
             Project: IMPALA
          Issue Type: New Feature
            Reporter: Zoltán Borók-Nagy
            Assignee: Mihaly Szjatinya


Currently COMPUTE STATS store column statistics only in HMS.
Iceberg has Puffin files for this purpose, but currently there's only a single 
blob type (Apache Theta sketches) we can store that only supports NDV.

Impala should comply to Iceberg's standards and write Puffin files. The stats 
that cannot be stored in well-known Iceberg Puffin blob types could be stored 
in custom Impala blobs. That way all statistics information could be retrieved 
from a single place.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to