Zoltán Borók-Nagy created IMPALA-15004:
------------------------------------------
Summary: Puffin stats writer for Iceberg tables
Key: IMPALA-15004
URL: https://issues.apache.org/jira/browse/IMPALA-15004
Project: IMPALA
Issue Type: New Feature
Reporter: Zoltán Borók-Nagy
Assignee: Mihaly Szjatinya
Currently COMPUTE STATS store column statistics only in HMS.
Iceberg has Puffin files for this purpose, but currently there's only a single
blob type (Apache Theta sketches) we can store that only supports NDV.
Impala should comply to Iceberg's standards and write Puffin files. The stats
that cannot be stored in well-known Iceberg Puffin blob types could be stored
in custom Impala blobs. That way all statistics information could be retrieved
from a single place.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)