Hi Everyone,

I propose that we adopt the Puffin file format as a file format for
statistics and indexes in Iceberg tables.

Puffin file format specification:
https://github.com/apache/iceberg/blob/master/format/puffin-spec.md
(previous discussions: https://github.com/apache/iceberg/pull/4944,
https://github.com/apache/iceberg-docs/pull/69)

Intended use:
* statistics in Iceberg tables (see
https://github.com/apache/iceberg/pull/4945 and associated proposed
implementation https://github.com/apache/iceberg/pull/4741)
* in the future: storage for secondary indexes

Puffin file reader and writer implementation:
https://github.com/apache/iceberg/pull/4537

Thanks,
PF
