+1 from me! Some people may not have followed the design discussions, so we can start a DISCUSS thread if needed. But if everyone is comfortable with the design and implementation, I think it's ready for a vote as well.
Huge thanks to Piotr for getting this ready! I think the format is going to be really useful for both stats and indexes in Iceberg.

On Thu, Jun 9, 2022 at 3:35 AM Piotr Findeisen <pi...@starburstdata.com> wrote:

> Hi Everyone,
>
> I propose that we adopt Puffin file format as a file format for statistics
> and indexes in Iceberg tables.
>
> Puffin file format specification:
> https://github.com/apache/iceberg/blob/master/format/puffin-spec.md
> (previous discussions: https://github.com/apache/iceberg/pull/4944,
> https://github.com/apache/iceberg-docs/pull/69)
>
> Intended use:
> * statistics in Iceberg tables (see
> https://github.com/apache/iceberg/pull/4945 and associated proposed
> implementation https://github.com/apache/iceberg/pull/4741)
> * in the future: storage for secondary indexes
>
> Puffin file reader and writer implementation:
> https://github.com/apache/iceberg/pull/4537
>
> Thanks,
> PF
>

--
Ryan Blue
Tabular