Reminder that next community sync is today in ten minutes.
I am not able to facilitate today but someone will volunteer and take
notes.
Thank you!

On Mon, Jul 7, 2025 at 08:54 Micah Kornfield <emkornfi...@gmail.com> wrote:

> Please find notes belowJun 25, 2025 | Apache Parquet Community Sync
> <
> https://www.google.com/calendar/event?eid=MmZvYnM1cXRoOWQ2aHVwbWRjcTF1azZpdmFfMjAyNTA1MjhUMTcwMDAwWiBqdWxpZW4ubGVkZW1AbQ
> >
>
> Attendees: Apache Parquet Community Sync
> <apache-parquet-community-s...@googlegroups.com>
>
>    -
>
>    Micah Kornfield: Databricks
>    -
>
>    Alex Stephen: Google
>    -
>
>    Alkis Evlogimenos: Databricks
>    -
>
>    Brian Hulette: Google
>    -
>
>    Dewey Dunnington: Wherobots
>    -
>
>    Jonas: Snowflake
>    -
>
>    Jeff Plaisance: Snowflake
>    -
>
>    Rahul Sharma: Databricks
>    -
>
>    Gijs Burghoorn: Polars
>    -
>
>    Marc Cenac: Datadog
>    -
>
>    Martin Prammer: Carnegie Mellon
>    -
>
>    Rahul Sharma: Databricks
>    -
>
>    Russell Spitzer: Snowflake
>    -
>
>    Prateek Gaur: Snowflake
>    -
>
>    Andrew Lamb: InfluxData
>    -
>
>    Fokko Driesprong: Databricks
>    -
>
>    Yun Zou: Snowflake
>    -
>
>    Ashish Paliwal: SumoLogic - listening in
>
>
> Agenda:
>
>    -
>
>    [EXTERNAL] INT96 stats proposal parquet comittee
>    <
> https://docs.google.com/document/d/1Ox0qHYBgs_3-pNqn9V8zVQm_W6qP0lsbd2XwQnQVz1Y/edit
> >
>    -
>
>    Andrew: Rust Variant implementation update. See epic
>    https://github.com/apache/arrow-rs/issues/6736
>    -
>
>    Decfloat proposal: DECFLOAT Parquet Proposal
>    <
> https://docs.google.com/document/d/1j_Q6vnn6Nhy60K4o0tdC91kE5vKGNJaoDOAm71KLzNw/edit?tab=t.0#heading=h.4gcdhz9daib6
> >
>    -
>
>    Interval type
>    <
> https://docs.google.com/document/d/12ghQxWxyAhSQeZyy0IWiwJ02gTqFOgfYm8x851HZFLk/edit?tab=t.0
> >:
>    Iceberg/Parquet Interval Data Type Proposal
>    <
> https://docs.google.com/document/d/12ghQxWxyAhSQeZyy0IWiwJ02gTqFOgfYm8x851HZFLk/edit?tab=t.0#heading=h.rt0cvesdzsj7
> >
>    -
>
>    Flatbuffer footers
>
>
> Notes:
>
>    -
>
>    Int96 timestamp - deprecated, has special comparison logic.
>    -
>
>       Databricks photon is emitting this.
>       -
>
>       Arrow-rs - reads and writes the stats
>       -
>
>       Need version checks in PRs (don’t assume any parquet-mr produced
>       correct stats)
>       -
>
>          Across all implementations
>          -
>
>       New sort order needed?
>       -
>
>          Int96 only has meaning of timestamps, might not be needed.
>          -
>
>          Currently in spec it is undefined.
>          -
>
>       When will PR be merged to make int64 defaults in spark
>       -
>
>          https://github.com/apache/spark/pull/50215
>          -
>
>       Maybe a note should be added in parquet-format explaining why it is
>       deprecated
>       -
>
>    Variant type
>    -
>
>       Implementations started, and running smoothly.  Relatively close to
>       basic reading/writing basic types.  Shredding will take a bit of
> time.
>       -
>
>       Java implementation close to have shredded data.
>       -
>
>       Can Java testing be shared?
>       -
>
>       Maybe we should have a java release.
>       -
>
>       Follow up ticket for getting examples of shredding:
>
> https://github.com/apache/parquet-testing/issues/75#issuecomment-3005659028
>
>       -
>
>    DecFloat
>    -
>
>       Extension type proposal is latest, no objections in theory.  But
>       original author has not have bandwidth to drive through process.
>       -
>
>       Will update the doc with extension type alternative.
>       -
>
>       Extension types could provide actual proof of value.
>       -
>
>    Interval Type
>    -
>
>       Could variable FLBA types. Or parameterized based on type.
>
>
>
> On Mon, Jun 30, 2025 at 5:12 AM Julien Le Dem <jul...@apache.org> wrote:
>
> > Thank you for volunteering.
> > Can you post the meeting notes to the list?
> > Thank you
> > Julien
> >
> > On Wed, Jun 25, 2025 at 6:56 PM Micah Kornfield <emkornfi...@gmail.com>
> > wrote:
> >
> > > I think I volunteered at the last meeting to moderate this one.  See
> > every
> > > one soon.
> > >
> > > On Wed, Jun 25, 2025 at 9:54 AM Julien Le Dem <jul...@apache.org>
> wrote:
> > >
> > > > Reminder that I won’t be able to attend and someone else will
> volunteer
> > > to
> > > > moderate and take notes.
> > > >
> > > > The next Parquet sync is today June 25th at 10am PT - 1pm ET - 7pm
> CET
> > > > To join the invite, join the group:
> > > > https://groups.google.com/g/apache-parquet-community-sync
> > > >
> > > > Everybody is welcome, bring your topic or just listen in.
> > > >
> > > > (Some more details on how the meeting is run:
> > > > https://lists.apache.org/thread/bjdkscmx7zvgfbw0wlfttxy8h6v3f71t )
> > > >
> > >
> >
>

Reply via email to