Please find notes attached:

Attendees:

   - Micah Kornfield: Databricks
   - Adam Reeve: GResearch - Encryption in Rust
   - Adrian Garcia Badaracco: Pydantic
   - Aditya Bhatnagar: CMU
   - Aihua Xu: Snowflake
   - Alex Stephen: Google
   - Brian Hulette: Google
   - Daniel Weeks: Databricks
   - Gijs Burghoorn: Polars
   - Jeff Plaisance: Snowflake
   - Martin Prammer: CMU
   - Prateek Gaur: Snowflake
   - Raul Cumplido: Quantstack
   - Rok Mihevic: GResearch
   - Ryan Blue: Databricks
   - Sandeep Gottimukkala: Snowflake
   - Yun Zou: Snowflake


Agenda:

   - Int96 sort order specified
      - Explicit (a new sort order should be added) vs. implicit (rely on
        version string parsing)
         - Need to check whether int96 was ever produced. Implicit might be
           OK in that case; otherwise a new sort order would be preferred.
   - Variant update
      - Writing for Avro/Variant (and shredding), with test cases in Java, is
        merged
      - Go progress:
         - Write binary values
         - Ability to shred, but no control over shredding
         - Touch base to see if this is finalized
      - Testing
         - Existing issue for test data
           <https://github.com/apache/parquet-testing/issues/75>
            - Ryan to follow up with the Databricks team on the test issue
         - Shredded and read back
         - Non-shredded and read back
         - Shred everything to shredded schemas for other types and read it
           back
      - Release
         - Would be nice for Iceberg.
            - Need a volunteer. (Aihua volunteered; need to work through
              logistics)
   - Interval type
      - Current proposal:
         - YearMonth - annotates int32
         - DurationNanos - annotates int64
   - New footer (wide schema)
      - I/O is still important (need to make sure size is equivalent)
      - Stats - make it possible to only parse subsections; should the footer
        be subdivided earlier?
         - Forwards/backwards compatibility should also be considered in the
           design
