Re: [Parquet] ALP Encoding for Floating point data

PRATEEK GAUR Tue, 17 Feb 2026 16:28:08 -0800

Hi team,

1) Andrew


   - Thanks for working on test files. My PR did add all the test files I
   used to benchmark on datasets. Maybe we can club it together. WIll also aid
   cross language testing
   -  Kosta Tarasov working on Rust implementation. This is great. Thanks


2) Antoine

   - Thanks a lot for reporting the numbers on AMD. Looks like you are
   getting 8X the decoding performance of BSS. This is amazing!!.
   - Thanks for acknowledging the sampling design.
   - I agree with you on Fastlanes. In some crude experiments I didn't get
   a good perf benefit from it on Graviton3 (but maybe there was something
   wrong with my implementation).
   - Locking the 16bit exception encoding for the spec in this case.
   - Awesome I think we have solved for all open questions minus the
   version byte :). (will get back on this soon)


3) Micah

   - FastLanes : The current spec does allow for using FastLane with the
   configurable enum value for layout. We should be able to inject any layout
   in the current design.


Working on resolving all remaining open comments on the spec this week.

Best
Prateek


On Tue, Feb 10, 2026 at 3:37 AM Steve Loughran <[email protected]> wrote:

> On Sun, 8 Feb 2026 at 18:12, Micah Kornfield <[email protected]>
> wrote:
>
> >
> >
> > It looks like the actual issue described for ORC in the paper is that it
> > has multiple sub-encodings in a batch.  This is different then the design
> > proposed here where there is still fixed encoding per page in parquet.
> > Given reasonably sized pages I don't think branch misprediction should
> be a
> > big issue for new encodings.  I agree that we should be conservative in
> > general for adding new encodings.
> >
> >
> +1
>

Re: [Parquet] ALP Encoding for Floating point data

Reply via email to