Thanks for the reply, Gang. I'm not sure deprecation is what I'm going for; I
would just like a little clarity in Encodings.md around this issue. As it
stands, using 33 bits appears to be allowed by the specification, and given
this issue <https://github.com/apache/arrow/issues/20374> pointed out by Fu
Xuwei, there doesn't seem to be a clear path to supporting reading such files
in arrow-cpp (and perhaps arrow-rs as well?). I think adding words to the
effect that writers should not produce such files, but readers should be able
to read them (emphasis on "should"), would leave things as they currently
stand while at least acknowledging that there's tension here. Does this seem
reasonable?
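
For concreteness, here is a rough sketch of how a 33-bit width can arise. It
assumes 32-bit values whose deltas are computed in wider (64-bit style)
arithmetic, as opposed to wrapping 32-bit arithmetic. The function name and
the sample values are made up for illustration and are not taken from any
Parquet implementation; the block/miniblock layout is ignored entirely.

```python
INT32_MIN = -2**31
INT32_MAX = 2**31 - 1


def widest_delta_width(values, wrap32):
    """Bit width needed for the largest (delta - min_delta) in `values`.

    Hypothetical helper for illustration only. With wrap32=False the deltas
    are computed without overflow (as if in 64-bit arithmetic); with
    wrap32=True they wrap around like signed 32-bit integers.
    """
    deltas = [b - a for a, b in zip(values, values[1:])]
    if wrap32:
        # Reduce each delta to the signed 32-bit range [-2**31, 2**31 - 1].
        deltas = [((d + 2**31) % 2**32) - 2**31 for d in deltas]

    min_delta = min(deltas)
    adjusted = [d - min_delta for d in deltas]
    if wrap32:
        # Treat the adjusted deltas as unsigned 32-bit values.
        adjusted = [a % 2**32 for a in adjusted]

    return max(a.bit_length() for a in adjusted)


# Worst case for INT32: swing between the two extremes.
values = [INT32_MIN, INT32_MAX, INT32_MIN]
wide = widest_delta_width(values, wrap32=False)
narrow = widest_delta_width(values, wrap32=True)
print(wide, narrow)
```

With the non-wrapping deltas the adjusted value exceeds 2**32, so it cannot be
packed in 32 bits; with wrapping arithmetic the width can never exceed 32.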

Cheers,
Ed

________________________________
From: Gang Wu <ust...@gmail.com>
Sent: Wednesday, October 25, 2023 9:49 PM
To: dev@parquet.apache.org <dev@parquet.apache.org>
Subject: Re: Max bitwidth for delta encoding

Hi Ed,

My concern with changing the spec is that existing writer implementations
have already produced Parquet files of the kind the change intends to avoid.
So it would take a long time to deprecate the old writers, while any reader
implementation should always be able to decode legacy files.

Best,
Gang
