There is an issue about this: https://issues.apache.org/jira/browse/ARROW-7830
+1 on changing this to follow the Arrow version number (the current
non-changing number is not particularly useful ..)

Joris

On Fri, 5 Mar 2021 at 19:27, Micah Kornfield <[email protected]> wrote:

> There has not been an official release of the Parquet C++ library in quite
> some time. I don't think this is a huge issue as the parquet bits are
> packaged into each Arrow release.
>
> However, one practical concern is when bugs crop up for a particular
> version writing a parquet file, it is impossible for readers to mitigate
> them. One practical example is a long standing bug (with a fix recently
> merged) where the comparator for ByteArray/FLBA encoded Decimals was
> incorrectly implemented. This means min/max statistics for these Decimal
> values cannot be relied on.
>
> I'd like to propose that we change the default version string [1] for
> parquet-cpp to reflect arrow releases (e.g. "parquet-cpp-arrow version
> 3.0.0" instead of "parquet-cpp version 1.5.1-snapshot").
>
> Any objections? An alternative would be to try to do releases of
> parquet-cpp on the same timeline as Arrow releases.
>
> Thanks,
> Micah
>
> [1]
> https://github.com/apache/arrow/blob/25c736d48dc289f457e74d15d05db65f6d539447/cpp/src/parquet/parquet_version.h.in
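
To make the reader-side mitigation concrete, here is a rough sketch (Python, not the actual parquet-cpp code) of how a reader could use a per-release writer string to decide whether Decimal min/max statistics can be trusted. It assumes the proposed "parquet-cpp-arrow version X.Y.Z" format. The names `parse_created_by`, `can_trust_decimal_stats` and the value of `FIRST_FIXED_VERSION` are hypothetical placeholders; the thread does not say which Arrow release first carries the fix.

```python
import re
from typing import Optional, Tuple

# Hypothetical placeholder: the thread does not state which Arrow release
# first contains the Decimal comparator fix.
FIRST_FIXED_VERSION = (4, 0, 0)

# Matches strings such as "parquet-cpp-arrow version 3.0.0" and
# "parquet-cpp version 1.5.1-snapshot" (any suffix after X.Y.Z is ignored).
_CREATED_BY = re.compile(
    r"^(?P<app>\S+) version (?P<major>\d+)\.(?P<minor>\d+)\.(?P<patch>\d+)"
)


def parse_created_by(created_by: str) -> Optional[Tuple[str, Tuple[int, int, int]]]:
    """Split a writer string into (application, (major, minor, patch))."""
    m = _CREATED_BY.match(created_by)
    if m is None:
        return None
    version = (int(m.group("major")), int(m.group("minor")), int(m.group("patch")))
    return m.group("app"), version


def can_trust_decimal_stats(created_by: str) -> bool:
    """Decide whether min/max statistics for Decimal columns are usable."""
    parsed = parse_created_by(created_by)
    if parsed is None:
        # Unparseable writer string: be conservative.
        return False
    app, version = parsed
    if app == "parquet-cpp":
        # The old, never-changing string says nothing about the real build.
        return False
    if app == "parquet-cpp-arrow":
        return version >= FIRST_FIXED_VERSION
    # Assumption for this sketch: other writers are not affected by this bug.
    return True
```

With the old fixed string ("parquet-cpp version 1.5.1-snapshot") the reader has to distrust every file, which is exactly the problem described above; with a string that tracks Arrow releases it can distrust only the affected versions.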
