[
https://issues.apache.org/jira/browse/ARROW-17100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17567851#comment-17567851
]
Antoine Pitrou commented on ARROW-17100:
----------------------------------------
That changeset is ARROW-10353, which fixes bugs both in the read and write path
for V2 data pages. On the write side, Parquet C++ used not to always set
{{is_compressed = false}} in the data page, regardless of compression. On the
read side, Parquet C++ used to always decompress, regardless of the
{{is_compressed}} flag.
> [C++][Parquet] Fix backwards compatibility for ParquetV2 data pages written
> prior to 3.0.0 per ARROW-10353
> ----------------------------------------------------------------------------------------------------------
>
> Key: ARROW-17100
> URL: https://issues.apache.org/jira/browse/ARROW-17100
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++, Parquet
> Reporter: Wes McKinney
> Priority: Blocker
> Fix For: 9.0.0
>
>
> As described in
> https://lists.apache.org/thread/xkrhgfpk9sr1mj74d4chz3r5yp3szt6c,
> https://github.com/apache/arrow/commit/ef0feb2c9c959681d8a105cbadc1ae6580789e69
> Caused some files written prior to 3.0.0 to be unreadable. Given that the
> patch was small, this will hopefully not be too difficult to fix
--
This message was sent by Atlassian Jira
(v8.20.10#820010)