[ 
https://issues.apache.org/jira/browse/ARROW-17100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17567851#comment-17567851
 ] 

Antoine Pitrou commented on ARROW-17100:
----------------------------------------

That changeset is ARROW-10353, which fixes bugs in both the read and write 
paths for V2 data pages. On the write side, Parquet C++ used to always set 
{{is_compressed = false}} in the data page, regardless of compression. On the 
read side, Parquet C++ used to always decompress, regardless of the 
{{is_compressed}} flag.
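
To make the interaction concrete, here is a rough sketch of the decision a reader has to make for a V2 data page. It is only an illustration, not the actual Parquet C++ code: the function name, its parameters, and the idea of passing the writer version as a tuple are all assumptions made for this example. The 3.0.0 threshold is taken from the issue title.

```python
from typing import Optional, Tuple

# Assumption for illustration: files written by Parquet C++ before this
# version carry an unreliable is_compressed flag (per the issue title).
FIXED_VERSION = (3, 0, 0)


def must_decompress(
    column_has_codec: bool,
    is_compressed_flag: bool,
    parquet_cpp_version: Optional[Tuple[int, int, int]],
) -> bool:
    """Decide whether a V2 data page body should be decompressed.

    parquet_cpp_version is None when the file was not written by
    Parquet C++ or the writer version is unknown (hypothetical convention).
    """
    if not column_has_codec:
        # The column chunk is uncompressed, so there is nothing to undo.
        return False
    if parquet_cpp_version is not None and parquet_cpp_version < FIXED_VERSION:
        # Old writer: the flag cannot be trusted, so fall back to the old
        # reader behaviour and always decompress.
        return True
    # Otherwise honour the flag stored in the page header.
    return is_compressed_flag
```

With this sketch, a compressed page from a 2.0.0 writer is decompressed even though its flag says {{false}}, while a page from a 3.0.0 or later writer is handled according to the flag.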



> [C++][Parquet] Fix backwards compatibility for ParquetV2 data pages written 
> prior to 3.0.0 per ARROW-10353
> ----------------------------------------------------------------------------------------------------------
>
>                 Key: ARROW-17100
>                 URL: https://issues.apache.org/jira/browse/ARROW-17100
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++, Parquet
>            Reporter: Wes McKinney
>            Priority: Blocker
>             Fix For: 9.0.0
>
>
> As described in 
> https://lists.apache.org/thread/xkrhgfpk9sr1mj74d4chz3r5yp3szt6c, 
> https://github.com/apache/arrow/commit/ef0feb2c9c959681d8a105cbadc1ae6580789e69
> caused some files written prior to 3.0.0 to be unreadable. Given that the 
> patch was small, this will hopefully not be too difficult to fix.



