gszadovszky commented on PR #3326: URL: https://github.com/apache/parquet-java/pull/3326#issuecomment-3326642645
@ArnavBalyan, I have a feeling that this change breaks the contract of `Page`. In `ColumnChunkPageReadStore` we actually decompress (and decrypt) the data, so the fact that the compressed and decompressed page sizes are equal are deliberate. BTW, based on the code it seems that the same problem described in this issue also stands for V2 pages. Since in parquet-cli we only want to show page statistics, I would suggest implementing a separate code path that would only read the page headers to provide the required values instead of decompressing (and decrypting) the pages for no reason. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
