gszadovszky commented on PR #3326:
URL: https://github.com/apache/parquet-java/pull/3326#issuecomment-3326642645

   @ArnavBalyan, I have a feeling that this change breaks the contract of 
`Page`. In `ColumnChunkPageReadStore` we actually decompress (and decrypt) the 
data, so the fact that the compressed and decompressed page sizes are equal are 
deliberate. BTW, based on the code it seems that the same problem described in 
this issue also stands for V2 pages.
   
   Since in parquet-cli we only want to show page statistics, I would suggest 
implementing a separate code path that would only read the page headers to 
provide the required values instead of decompressing (and decrypting) the pages 
for no reason.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to