mapleFU commented on issue #35726:
URL: https://github.com/apache/arrow/issues/35726#issuecomment-1562864751

   Thanks @jorisvandenbossche 
   
   Compression depends on window-size and data distribution. Generally, int64 
might be better than int32.
   
   It's possible that compression size is greater than it's size before 
compression. Parquet has two versions of data pages. In Page Version1, all page 
in a column chunk should share same compression and must be compressed if 
"compression" is used. On Page Version 2, a page can decide not to compress if 
it's size grows larger after compressed.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to