mapleFU commented on issue #35726: URL: https://github.com/apache/arrow/issues/35726#issuecomment-1562864751
Thanks @jorisvandenbossche

The compressed size depends on the window size and the data distribution. Generally, int64 might compress better than int32. It's also possible for the compressed size to be greater than the size before compression.

Parquet has two versions of data pages. In page version 1, all pages in a column chunk share the same compression and must be compressed if "compression" is used. In page version 2, a page can decide not to compress if its size would grow after compression.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
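
To illustrate the page-version behaviour described in the comment above, here is a minimal sketch in plain Python. It is not the Parquet writer's code: `encode_page` is a hypothetical helper, `zlib` stands in for whatever codec the column uses, and the 4096-byte "pages" are made up for the demonstration.

```python
import random
import zlib
from typing import Tuple


def encode_page(raw: bytes, page_version: int) -> Tuple[bytes, bool]:
    """Return (payload, is_compressed) for one hypothetical data page."""
    compressed = zlib.compress(raw)
    if page_version == 1:
        # Version 1: every page in the chunk is compressed, even if it grows.
        return compressed, True
    # Version 2: keep the raw bytes when compression does not shrink the page.
    if len(compressed) < len(raw):
        return compressed, True
    return raw, False


# Repetitive bytes compress well; pseudo-random bytes typically do not.
repetitive = bytes(4096)
rng = random.Random(0)
noisy = bytes(rng.getrandbits(8) for _ in range(4096))

for name, raw in [("repetitive", repetitive), ("noisy", noisy)]:
    for version in (1, 2):
        payload, is_compressed = encode_page(raw, version)
        print(name, f"v{version}", len(raw), "->", len(payload), is_compressed)
```

With the noisy input, the version 1 path stores a payload slightly larger than the raw page, while the version 2 path keeps the raw bytes and records that the page is uncompressed.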
