[ https://issues.apache.org/jira/browse/ARROW-6216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Wes McKinney reassigned ARROW-6216:
-----------------------------------

    Assignee: Martin Radev

> [C++] Allow user to select the compression level
> ------------------------------------------------
>
>                 Key: ARROW-6216
>                 URL: https://issues.apache.org/jira/browse/ARROW-6216
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Martin Radev
>            Assignee: Martin Radev
>            Priority: Minor
>              Labels: pull-request-available
>   Original Estimate: 2h
>          Time Spent: 13h 40m
>  Remaining Estimate: 0h
>
> The compression level selected in Arrow for ZSTD is 1, which is the minimal
> compression level for the compressor. This leads to very high compression
> speed at the expense of compression ratio.
> The user should be allowed to select the compression level, as both speed and
> ratio are data-specific.
> The proposed solution is to expose the knob via an environment variable such
> as ARROW_ZSTD_COMPRESSION_LEVEL.
> Example:
>   export ARROW_ZSTD_COMPRESSION_LEVEL=10
>   ./my_parquet_app
> Here is a test run with compression levels of 1, 2 and 5:
>   Level   Time (s)   Size (MB)
>   1       13.02      181
>   2       13.10      177
>   5       19.44      148



--
This message was sent by Atlassian Jira
(v8.3.2#803003)
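
The environment-variable knob proposed in the issue could be read roughly as follows. This is only a sketch under stated assumptions, not Arrow's actual implementation: the helper names `ParseCompressionLevel` and `ZstdLevelFromEnv` are hypothetical, the accepted range of 1 to 22 is assumed rather than queried from the zstd library, and falling back to level 1 on bad input is one possible policy among several.

```cpp
#include <cerrno>
#include <cstdlib>

// Default taken from the issue text: Arrow currently uses ZSTD level 1.
constexpr int kDefaultZstdLevel = 1;

// Hypothetical helper: parse a compression level from text.
// Returns `fallback` when the text is null or empty, is not a decimal
// integer, has trailing characters after the number, or lies outside
// [min_level, max_level].
int ParseCompressionLevel(const char* text, int fallback,
                          int min_level, int max_level) {
  if (text == nullptr || *text == '\0') return fallback;
  errno = 0;
  char* end = nullptr;
  const long value = std::strtol(text, &end, 10);
  // Reject overflow, input with no digits, and trailing garbage such as "10x".
  if (errno != 0 || end == text || *end != '\0') return fallback;
  if (value < min_level || value > max_level) return fallback;
  return static_cast<int>(value);
}

// Hypothetical helper: read the variable named in the issue.
// The 1..22 bounds are an assumption; real code would more likely ask the
// zstd library for its limits (for example via ZSTD_maxCLevel()).
int ZstdLevelFromEnv() {
  return ParseCompressionLevel(std::getenv("ARROW_ZSTD_COMPRESSION_LEVEL"),
                               kDefaultZstdLevel, 1, 22);
}
```

With `export ARROW_ZSTD_COMPRESSION_LEVEL=10` set as in the issue's example, `ZstdLevelFromEnv()` would yield 10; with the variable unset or malformed it would keep the current default of 1, so existing behavior is preserved unless the user opts in.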