Xinli Shang created PARQUET-2075:
------------------------------------
Summary: Unified translation tool
Key: PARQUET-2075
URL: https://issues.apache.org/jira/browse/PARQUET-2075
Project: Parquet
Issue Type: New Feature
Reporter: Xinli Shang
Assignee: Xinli Shang
During the discussion of PARQUET-2071, we came up with the idea of a universal
tool that translates an existing file into a different state while skipping
lower-level steps such as encoding/decoding, to gain speed. For example, for
trans-compression it would only decompress the pages and then recompress them
directly. For PARQUET-2071, it would only decrypt and then re-encrypt directly.
This will help existing data onboard Parquet features such as column
encryption, zstd, etc.
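The idea of skipping the encoding/decoding layer can be sketched roughly as
follows. This is only a toy model in Python, not the Parquet API: the codec
table, encode_page, decode_page and translate_page are hypothetical names, and
stdlib zlib and bz2 stand in for real Parquet codecs such as snappy or zstd.
The point is that translate_page works on the compressed page bytes only and
never touches the encoded values.

```python
import bz2
import struct
import zlib

# Hypothetical codec table: name -> (compress, decompress).
# Stdlib codecs stand in for real Parquet codecs (snappy, zstd, ...).
CODECS = {
    "gzip": (zlib.compress, zlib.decompress),
    "bz2": (bz2.compress, bz2.decompress),
}


def encode_page(values):
    """Toy encoding step: pack int32 values little-endian."""
    return struct.pack(f"<{len(values)}i", *values)


def decode_page(encoded):
    """Inverse of encode_page; only used here to verify the result."""
    return list(struct.unpack(f"<{len(encoded) // 4}i", encoded))


def translate_page(page_bytes, src_codec, dst_codec):
    """Decompress with the source codec and recompress with the target codec.

    The encoded payload is passed through as opaque bytes, so the
    (comparatively expensive) value decoding and re-encoding is skipped.
    """
    _, decompress = CODECS[src_codec]
    compress, _ = CODECS[dst_codec]
    return compress(decompress(page_bytes))


# Build one "page" the slow way, then translate it the fast way.
values = list(range(1000))
original = CODECS["gzip"][0](encode_page(values))
translated = translate_page(original, "gzip", "bz2")

# Verification only: a reader using the new codec sees the same values.
roundtrip = decode_page(CODECS["bz2"][1](translated))
```

The same shape would apply to the PARQUET-2071 case by swapping the
decompress/compress pair for a decrypt/encrypt pair; in a real file the footer
metadata and page offsets would also have to be rewritten, which this sketch
leaves out.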
We already have tools for trans-compression, column pruning, etc. We will
consolidate all of these tools into this universal tool.