GitHub user suryaprasanna added a comment to the discussion: Parquet Tool 
Interface for File-Level Operations in Clustering

The use cases where we use column pruning is in nullifying unused columns on 
historical data, to save storage space. But users can also leverage the 
interface to remove already drop columns from the data files.

GitHub link: 
https://github.com/apache/hudi/discussions/17958#discussioncomment-15556776

----
This is an automatically sent email for [email protected].
To unsubscribe, please send an email to: [email protected]

Reply via email to