sivabalan narayanan created HUDI-8556:
-----------------------------------------
Summary: Trim the number of columns to generate col stats out of
the box
Key: HUDI-8556
URL: https://issues.apache.org/jira/browse/HUDI-8556
Project: Apache Hudi
Issue Type: Improvement
Components: dataskipping, metadata
Reporter: sivabalan narayanan
As of now, out of the box, we generate col stats for all top level fields. This
could be prohibitively expensive for wider tables having 1000 columns. So, we
should trim it down to say first 32 to level columns for a good out of the box
performance.
Users will anyway have an option to override if need be.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)