[
https://issues.apache.org/jira/browse/HUDI-5738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17821998#comment-17821998
]
Geser Dugarov commented on HUDI-5738:
-------------------------------------
[~guoyihua], [~jonvex], hi!
I'm trying to figure out how to handle the huge number of configuration
parameters in Hudi. I started by collecting data and want to share the current
results.
Note that all numbers are for the master branch at commit
d0929341d0ad25b3ce587d7f229d2fcdf7796768, as of Feb 26, 2024.
The number of parameters is really huge: the current total is {*}899
parameters{*}.
By modules:
* hudi-flink-datasource - 130 parameters,
* hudi-spark-datasource - 45,
* hudi-client-common - 294,
* hudi-common - 184,
* hudi-utilities - 143,
* kafka, hive, sync, aws, gcp, and datahub related - 103.
740 parameters are marked as advanced (only 9 of them are marked as deprecated).
That leaves 159 non-advanced parameters for now.
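As a quick sanity check on the figures above, the per-module counts can be tallied and compared against the reported total. This is only a minimal sketch in Python: the dictionary keys are labels copied from the list, and the variable names are illustrative, not anything from the Hudi codebase.

```python
# Per-module parameter counts, copied from the list in this comment.
counts = {
    "hudi-flink-datasource": 130,
    "hudi-spark-datasource": 45,
    "hudi-client-common": 294,
    "hudi-common": 184,
    "hudi-utilities": 143,
    "kafka, hive, sync, aws, gcp, datahub": 103,
}

# Total across all modules; should match the reported 899.
total = sum(counts.values())

# Reported number of parameters marked as advanced.
advanced = 740

# Everything not marked as advanced; should match the reported 159.
non_advanced = total - advanced

# Share of each module in the total, in percent, largest first.
shares = sorted(
    ((name, round(100 * n / total, 1)) for name, n in counts.items()),
    key=lambda item: item[1],
    reverse=True,
)

print(f"total={total}, advanced={advanced}, non_advanced={non_advanced}")
for name, pct in shares:
    print(f"{name}: {pct}%")
```

The module counts sum to 899 and the remainder after the 740 advanced parameters is 159, so the figures are consistent with each other.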
> Config Simplification
> ---------------------
>
> Key: HUDI-5738
> URL: https://issues.apache.org/jira/browse/HUDI-5738
> Project: Apache Hudi
> Issue Type: Epic
> Components: configs, docs
> Reporter: Jonathan Vexler
> Assignee: Ethan Guo
> Priority: Major
> Fix For: 1.1.0
>
>
> EPIC for simplifying configs and improving usability around configs in Hudi