Hi devs,

Flink's Configuration page [1] has grown intimidatingly long and complex.
Options are described across three main sections: common options (single
section), advanced options (multiple sections), and full reference. The
trailing "background" section further describes the most impactful options
in much greater detail.

Several recent tickets, and a few that are still open, add missing options
to the configuration documentation. I'd like to propose the goal of
organizing all options in the full reference into alphabetized tables
(one table per section), much like the system metrics [2]. The columns
would be option name, description, and default value.

The common and advanced sections could also be converted to tabular form
with the exception of Kerberos-based Security. Missing options would be
added to the full reference.

Lastly, the simple heuristic for configuring network buffers [3] has
prompted many questions on the mailing list. As of the 1.3 release, both
the total number of buffers and the number of available buffers are
reported through metrics and in the web dashboard. My experience has been
that the number of required buffers is highly dependent on job topology and
cluster performance. I propose keeping the simple heuristic and its
description while directing users to monitor how many buffers remain
available.
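
For reference, here is a rough sketch of the heuristic as I recall it from
the docs [3] (slots per TaskManager squared, times the number of
TaskManagers, times 4), plus the kind of availability check a user could
apply to the reported values. The class, method, and parameter names are
hypothetical and not part of Flink; please verify the formula against [3].

```java
public class NetworkBufferEstimate {

    // Rule of thumb as recalled from the docs (an assumption, check [3]):
    // slotsPerTaskManager^2 * taskManagers * 4
    public static long estimateBuffers(int slotsPerTaskManager, int taskManagers) {
        return (long) slotsPerTaskManager * slotsPerTaskManager * taskManagers * 4L;
    }

    // Fraction of buffers still available, given the total and available
    // counts read from the metrics or the web dashboard.
    public static double availableFraction(long totalBuffers, long availableBuffers) {
        if (totalBuffers <= 0) {
            throw new IllegalArgumentException("totalBuffers must be positive");
        }
        return (double) availableBuffers / totalBuffers;
    }

    public static void main(String[] args) {
        // Example cluster: 20 TaskManagers with 8 slots each.
        long estimate = estimateBuffers(8, 20);
        System.out.println("estimated buffers: " + estimate);
        System.out.println("available fraction: " + availableFraction(estimate, 1280));
    }
}
```

A consistently low available fraction would suggest raising the configured
number of buffers; a consistently high one would suggest the estimate is
generous for that job.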

Greg

[1] https://ci.apache.org/projects/flink/flink-docs-master/setup/config.html
[2] https://ci.apache.org/projects/flink/flink-docs-master/monitoring/metrics.html#system-metrics
[3] https://ci.apache.org/projects/flink/flink-docs-master/setup/config.html#configuring-the-network-buffers
