Github user dbrinegar commented on the issue:
https://github.com/apache/spark/pull/22381
This is quite a nice improvement for monitoring. The rest api is great for
UI and consolidated analytics, but monitoring through it is not as
straightforward as when the data emits directly from the source like this.
There is all kinds of nice context that we get when the data from this spark
node is collected directly from the node itself, and not proxied through
another collector / reporter. It is easier to build a monitoring data model
across the cluster when node, jmx, pod, resource manifests, and spark data all
align by virtue of coming from the same collector. Building a similar view of
the cluster just from the rest api, as a comparison, is simply harder and quite
challenging to do in general purpose terms.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]