Akira AJISAKA created HDFS-8929:
-----------------------------------
Summary: Add a metric to expose the timestamp of the last journal
Key: HDFS-8929
URL: https://issues.apache.org/jira/browse/HDFS-8929
Project: Hadoop HDFS
Issue Type: New Feature
Components: journal-node
Reporter: Akira AJISAKA
If there are three JNs and only one JN is failing to journal, we can detect it
by monitoring the difference of the last written transaction id among JNs from
NN WebUI or JN metrics. However, it's difficult to define the threshold to
alert because the increase rate of the number of transaction depends on how
busy the cluster is. Therefore I'd like to propose a metric to expose the
timestamp of the last journal. That way we can easily alert if a JN is failing
to journal for some fixed period.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)