virajjasani commented on PR #5396: URL: https://github.com/apache/hadoop/pull/5396#issuecomment-1433668204
Any BP service actor with "Namenode HA state" as "Active" and "Last Heartbeat Response" > 60s (configurable), should be treated as "State Active Namenode". Maybe we can do that. Alright, sorry for adding up more and more comments, let me find the best way to expose things. For cloud native infra, it's still not easy to let metrics be exposed to the pod where we want to but will have to go for some security approvals, will work on this in parallel. Let me try fixing or at least normalizing the Namenode states in such a manner that we can expose "Stale Active Namenode" kind of Namenode HA state in metrics. That would be fairly easy for client to consume. It should also not be backward incompatible given that HDFS-16902 has been a very recent change. So making changes now in it before it can make it to a release should be fine I guess. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org