virajjasani commented on PR #5396: URL: https://github.com/apache/hadoop/pull/5396#issuecomment-1433612218
Thanks for the reply, Ayush, appreciate it as always. Are you saying that even a default implementation that only logs and/or exposes a JMX metric when a datanode does not stay connected to the active namenode is not feasible? I know we already have the "lastHeartbeat" and "lastHeartbeatResponseTime" metrics, but it is still difficult for a user or a script to loop over the BP service actor metrics, compared with getting a simple log line or metric such as "this datanode has not heard from the active namenode in the last 60s or so". Are you at least fine with keeping this as the default implementation logic?
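To illustrate the loop a user or script has to write today, here is a minimal sketch of deriving "has not heard from the active namenode in the last 60s" from per-actor heartbeat data. It is not code from the PR or from Hadoop: `ActorSnapshot`, its fields, and both method names are hypothetical, and it assumes each BP service actor entry exposes whether its namenode is active and how many seconds have passed since its last heartbeat.

```java
import java.util.List;
import java.util.OptionalLong;

/** Hypothetical sketch; none of these types exist in Hadoop. */
class ActiveNamenodeHeartbeatCheck {

    /** Assumed per-actor view, e.g. one entry parsed from the BP service actor metrics. */
    static final class ActorSnapshot {
        final String namenodeAddress;
        final boolean active;                  // assumption: active/standby state is known
        final long secondsSinceLastHeartbeat;  // assumption: derived from "lastHeartbeat"

        ActorSnapshot(String namenodeAddress, boolean active, long secondsSinceLastHeartbeat) {
            this.namenodeAddress = namenodeAddress;
            this.active = active;
            this.secondsSinceLastHeartbeat = secondsSinceLastHeartbeat;
        }
    }

    /** Seconds since the most recent heartbeat to an active namenode, if any actor is active. */
    static OptionalLong secondsSinceActiveHeartbeat(List<ActorSnapshot> actors) {
        return actors.stream()
                .filter(a -> a.active)
                .mapToLong(a -> a.secondsSinceLastHeartbeat)
                .min();
    }

    /** True when no active namenode is known or the last heartbeat is older than the threshold. */
    static boolean lostActiveNamenode(List<ActorSnapshot> actors, long thresholdSeconds) {
        OptionalLong elapsed = secondsSinceActiveHeartbeat(actors);
        return !elapsed.isPresent() || elapsed.getAsLong() > thresholdSeconds;
    }
}
```

A single boolean or gauge computed this way inside the datanode is what the proposed default log line or metric would replace on the user side.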