slfan1989 commented on PR #7552: URL: https://github.com/apache/ozone/pull/7552#issuecomment-2537668774
> Thanks for the insight @slfan1989 > > > Additionally, I don’t recommend adding too many metrics in the DataNode, as there are already a large number of metrics, which puts considerable pressure on our collection system. > > Can you elaborate on your metrics collection system? Are you using Prometheus or a different DB? @errose28 Thank you very much for your reply! I have added two surveillance screenshots above to illustrate the system status before and after the deletion. We have a dedicated SRE team responsible for overall metrics collection. It was probably last month when a colleague who was working with me reported that the Ozone metrics, especially those from the DataNode, have become excessive, causing delays in data collection. Our backend storage system is OpenTSDB, and part of it has already been upgraded to ClickHouse. I will reach out to my SRE colleagues to gather more information and then have further discussions with you. cc: @adoroszlai @ChenSammi -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
