Github user michaelandrepearce commented on a diff in the pull request: https://github.com/apache/activemq-artemis/pull/1443#discussion_r131769789 --- Diff: docs/user-manual/en/critical-analysis.md --- @@ -0,0 +1,32 @@ +# Critical Analysis of the broker + +There are a few things that can go wrong on a production environment: + +- Bugs, for more than we try they still happen! We always try to correct them, but that's the only constant in software development. +- IO Errors, disks and hardware can go bad +- Memory issues, the CPU can go crazy by another process + +For cases like this, we added a protection to the broker to shut itself down when bad things happen. + +We measure time response in places like: + +- Queue delivery (add to the queue) +- Journal storage +- Paging operations + --- End diff -- I was just thinking the averages and percentiles of this, would make great things to be monitoring when healthy.
--- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---