[ https://issues.apache.org/jira/browse/ARTEMIS-1324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16117353#comment-16117353 ]
ASF GitHub Bot commented on ARTEMIS-1324: ----------------------------------------- Github user michaelandrepearce commented on a diff in the pull request: https://github.com/apache/activemq-artemis/pull/1443#discussion_r131769789 --- Diff: docs/user-manual/en/critical-analysis.md --- @@ -0,0 +1,32 @@ +# Critical Analysis of the broker + +There are a few things that can go wrong on a production environment: + +- Bugs, for more than we try they still happen! We always try to correct them, but that's the only constant in software development. +- IO Errors, disks and hardware can go bad +- Memory issues, the CPU can go crazy by another process + +For cases like this, we added a protection to the broker to shut itself down when bad things happen. + +We measure time response in places like: + +- Queue delivery (add to the queue) +- Journal storage +- Paging operations + --- End diff -- I was just thinking the averages and percentiles of this, would make great things to be monitoring when healthy. > Critical Analysis and deadlock detection on broker > -------------------------------------------------- > > Key: ARTEMIS-1324 > URL: https://issues.apache.org/jira/browse/ARTEMIS-1324 > Project: ActiveMQ Artemis > Issue Type: New Feature > Components: Broker > Reporter: clebert suconic > Assignee: clebert suconic > Priority: Critical > Fix For: 2.3.0 > > -- This message was sent by Atlassian JIRA (v6.4.14#64029)