Re: ActiveMQ 5.10.0 queue slowed down, restart helped

Mark Schmitt | Intratop Tue, 17 Feb 2015 05:08:37 -0800

Hi,

I work with Piotr on this issue. Let me try to provide some additional
information on our slow-down issue:


Storage is a PostgreSQL Server 9.3.2 on a Debian Wheezy / Kernel 3.2.51-1
System.

We use JDBC and the PGPoolingDataSource
(org.postgresql.ds.PGPoolingDataSource).

This is the persistenceAdapter configuration:
        <persistenceAdapter>

<jdbcPersistenceAdapter dataDirectory="activemq-data"dataSource="#postgres-ds" lockKeepAlivePeriod="0"

createTablesOnStartup="false" />
        </persistenceAdapter>

We have 2 destination interceptors setup. And we run the demo code

(jetty-demo) because we have some applications using the http/restinterface it provides. We don't run camel.

Other than that it's a pretty mondane setup. And we also run twoinstances at the same time as a sort of fail-over. Because of thejdbc-backend, only one of them is active, and we use the failoverprotocol on clientside to use the active one. We use haproxy to servethe webinterface from the active instance. Both activemq-instances runon the same linux box, with different service ip-adresses. (they use thesame binaries, only configuration and data directory are separated). Thereason we run two instances is that we had big stability issues before,with the activemq process sort-of-hangingitself up. We could move away from that setup, because with 5.10 thishasn't happened.

Like the database server, the linux box that runs the activemq instanceis a Debian Wheezy Linux, but with Kernel 3.2.60-1+deb7u1.


Problem description: Once in a while we see 100% cpu load on the database.
We can isolate that to sql statements of the style:

SELECT ID, PRIORITY FROM ACTIVEMQ_MSGS WHEREMSGID_PROD='ID:tomcat10-XXX-41356-1422538681150-1:95156:1:1' ANDMSGID_SEQ='1' AND CONTAINER='queue://XXX_export'

These sql statements take more than 500ms. We've had scenarios wherethey took more than 3 seconds to complete. Queuesize for 500ms was ~1200messages for all queues (concentrated in one queue). With a productionof about 2-3 Messages per seconds and a consumption of about 2 messagesper second. Imho the queuesize and the query-time scales linearly.

We were able to "resolve" the issue by restarting both activemqinstances. After that, the load on the database drops dramatically,instead of 100% cpu usage we see less than 10% on the database and avery fast recovery. The ActiveMQ-Processes look fine too.

My first quess was a missing database index, but they look fine.Besides, restarting the activemq instances resolves the issue .. whichis very very weired for me .. I don't think it's a database lock either,because we couldn't see any and additionally, we see 100% cpu usage forthe process executing the statement (postgres spawns a process perstatement). That should imho (but I'm no database expect) not happen aswell when there's a lock situation...


We're at a loss. Do you guys have an idea?

And one more thing: Once every two or three hours a lot of (severalthousand) messages are created. But the above described problem ishappening irregularly, every one or two weeks or so.


Best regards,
Mark

Re: ActiveMQ 5.10.0 queue slowed down, restart helped

Reply via email to