If tmp message store fills up, broker can deadlock due to while producers wait 
on disk space and consumers wait on acks
-----------------------------------------------------------------------------------------------------------------------

                 Key: AMQ-2475
                 URL: https://issues.apache.org/activemq/browse/AMQ-2475
             Project: ActiveMQ
          Issue Type: Bug
          Components: Broker, Message Store, Transport
    Affects Versions: 5.3.0
         Environment: Tested on Windows XP with JDK 1.60_13, but fairly sure it 
will be an issue on all platforms
            Reporter: Martin Murphy


I will attach a simple project that shows this. In the test the tmp space is 
set to 32 MB and two threads are created. One thread will constantly produce 
1KB messages and the other consumes these, but sleeps for 100ms, note that 
producer flow control is turned off as well. The goal here is to ensure that 
the producers block while the consumers read the rest of the messages from the 
broker and catch up, this in turn frees up the disk space and allows the 
producer to send more messages. This config means that you can bound the broker 
based on disk space rather than memory usage.

Unfortunately in this test using topics while the broker is reading in the 
message from the producer it has to lock the matched list it is adding it to. 
This is an abstract from the Topic's point of view and doesn't realize that the 
file may block based on the file system. 
{code}
    public void add(MessageReference node) throws Exception { //... snip ...
            if (maximumPendingMessages != 0) {
                synchronized (matchedListMutex) {   // We have this mutex
                    matched.addMessageLast(node); // ends up waiting for space
                    // NOTE - be careful about the slaveBroker!
                    if (maximumPendingMessages > 0) {
{code}
Meanwhile the consumer is sending acknowledgements for the 10 messages it just 
read in (the configured prefetch) from the same topic, but since they also 
modify the same list in the topic this waits as well on the mutex held to 
service the producer:
{code}
    private void dispatchMatched() throws IOException {       
        synchronized (matchedListMutex) {  // never gets passed here.
            if (!matched.isEmpty() && !isFull()) {
{code}
This is a fairly classic deadlock. The trick is now how to resolve this given 
the fact that the topic isn't aware that it's list may need to wait for the 
file system to clean up.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to