> > -----Original Message-----
> > From: Alan Conway [mailto:[email protected]]
> >
> > There is another issue that turned up with the same symptom
> > https://issues.apache.org/jira/browse/QPID-225. Fixed in r888874. Let
> > me know if
> > that resolves your problem, if not I want to look into it more.
> 
> Thanks Alan.
> 
> I don't think that's the issue, because this system should only be
> receiving connections from clients who are using AUTO_ACKNOWLEDGE, but
> I'm happy to try it.
> 
> I will let you know how it goes.

I made a new build of the java client from trunk today, and used that to spam a 
cluster of two brokers running qpidd-0.5.752581-34 and rhm-0.5.3206-27 and saw 
the error again:

2009-dec-15 12:10:34 trace 10.59.174.211:19320(READY) DLVR 27677: Frame[BEbe; 
channel=0; {SessionCompletedBody: commands={ [0,203] }; }] data 
10.59.174.186:24880-40
2009-dec-15 12:10:34 debug [email protected]: 
sender marked completed: { [0,203] }
2009-dec-15 12:10:34 debug Exception constructed: 
[email protected]: confirmed < (204+0) but only 
sent < (203+0) (qpid/SessionState.cpp:163)
2009-dec-15 12:10:34 error Execution exception: invalid-argument: 
[email protected]: confirmed < (204+0) but only 
sent < (203+0) (qpid/SessionState.cpp:163)
2009-dec-15 12:10:34 error 10.59.174.211:19320(READY/error) channel error 27677 
on 10.59.174.186:24880-40(shadow): invalid-argument: 
[email protected]: confirmed < (204+0) but only 
sent < (203+0) (qpid/SessionState.cpp:163) (unresolved: 10.59.174.186:24880 
10.59.174.211:19320 )
2009-dec-15 12:10:34 trace MCAST 10.59.174.211:19320-0: {ClusterErrorCheckBody: 
type=1; frame-seq=27677; }

OS is an up to date installation of RHEL 5.4 32 bit running on VMWare.

Qpidd.conf is:

no-module-dir=true
load-module=/usr/lib/qpid/daemon/cluster.so
load-module=/usr/lib/qpid/daemon/msgstore.so
cluster-mechanism=PLAIN
cluster-username="***"
cluster-password="***"
cluster-name="na1m-dev1"
log-to-file=/var/lib/qpidd/qpidd.log
trace=1
auth=yes
wait=60

JNDI connection string:

connectionfactory.qpidConnectionFactory = 
amqp://***:*...@test/?brokerlist='tcp://***.com:443?ssl='true',connectdelay='1000',connecttimeout='5000'',failover='roundrobin?cyclecount='999'',sync_publish='all'

The SSL is terminated at the load balancer.

I'd love to hear the community's ideas on where to look next.  I recently 
tested out a build of the 0.6beta1 release, and found that the bug (or at least 
the same symptoms) occurred there as well.  I don't have a good fast way to 
reproduce the problem.  This particular crash occurred after I spammed the 
cluster for about 7 minutes, during which time the spammer sent about 20k 
messages of between 1-1000 KB in size through the cluster, whilst periodically 
closing and reopening sending and receiving connections.

Thanks!

Sandy

---------------------------------------------------------------------
Apache Qpid - AMQP Messaging Implementation
Project:      http://qpid.apache.org
Use/Interact: mailto:[email protected]

Reply via email to