On 03/15/2012 02:35 PM, Andrew Stitcher (Updated) (JIRA) wrote:

      [ 
https://issues.apache.org/jira/browse/QPID-3893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Stitcher updated QPID-3893:
----------------------------------

     Fix Version/s: 0.17

I think, as this is a regression, this needs to be in 0.16, or else we need to revert the changes that caused it.


C++ broker appears to segfault during MultipleTransactedBatchProducerTest
-------------------------------------------------------------------------

                 Key: QPID-3893
                 URL: https://issues.apache.org/jira/browse/QPID-3893
             Project: Qpid
          Issue Type: Bug
          Components: C++ Broker
    Affects Versions: 0.15, 0.16, 0.17
            Reporter: Robbie Gemmell
            Assignee: Andrew Stitcher
            Priority: Critical
             Fix For: 0.17

         Attachments: testlog.zip


The C++ broker appears to segfault during MultipleTransactedBatchProducerTest. 
Several instances can be found in the history from our CI jobs:
https://builds.apache.org/job/Qpid-Java-Cpp-Test/496/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/494/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/493/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/491/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/486/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/484/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/482/
During course of the latest test failure in #496 (logs attached), the broker 
logged receiving yet another TxCommit command:
{noformat}
BROKER: 2012-03-11 16:55:15 trace RECV [127.0.0.1:15672-127.0.0.1:34582]: 
Frame[BEbe; channel=0; {TxCommitBody: }] )
{noformat}
After this nothing else is ever logged from the broker process, but exceptions 
start flying from the client indicating the connections were reset:
{noformat}
IoReceiver - localhost/127.0.0.1:15672 2012-03-11 16:55:15,313 ERROR 
[apache.qpid.client.AMQConnectionDelegate_0_10] previous exception
org.apache.qpid.transport.ConnectionException: Connection reset
        at org.apache.qpid.transport.Connection.exception(Connection.java:512)
        at 
org.apache.qpid.transport.network.Assembler.exception(Assembler.java:107)
        at 
org.apache.qpid.transport.network.InputHandler.exception(InputHandler.java:199)
        at 
org.apache.qpid.transport.network.io.IoReceiver.run(IoReceiver.java:169)
        at java.lang.Thread.run(Thread.java:662)
Caused by: java.net.SocketException: Connection reset
        at java.net.SocketInputStream.read(SocketInputStream.java:168)
        at 
org.apache.qpid.transport.network.io.IoReceiver.run(IoReceiver.java:147)
        ... 1 more
{noformat}
{noformat}
IoSender - localhost/127.0.0.1:15672 2012-03-11 16:55:15,313 ERROR 
[transport.network.io.IoSender] error in write thread
java.net.SocketException: Connection reset
        at java.net.SocketOutputStream.socketWrite(SocketOutputStream.java:96)
        at java.net.SocketOutputStream.write(SocketOutputStream.java:136)
        at org.apache.qpid.transport.network.io.IoSender.run(IoSender.java:313)
        at java.lang.Thread.run(Thread.java:662)
{noformat}
When the test tearDown occurs, it is logged that the broker process exited with 
an abnormal looking exit code, which a quick google suggests is the result of a 
segfault:
{noformat}
main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.QpidBrokerTestCase] stopping 
broker on port : 15672
main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.SpawnedBrokerHolder] 
Destroying broker process
main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.SpawnedBrokerHolder] broker 
exited: 139
{noformat}
The test fails after 75sec, which is the exact time the test itself allows for the 
process undertaken to complete (artificially high to allow for extreme slowdowns 
on the CI nodes which happened occasionally in the past and also the C++ broker 
being slower at the test due to use of client side selectors; passing results show 
this usually takes 1-3sec on the Java broker and<10sec on the [transient] C++ 
broker), however the test harness tries to clean up the connections use in the 
test during tearDown and actually ends up reporting issues cleaning those up 
rather than the fact that the CountDownLatch used in the test wasnt decremented 
enough times.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]




---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to