[
https://issues.apache.org/jira/browse/QPID-3893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Stitcher updated QPID-3893:
----------------------------------
Attachment: 0003-NO-JIRA-Remove-some-namespace-polluting-using-namesp.patch
Necessary extra code changes for the fix to compile under windows
> C++ broker appears to segfault during MultipleTransactedBatchProducerTest
> -------------------------------------------------------------------------
>
> Key: QPID-3893
> URL: https://issues.apache.org/jira/browse/QPID-3893
> Project: Qpid
> Issue Type: Bug
> Components: C++ Broker
> Affects Versions: 0.15, 0.16, 0.17
> Reporter: Robbie Gemmell
> Assignee: Andrew Stitcher
> Priority: Critical
> Fix For: 0.17
>
> Attachments:
> 0001-QPID-3893-C-broker-appears-to-segfault-during-Multip.patch,
> 0003-NO-JIRA-Remove-some-namespace-polluting-using-namesp.patch, testlog.zip
>
>
> The C++ broker appears to segfault during
> MultipleTransactedBatchProducerTest. Several instances can be found in the
> history from our CI jobs:
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/496/
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/494/
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/493/
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/491/
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/486/
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/484/
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/482/
> During course of the latest test failure in #496 (logs attached), the broker
> logged receiving yet another TxCommit command:
> {noformat}
> BROKER: 2012-03-11 16:55:15 trace RECV [127.0.0.1:15672-127.0.0.1:34582]:
> Frame[BEbe; channel=0; {TxCommitBody: }] )
> {noformat}
> After this nothing else is ever logged from the broker process, but
> exceptions start flying from the client indicating the connections were reset:
> {noformat}
> IoReceiver - localhost/127.0.0.1:15672 2012-03-11 16:55:15,313 ERROR
> [apache.qpid.client.AMQConnectionDelegate_0_10] previous exception
> org.apache.qpid.transport.ConnectionException: Connection reset
> at org.apache.qpid.transport.Connection.exception(Connection.java:512)
> at
> org.apache.qpid.transport.network.Assembler.exception(Assembler.java:107)
> at
> org.apache.qpid.transport.network.InputHandler.exception(InputHandler.java:199)
> at
> org.apache.qpid.transport.network.io.IoReceiver.run(IoReceiver.java:169)
> at java.lang.Thread.run(Thread.java:662)
> Caused by: java.net.SocketException: Connection reset
> at java.net.SocketInputStream.read(SocketInputStream.java:168)
> at
> org.apache.qpid.transport.network.io.IoReceiver.run(IoReceiver.java:147)
> ... 1 more
> {noformat}
> {noformat}
> IoSender - localhost/127.0.0.1:15672 2012-03-11 16:55:15,313 ERROR
> [transport.network.io.IoSender] error in write thread
> java.net.SocketException: Connection reset
> at java.net.SocketOutputStream.socketWrite(SocketOutputStream.java:96)
> at java.net.SocketOutputStream.write(SocketOutputStream.java:136)
> at org.apache.qpid.transport.network.io.IoSender.run(IoSender.java:313)
> at java.lang.Thread.run(Thread.java:662)
> {noformat}
> When the test tearDown occurs, it is logged that the broker process exited
> with an abnormal looking exit code, which a quick google suggests is the
> result of a segfault:
> {noformat}
> main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.QpidBrokerTestCase]
> stopping broker on port : 15672
> main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.SpawnedBrokerHolder]
> Destroying broker process
> main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.SpawnedBrokerHolder]
> broker exited: 139
> {noformat}
> The test fails after 75sec, which is the exact time the test itself allows
> for the process undertaken to complete (artificially high to allow for
> extreme slowdowns on the CI nodes which happened occasionally in the past and
> also the C++ broker being slower at the test due to use of client side
> selectors; passing results show this usually takes 1-3sec on the Java broker
> and <10sec on the [transient] C++ broker), however the test harness tries to
> clean up the connections use in the test during tearDown and actually ends up
> reporting issues cleaning those up rather than the fact that the
> CountDownLatch used in the test wasnt decremented enough times.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]