[ 
https://issues.apache.org/jira/browse/QPID-3893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robbie Gemmell updated QPID-3893:
---------------------------------

    Description: 
The C++ broker appears to segfault during MultipleTransactedBatchProducerTest. 
Several instances can be found in the history from our CI jobs:

https://builds.apache.org/job/Qpid-Java-Cpp-Test/496/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/494/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/493/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/491/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/486/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/484/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/482/

During course of the latest test failure in #496 (logs attached), the broker 
logged receiving yet another TxCommit command:

{noformat}
BROKER: 2012-03-11 16:55:15 trace RECV [127.0.0.1:15672-127.0.0.1:34582]: 
Frame[BEbe; channel=0; {TxCommitBody: }] )
{noformat}

After this nothing else is ever logged from the broker process, but exceptions 
start flying from the client indicating the connections were reset:

{noformat}
IoReceiver - localhost/127.0.0.1:15672 2012-03-11 16:55:15,313 ERROR 
[apache.qpid.client.AMQConnectionDelegate_0_10] previous exception
org.apache.qpid.transport.ConnectionException: Connection reset
        at org.apache.qpid.transport.Connection.exception(Connection.java:512)
        at 
org.apache.qpid.transport.network.Assembler.exception(Assembler.java:107)
        at 
org.apache.qpid.transport.network.InputHandler.exception(InputHandler.java:199)
        at 
org.apache.qpid.transport.network.io.IoReceiver.run(IoReceiver.java:169)
        at java.lang.Thread.run(Thread.java:662)
Caused by: java.net.SocketException: Connection reset
        at java.net.SocketInputStream.read(SocketInputStream.java:168)
        at 
org.apache.qpid.transport.network.io.IoReceiver.run(IoReceiver.java:147)
        ... 1 more
{noformat}

{noformat}
IoSender - localhost/127.0.0.1:15672 2012-03-11 16:55:15,313 ERROR 
[transport.network.io.IoSender] error in write thread
java.net.SocketException: Connection reset
        at java.net.SocketOutputStream.socketWrite(SocketOutputStream.java:96)
        at java.net.SocketOutputStream.write(SocketOutputStream.java:136)
        at org.apache.qpid.transport.network.io.IoSender.run(IoSender.java:313)
        at java.lang.Thread.run(Thread.java:662)
{noformat}

When the test tearDown occurs, it is logged that the broker process exited with 
an abnormal looking exit code, which a quick google suggests is the result of a 
segfault:
{noformat}
main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.QpidBrokerTestCase] stopping 
broker on port : 15672
main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.SpawnedBrokerHolder] 
Destroying broker process
main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.SpawnedBrokerHolder] broker 
exited: 139
{noformat}

The test fails after 75sec, which is the exact time the test itself allows for 
the process undertaken to complete (artificially high to allow for extreme 
slowdowns on the CI nodes which happened occasionally in the past and also the 
C++ broker being slower at the test due to use of client side selectors; 
passing results show this usually takes 1-3sec on the Java broker and <10sec on 
the [transient] C++ broker), however the test harness tries to clean up the 
connections use in the test during tearDown and actually ends up reporting 
issues cleaning those up rather than the fact that the CountDownLatch used in 
the test wasnt decremented enough times.


  was:
The C++ broker appears to segfault during MultipleTransactedBatchProducerTest. 
Several instances can be found in the history from our CI jobs:

https://builds.apache.org/job/Qpid-Java-Cpp-Test/496/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/494/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/493/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/491/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/486/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/484/
https://builds.apache.org/job/Qpid-Java-Cpp-Test/482/

During course of the latest test failure in #496 (logs attached), the broker 
logged receiving yet another TxCommit command:

{noformat}
BROKER: 2012-03-11 16:55:15 trace RECV [127.0.0.1:15672-127.0.0.1:34582]: 
Frame[BEbe; channel=0; {TxCommitBody: }] )
{noformat}

After this nothing else is ever logged from the broker process, but exceptions 
start flying from the client indicating the connections were reset:

{noformat}
IoReceiver - localhost/127.0.0.1:15672 2012-03-11 16:55:15,313 ERROR 
[apache.qpid.client.AMQConnectionDelegate_0_10] previous exception
org.apache.qpid.transport.ConnectionException: Connection reset
        at org.apache.qpid.transport.Connection.exception(Connection.java:512)
        at 
org.apache.qpid.transport.network.Assembler.exception(Assembler.java:107)
        at 
org.apache.qpid.transport.network.InputHandler.exception(InputHandler.java:199)
        at 
org.apache.qpid.transport.network.io.IoReceiver.run(IoReceiver.java:169)
        at java.lang.Thread.run(Thread.java:662)
Caused by: java.net.SocketException: Connection reset
        at java.net.SocketInputStream.read(SocketInputStream.java:168)
        at 
org.apache.qpid.transport.network.io.IoReceiver.run(IoReceiver.java:147)
        ... 1 more
{noformat}

{noformat}
IoSender - localhost/127.0.0.1:15672 2012-03-11 16:55:15,313 ERROR 
[transport.network.io.IoSender] error in write thread
java.net.SocketException: Connection reset
        at java.net.SocketOutputStream.socketWrite(SocketOutputStream.java:96)
        at java.net.SocketOutputStream.write(SocketOutputStream.java:136)
        at org.apache.qpid.transport.network.io.IoSender.run(IoSender.java:313)
        at java.lang.Thread.run(Thread.java:662)
{noformat}

When the test tearDown occurs, it is logged that the broker process exited with 
an abnormal looking exit code, which a quick google suggests is the result of a 
segfault:
{noformat}
main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.QpidBrokerTestCase] stopping 
broker on port : 15672
main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.SpawnedBrokerHolder] 
Destroying broker process
main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.SpawnedBrokerHolder] broker 
exited: 139
{noformat}

The test fails after 75sec, which is the exact time the test itself allows for 
the process undertaken to complete (artificially high to allow for extreme 
slowdowns on the CI nodes which happened occasionally in the past and also the 
C++ broker being slower at the test due to use of client side selectors; 
passing results show this usually takes 1-3sec on the Java broker and <10sec on 
the [transient] C++ broker), however the test hardness tries to clean up the 
connections use in the test during tearDown and actually ends up reporting 
issues cleaning those up rather than the fact that the CountDownLatch used in 
the test wasnt decremented enough times.


    
> C++ broker appears to segfault during MultipleTransactedBatchProducerTest
> -------------------------------------------------------------------------
>
>                 Key: QPID-3893
>                 URL: https://issues.apache.org/jira/browse/QPID-3893
>             Project: Qpid
>          Issue Type: Bug
>          Components: C++ Broker
>    Affects Versions: 0.15
>            Reporter: Robbie Gemmell
>            Priority: Critical
>
> The C++ broker appears to segfault during 
> MultipleTransactedBatchProducerTest. Several instances can be found in the 
> history from our CI jobs:
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/496/
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/494/
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/493/
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/491/
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/486/
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/484/
> https://builds.apache.org/job/Qpid-Java-Cpp-Test/482/
> During course of the latest test failure in #496 (logs attached), the broker 
> logged receiving yet another TxCommit command:
> {noformat}
> BROKER: 2012-03-11 16:55:15 trace RECV [127.0.0.1:15672-127.0.0.1:34582]: 
> Frame[BEbe; channel=0; {TxCommitBody: }] )
> {noformat}
> After this nothing else is ever logged from the broker process, but 
> exceptions start flying from the client indicating the connections were reset:
> {noformat}
> IoReceiver - localhost/127.0.0.1:15672 2012-03-11 16:55:15,313 ERROR 
> [apache.qpid.client.AMQConnectionDelegate_0_10] previous exception
> org.apache.qpid.transport.ConnectionException: Connection reset
>       at org.apache.qpid.transport.Connection.exception(Connection.java:512)
>       at 
> org.apache.qpid.transport.network.Assembler.exception(Assembler.java:107)
>       at 
> org.apache.qpid.transport.network.InputHandler.exception(InputHandler.java:199)
>       at 
> org.apache.qpid.transport.network.io.IoReceiver.run(IoReceiver.java:169)
>       at java.lang.Thread.run(Thread.java:662)
> Caused by: java.net.SocketException: Connection reset
>       at java.net.SocketInputStream.read(SocketInputStream.java:168)
>       at 
> org.apache.qpid.transport.network.io.IoReceiver.run(IoReceiver.java:147)
>       ... 1 more
> {noformat}
> {noformat}
> IoSender - localhost/127.0.0.1:15672 2012-03-11 16:55:15,313 ERROR 
> [transport.network.io.IoSender] error in write thread
> java.net.SocketException: Connection reset
>       at java.net.SocketOutputStream.socketWrite(SocketOutputStream.java:96)
>       at java.net.SocketOutputStream.write(SocketOutputStream.java:136)
>       at org.apache.qpid.transport.network.io.IoSender.run(IoSender.java:313)
>       at java.lang.Thread.run(Thread.java:662)
> {noformat}
> When the test tearDown occurs, it is logged that the broker process exited 
> with an abnormal looking exit code, which a quick google suggests is the 
> result of a segfault:
> {noformat}
> main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.QpidBrokerTestCase] 
> stopping broker on port : 15672
> main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.SpawnedBrokerHolder] 
> Destroying broker process
> main 2012-03-11 16:56:29,949 INFO [qpid.test.utils.SpawnedBrokerHolder] 
> broker exited: 139
> {noformat}
> The test fails after 75sec, which is the exact time the test itself allows 
> for the process undertaken to complete (artificially high to allow for 
> extreme slowdowns on the CI nodes which happened occasionally in the past and 
> also the C++ broker being slower at the test due to use of client side 
> selectors; passing results show this usually takes 1-3sec on the Java broker 
> and <10sec on the [transient] C++ broker), however the test harness tries to 
> clean up the connections use in the test during tearDown and actually ends up 
> reporting issues cleaning those up rather than the fact that the 
> CountDownLatch used in the test wasnt decremented enough times.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to