[
https://issues.apache.org/jira/browse/DISPATCH-383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15327955#comment-15327955
]
Vishal Sharda commented on DISPATCH-383:
----------------------------------------
These are intermittent crashes and I do not yet have a test case that can
reliably reproduce them.
1. If I restarted R1 with different number of threads, both R2 and R3 crashed
with the same backtrace which is attached here. On a later run, I saw crash
only in R2.
2. Yes, this could most likely be timing issue with multithreading on. There
is no way for us to control/prevent this from occurring again. The steps
involved were simple - interrupting the router, editing the configuration file
and starting it again.
3. I have not tested this without SSL but the intermittent crashes that I was
seeing due to SASL (DISPATCH-358) no longer appear after upgrading to
Proton-0.13.0-RC. Hence, I keep 2-way SSL enabled for all inter-router
communication during my tests.
> Intermittent router crashes when restarting one router in the network with
> different number of threads
> ------------------------------------------------------------------------------------------------------
>
> Key: DISPATCH-383
> URL: https://issues.apache.org/jira/browse/DISPATCH-383
> Project: Qpid Dispatch
> Issue Type: Bug
> Components: Routing Engine
> Affects Versions: 0.6.0
> Environment: Debian 8.3, Apache Qpid Proton 0.13.0-RC for drivers and
> dependencies, Hardware: 2 CPUs, 15 GB RAM, 60 GB HDD each on 3 separate
> machines
> Reporter: Vishal Sharda
> Assignee: Ganesh Murthy
> Priority: Critical
> Attachments: Crash_route_tables_1.png, Crash_route_tables_2.png,
> Crash_route_tables_3.png
>
>
> Network: A network of 3 interior routers built using the latest trunk and
> connected to each other using 2-way SSL.
> Stopping one router in the network, changing its number of threads in the
> configuration file and starting it again to join the network causes
> intermittent crash in other routers in the network.
> I was able to reproduce the crash three times and collect the backtraces
> inside gdb (screenshots attached).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]