[
https://issues.apache.org/jira/browse/UIMA-1658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12777038#action_12777038
]
Jerry Cwiklik commented on UIMA-1658:
-------------------------------------
Jorn, I was able to recreate the scenario. When the primary broker is killed, I
can see in jConsole that the UIMA AS service silently reconnects to the
remaining broker. The number of consumers is incremented as expected. I can
also see that the client sends GetMeta request to the queue in the remaining
broker. The enqueue count is incremented, however the dispatch count and
dequeue count are not incremented. This suggests that there is some problem
either in AMQ or Spring. There is no indication in the UIMA AS service that
there is a problem. No exception is thrown from Spring when the primary broker
is killed (as expected). The getMeta request is not being delivered to the UIMA
AS listener object. Can you confirm that this is happening with you as well?
> UIMA AS worker does not respond to client initialize after failover
> -------------------------------------------------------------------
>
> Key: UIMA-1658
> URL: https://issues.apache.org/jira/browse/UIMA-1658
> Project: UIMA
> Issue Type: Bug
> Components: Async Scaleout
> Reporter: Jörn Kottmann
> Fix For: 2.3AS
>
>
> A Pure Master Slave Broker is used to increase availability of the broker.
> More information about it can be found in the activemq documentation:
> http://activemq.apache.org/pure-master-slave.html
> In a test we simulated Master failure through killing the process with kill
> -9.
> Here is the log output from the worker node:
> INFO FailoverTransport - Successfully connected to
> tcp://XXX1:61616
> Here I stopped the master broker process with kill -9
> WARN FailoverTransport - Transport failed to tcp://XXX1:61616 ,
> attempting to automatically reconnect due to: java.io.EOFException
> WARN FailoverTransport - Transport failed to tcp://XXX1:61616 ,
> attempting to automatically reconnect due to: java.io.EOFException
> WARN FailoverTransport - Transport failed to tcp://XXX1:61616 ,
> attempting to automatically reconnect due to: java.io.EOFException
> INFO FailoverTransport - Successfully reconnected to
> tcp://XXX2:61616
> INFO FailoverTransport - Successfully reconnected to
> tcp://XXX2:61616
> INFO FailoverTransport - Successfully reconnected to
> tcp://XXX2:61616
> Afterwards the client was restarted but got a time out error during
> initialize.
> During initializes it sends a message to the worker nodes input queue, but
> this
> message if never retrieved.
> I used the activemq web interface to get some information about the message:
> Command 2001
> MessageFrom ID:XXXX-51032-1257865414664-0:1:1
> ServerURI failover:(tcp://XXX1:61616,tcp://XXX2:61616)?randomize=false
> MessageType 3000
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.