Hello!

Recently I updated Akka from 2.3.9 to 2.3.11, and for some reason my 
cluster started to fall apart. From time to time I'm getting errros like 
this:

INFO   | jvm 1    | 2015/07/16 11:45:39 | 2015-07-16 16:45:39,369 ERROR  [
EndpointWriter] AssociationError [akka.tcp://[email protected]:2551] 
-> [akka
.tcp://[email protected]:2551]: Error [Invalid address: 
akka.tcp://[email protected]:2551] [                               
                    
INFO   | jvm 1    | 2015/07/16 11:45:39 | akka.remote.InvalidAssociation: 
Invalid address: akka.tcp://[email protected]:2551                 
          
INFO   | jvm 1    | 2015/07/16 11:45:39 | Caused by: akka.remote.transport.
Transport$InvalidAssociationException: The remote system has quarantined 
this system
. No further associations to the remote system are possible until this 
system is restarted.
INFO   | jvm 1    | 2015/07/16 11:45:40 | 2015-07-16 16:45:40,526 WARN   [
ReliableDeliverySupervisor] Association with remote system [akka.tcp:
//HttpCluster@19
2.168.0.202:2551] has failed, address is now gated for [5000] ms. Reason: [
Disassociated]                                                             
         
INFO   | jvm 1    | 2015/07/16 11:45:40 | 2015-07-16 16:45:40,543 WARN   [
EndpointWriter] AssociationError [akka.tcp://[email protected]:2551] 
-> [akka
.tcp://[email protected]:2551]: Error [Invalid address: 
akka.tcp://[email protected]:2551]

I don't see any suspicious activities in logs, like connection reset or 
some other network errors, it just fails. The cluster-specific 
configuration looks like below:

    cluster {
        auto-down-unreachable-after = 10s

        failure-detector {
          threshold = 10
          heartbeat-interval = 10s
          acceptable-heartbeat-pause = 30 s
        }

        role {
          scheduler.min-nr-of-members = 1
          chunk.min-nr-of-members = 1
          http.min-nr-of-members = 1
        }

    }



Can somebody please advice how can I troubleshoot this problem? Or at least 
how can I intercept that cluster error and *restart* the cluster node that 
failed?

Thank you!

-- 
>>>>>>>>>>      Read the docs: http://akka.io/docs/
>>>>>>>>>>      Check the FAQ: 
>>>>>>>>>> http://doc.akka.io/docs/akka/current/additional/faq.html
>>>>>>>>>>      Search the archives: https://groups.google.com/group/akka-user
--- 
You received this message because you are subscribed to the Google Groups "Akka 
User List" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/akka-user.
For more options, visit https://groups.google.com/d/optout.

Reply via email to