We have a scenario that nodes lost contact. It happened a few times already. In 
the most recent case, we have a payload node app2 lost contact with all the 
other nodes. This was not because of node went down.

Mar 24 18:00:07 app2 osafdtmd[56436]: NO Lost contact with app4
Mar 24 18:00:08 app2 osafdtmd[56436]: NO Lost contact with db1
Mar 24 18:00:08 app2 osafdtmd[56436]: NO Lost contact with db2

Mar 24 18:03:40 app2 osafdtmd[56436]: NO Lost contact with app1
Mar 24 18:03:40 app2 osafdtmd[56436]: NO Lost contact with app3
Mar 24 18:03:40 app2 osafamfnd[56494]: ER AMF director unexpectedly crashed

db1, db2 and app4 are also payload nodes:
Mar 24 18:00:08 db1 osafdtmd[26145]: NO Lost contact with app2
Mar 24 18:00:08 db2 osafdtmd[3341]: NO Lost contact with app2
Mar 24 18:00:07 app4 osafdtmd[65441]: NO Lost contact with app2

app1 is active controller node:
Mar 24 18:03:36 app1 osafdtmd[27475]: NO Lost contact with app2

app3 is standby controller node:
Mar 24 18:03:36 app3 osafdtmd[64103]: NO Lost contact with app2

What could be the reasons that nodes lost contact other than OpenSAF service 
stopped or node reboot?
If this is a communication problem, how can we verify or diagnose it?

Thank you!


Shu Wang | Senior Analyst | +1(407)708-5117 or x3917| www.NetCracker.com
Proven Partner to Communications Service Providers




________________________________
The information transmitted herein is intended only for the person or entity to 
which it is addressed and may contain confidential, proprietary and/or 
privileged material. Any review, retransmission, dissemination or other use of, 
or taking of any action in reliance upon, this information by persons or 
entities other than the intended recipient is prohibited. If you received this 
in error, please contact the sender and delete the material from any computer.
------------------------------------------------------------------------------
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Opensaf-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-users

Reply via email to