Hi there. I have two Opsview master servers with a shared storage disk on which the /usr/local/nagios and /var/lib/mysql directories reside.
Yesterday I attempted our first failover test, which went surprisingly smoothly apart from two fairly minor issues:

1. The master monitoring server was still defined as the failed node, so it complained that it couldn't see its own nagios and opsview processes.
2. The new master could no longer communicate with its slaves. After a reload they all said "Host key verification failed".

Thoughts:

1. Will I have to manually change the master monitoring server to the new node after a failover (Advanced / Monitoring Servers / Master Monitoring Server / Host)? Or will it suffice to define the master monitoring server by the HA "virtual" IP, which flip-flops between the two nodes? I assume in the latter case that Opsview will be fooled into treating both nodes as the same host.
2. I can confirm that the slaves have the public key of both master nodes and both master nodes have the public keys of the slaves. I can also confirm that after an "opsview-slave restart" the reverse tunnels were created successfully. I cannot understand what could be wrong. What happens during the reload that would generate a "Host key verification failed" error?

Any help you could provide on these issues would be greatly appreciated.

Thanks very much.
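
One thing that may be worth checking on the second issue: "Host key verification failed" is the message the OpenSSH client prints when the remote host's key is missing from, or does not match, the client's known_hosts file. That is a separate check from the public-key (authorized_keys) exchange described above. The Python sketch below is only an illustration, assuming the standard OpenSSH known_hosts format; the function name, host names and any file path used with it are hypothetical, and nothing here is specific to Opsview. It reports whether a given host, optionally with a non-default port, has an entry in the text of a known_hosts file, covering both plain and hashed entries.

```python
import base64
import hashlib
import hmac


def host_in_known_hosts(known_hosts_text, host, port=22):
    """Return True if the known_hosts text has an entry for host (and port).

    Hypothetical helper for illustration; wildcard patterns are not handled.
    """
    # OpenSSH records non-default ports as "[host]:port".
    name = host if port == 22 else "[%s]:%d" % (host, port)
    for line in known_hosts_text.splitlines():
        line = line.strip()
        # Skip blanks, comments, and marker lines (@cert-authority, @revoked),
        # which this sketch deliberately ignores.
        if not line or line.startswith("#") or line.startswith("@"):
            continue
        fields = line.split()
        if len(fields) < 3:
            continue  # expect: names, key type, key
        patterns = fields[0]
        if patterns.startswith("|1|"):
            # Hashed entry: |1|base64(salt)|base64(HMAC-SHA1(key=salt, msg=name))
            try:
                _, _, salt_b64, hash_b64 = patterns.split("|")
                salt = base64.b64decode(salt_b64)
                expected = base64.b64decode(hash_b64)
            except ValueError:
                continue  # malformed hashed entry
            actual = hmac.new(salt, name.encode(), hashlib.sha1).digest()
            if hmac.compare_digest(actual, expected):
                return True
        elif name in patterns.split(","):
            # Plain entry: comma-separated list of names and addresses.
            return True
    return False
```

To use it, read the known_hosts file of whichever account opens the SSH connections on each master node and pass its contents in, once per slave name or address. If an entry is present on the old master but absent on the new one, that difference would be consistent with the error, although it does not by itself confirm the cause.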
