Hi there.

I have two Opsview master servers with a shared storage disk on which
the /usr/local/nagios and /var/lib/mysql directories reside.

Yesterday I attempted our first failover test, which went surprisingly
smoothly apart from two fairly minor issues:

1. The master monitoring server was still defined as the failed node
so it complained that it couldn't see its own nagios and opsview
processes.

2. The new master could no longer communicate with its slaves. After a
reload they all said "Host key verification failed".

Thoughts...

1. Will I have to manually change the master monitoring server to the
new node after a failover (Advanced / Monitoring Servers / Master
Monitoring Server / Host)? Or would it suffice to define the master
monitoring server by the HA "virtual" IP, which flips between the
two nodes? I assume that in the latter case Opsview will be fooled
into treating both nodes as the same host.

2. I can confirm that the slaves have the public keys of both master
nodes and that both master nodes have the public keys of the slaves. I
can also confirm that after an "opsview-slave restart" the reverse
tunnels were created successfully, so I cannot see what is wrong.
What happens during the reload that would generate a "Host key
verification failed" error?
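
One thing I have not yet ruled out: as far as I understand, that
message comes from the ssh client's known_hosts check rather than from
public key authentication, so the known_hosts file used on the new
master may differ from the one on the failed node. Below is a rough
sketch of a comparison I could run on each master. It is only an
illustration: the function names and the example host names (slave1,
slave2) are made up, it only does exact matching on unhashed entries
(no wildcards), and which known_hosts file Opsview actually consults is
an assumption I would still need to verify.

```python
def plain_host_patterns(known_hosts_text):
    """Return the set of unhashed host patterns found in known_hosts text."""
    patterns = set()
    for line in known_hosts_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.split()
        # A line may begin with a marker such as @cert-authority or @revoked.
        if fields[0].startswith("@"):
            fields = fields[1:]
        if len(fields) < 3:
            continue  # not a "hosts keytype key" line
        host_field = fields[0]
        if host_field.startswith("|1|"):
            continue  # hashed entry, cannot be compared as plain text
        patterns.update(host_field.split(","))
    return patterns


def missing_hosts(known_hosts_text, wanted):
    """Return the entries of `wanted` with no exact match in known_hosts."""
    known = plain_host_patterns(known_hosts_text)
    return [host for host in wanted if host not in known]


if __name__ == "__main__":
    # Hypothetical file contents; on a real master this would be read from
    # the known_hosts file of the user that opens the ssh connections.
    sample = "slave1,192.0.2.10 ssh-rsa AAAA\n"
    print(missing_hosts(sample, ["slave1", "slave2"]))
```

If the list returned on the new master is non-empty while the old
master's is empty, that would point at known_hosts rather than at the
exchanged public keys.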

Any help you could provide on these issues would be greatly appreciated.

Thanks very much.
_______________________________________________
Opsview-users mailing list
[email protected]
http://lists.opsview.org/listinfo/opsview-users
