Hi, I have a Luminous (12.2.25) cluster with several OSDs down. The daemons 
start but they're reporting as down. I did see in some osd logs that heartbeats 
were failing but when I checked the ports for the heartbeats were incorrect for 
that osd, although another osd was listening on that. How does the osd know 
what ports to ping other osds on? Is there any way to force an update.

The reason this happened is because someone took a VM snapshot of this cluster 
and restored the snapshot so the osds aren't up. I know this isn't a good 
implementation or a good idea and this will change going forward.

Anyway, I was just wondering about the heartbeat issue and whether attempting 
to ping on the right ports might bring them up.
Thanks,
Neil.

_______________________________________________
ceph-users mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to