Hi,
I've stumpled upon this a couple of times, where Ceph just stops
responding, but still works.
The cause has been package loss on the network layer, but Ceph doesn't
say anything.
Is there a debug flag for showing retransmission of package, or someway
to see that packages are lost?
Regards,
Ceph messages are transmitted using tcp, so the system isn't directly aware
of packet loss at any level. I suppose we could try and export messenger
reconnect counts via the admin socket, but that'd be a very noisy measure
-- it seems simplest to just query the OS or hardware directly?
-Greg
On