Hi Guys,

I've had working OSD's on armhf using 14.2.7 fixed using the workaround from #956293. The OSD and mon worked on armhf 14.2.7 and amd64 14.2.8 (proxmox install). When I upgraded the 14.2.7 cluster to 14.2.9, everything still worked, until I rebooted the proxmox server.
Everything since then just went sauer.

So: I have a complete working ceph cluster on 14.2.9 running on arm. ceph status works. Mapping rbd using echo to the /sys/bus/rbd/add_single_major works (using the username, key and monitors from ceph.conf) on kernel 5.6.11 amd64 and any other kernel (armhf or whatever).
So, the ceph cluster works and the protocol is still correct.

However as soon as I just want to do a ceph status on an amd64, I get an indefinite hanging ceph command line. No way to trace that (please tell me how).
This problem is limited to amd64 though.
When I install ceph on an i386 image, connecting to the ceph cluster works and the cluster is healthy.

So protocol wise amd64 kernel works with 32 bits clusters. But amd64 user space does not work with 32 bits clusters. This might be somewhere in the authentication chain, as 14.2.9 was working (as far as I know) until I rebooted the 64 bit system.
And I think that last CVE fix might be the problem.

Anyway, I hope this reaches someone...
Regards,
Ard van Breemen

Reply via email to