If possible, could you share the MDS logs at debug level 20?
You'll need to set debug_mds = 20 in the conf file and leave it there until
the crash, then revert the level to the default once the MDS has crashed.
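
As a rough sketch of what that could look like, assuming the MDS daemons read
a plain ceph.conf (in a containerized or operator-managed deployment the
override is usually supplied some other way, so adjust to your setup):

```ini
# ceph.conf fragment (illustrative; placement under [mds] is an assumption
# about how your configuration is laid out)
[mds]
# Very verbose MDS logging. Keep it only until the crash is captured,
# because the logs grow quickly at this level.
debug_mds = 20
```

Once the crash has been captured, remove the line again (or set it back to
the default, which should be 1/5) so the logs do not keep growing.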


On Tue, Jul 18, 2023 at 9:12 PM <dxo...@naver.com> wrote:

> hello.
> I am using Rook Ceph and have 20 MDSs in use. 10 are in rank 0-9 and 10 are
> in standby.
> I have one ceph filesystem, and 2 mds are trimming.
> Under one FILESYSTEM, there are 6 MDSs in RESOLVE, 1 MDS in REPLAY, and 3
> in ACTIVE.
> For some reason, since 36 hours ago, RESOLVE is stuck in TRIMMING, and so
> are the MDSs in REPLAY.
> I've also tried FAILing each MDS, but to no avail.
> I think something should change when the MDS in REPLAY goes to RESOLVE,
> but I don't know what.
> Even looking at the logs of the REPLAY MDS, it's hard to see any messages
> other than it is TERMINATED every 11 minutes.
> I'm desperate for someone's help.
> _______________________________________________
> ceph-users mailing list -- ceph-users@ceph.io
> To unsubscribe send an email to ceph-users-le...@ceph.io
>
>

-- 
Milind