Absolutely not. Please don't do this. None of the CephFS disaster recovery
tooling in any way plays nicely with a live filesystem.
I haven't looked at these docs in a while, are they not crystal clear about
all these operations being offline and in every way dangerous? :/
-Greg

On Mon, May 7, 2018 at 12:50 PM Ryan Leimenstoll <[email protected]>
wrote:

> Hi All,
>
> We recently experienced a failure with our 12.2.4 cluster running a CephFS
> instance that resulted in some data loss due to a seemingly problematic OSD
> blocking IO on its PGs. We restarted the (single active) mds daemon during
> this, which caused damage due to the journal not having the chance to flush
> back. We reset the journal, session table, and fs to bring the filesystem
> online. We then removed some directories/inodes that were causing the
> cluster to report damaged metadata (and were otherwise visibly broken by
> navigating the filesystem).
>
> With that, there are now some paths that seem to have been orphaned (which
> we expected). We did not run the ‘cephfs-data-scan’ tool [0] in the name of
> getting the system back online ASAP. Now that the filesystem is otherwise
> stable, can we initiate a scan_links operation with the mds active safely?
>
> [0]
> http://docs.ceph.com/docs/luminous/cephfs/disaster-recovery/#recovery-from-missing-metadata-objects
>
> Thanks much,
> Ryan Leimenstoll
> [email protected]
> University of Maryland Institute for Advanced Computer Studies
>
>
> _______________________________________________
> ceph-users mailing list
> [email protected]
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>
_______________________________________________
ceph-users mailing list
[email protected]
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to