pg query says the recovery state is:
          "might_have_unfound": [
                { "osd": 11,
                  "status": "querying"},
                { "osd": 13,
                  "status": "already probed"}],

I figured out why it wasn't probing osd.11.

When I manually replaced the disk, I added the OSD to the cluster with a CRUSH weight of 0.

As soon as I changed fixed the CRUSH weight, some PGs were allocated to the OSD, and the probing completed. My PG that was stuck in recovery mode for 24h has been remapped to be on osd.11. I believe this will allow the recovery to complete.





_______________________________________________
ceph-users mailing list
[email protected]
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to