I'm having trouble finding a concise set of steps to repair inconsistent placement groups. I know from other threads that issuing a 'ceph pg repair ...' command could cause loss of data integrity if the primary OSD happens to have the bad copy of the placement group. I know how to find which PG's are bad (ceph pg dump), but I'm not sure how to figure out which objects in the PG failed their CRCs during the deep scrub, and I'm not sure how to get the correct CRC so I can determine which OSD holds the correct copy.
Maybe I'm on the wrong path entirely? If someone knows how to resolve this, I'd appreciate some insight. I think this would be a good topic for adding to the OSD/PG operations section of the manual, or at least a wiki article. Thanks! -Aaron
_______________________________________________ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com