[ceph-users] Re: Erasure-coded PG stuck in the failed_repair state

2022-05-11 Thread Robert Appleyard - STFC UKRI
t to run rados list-inconsistent-obj. Respectfully, Wes Dillingham w...@wesdillingham.com<mailto:w...@wesdillingham.com> LinkedIn<http://www.linkedin.com/in/wesleydillingham> On Tue, May 10, 2022 at 8:52 AM Robert Appleyard - STFC UKRI mailto:rob.appley...@stfc.ac.uk>> wrote: Hi, W

[ceph-users] Erasure-coded PG stuck in the failed_repair state

2022-05-10 Thread Robert Appleyard - STFC UKRI
Hi, We've got an outstanding issue with one of our Ceph clusters here at RAL. The cluster is 'Echo', our 40PB cluster. We found an object from an 8+3EC RGW pool in the failed_repair state. We aren't sure how the object got into this state, but it doesn't appear to be a case of correlated drive