[ceph-users] Understanding/correcting sudden onslaught of unfound objects

Graham Allan Mon, 12 Feb 2018 16:27:00 -0800

Hi,

For the past few weeks I've been seeing a large number of pgs on ourmain erasure coded pool being flagged inconsistent, followed by thembecoming active+recovery_wait+inconsistent with unfound objects. Thecluster is currently running luminous 12.2.2 but has in the past alsorun its way through firefly, hammer and jewel.

Here's a sample object from "ceph pg list_missing" (there are 150unfound objects in this particular pg):


ceph health detail shows:

    pg 70.467 is stuck unclean for 1004525.715896, current state 
active+recovery_wait+inconsistent, last acting [449,233,336,323,259,193]


ceph pg 70.467 list_missing:

        {
            "oid": {
                "oid": 
"default.323253.6_20150226/Downloads/linux-nvme-HEAD-5aa2ffa/include/config/via/fir.h",
                "key": "",
                "snapid": -2,
                "hash": 628294759,
                "max": 0,
                "pool": 70,
                "namespace": ""
            },
            "need": "73222'132227",
            "have": "0'0",
            "flags": "none",
            "locations": [
                "193(5)",
                "259(4)",
                "449(0)"
            ]
        },

When I trace through the filesystem on each OSD, I find the associatedfile present on each OSD but with size 0 bytes.

Interestingly, for the 3 OSDs for which "list_missing" shows locationsabove (193,259,449), the timestamp of the 0-byte file is recent (withinlast few weeks). For the other 3 OSDs (233,336,323), it's in the distantpast (08/2015 and 02/2016).

All the unfound objects I've checked on this pg show the same pattern,along with the "have" epoch showing as "0'0".

Other than the potential data loss being disturbing, I wonder why thisshowed up so suddenly?

It seems to have been triggered by one OSD host failing over a longweekend. By the time we looked at it on Monday, the cluster hadre-balanced enough data that I decided to simply leave it - we had longwanted to evacuate a first host to convert to a newer OS release, aswell as Bluestore. Perhaps this was a bad choice, but the clusterrecovery appeared to be proceeding normally, and was apparently completea few days later. It was only around a week later that the unfoundobjects started.

All the unfound object file fragments I've tracked down so far havetheir older members with timestamps in the same mid-2015 to mid-2016period. I could be wrong but this really seems like a long-standingproblem has just been unearthed. I wonder if it could be connected tothis thread from early 2016, concerning a problem on the same cluster:


http://lists.ceph.com/pipermail/ceph-users-ceph.com/2016-March/008120.html

It's a long thread, but the 0-byte files sound very like the "orphanedfiles" in that thread - related to performing a directory split whilehandling links on a filename with the special long filename handling...


http://lists.ceph.com/pipermail/ceph-users-ceph.com/2016-March/008317.html

However unlike that thread, I'm not finding any other files withduplicate names in the hierarchy.

I'm not sure there's much else I can do besides record the names of anyunfound objects before resorting to "mark_unfound_lost delete" - anysuggestions for further research?


Thanks,

Graham
--
Graham Allan
Minnesota Supercomputing Institute - [email protected]
_______________________________________________
ceph-users mailing list
[email protected]
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

[ceph-users] Understanding/correcting sudden onslaught of unfound objects

Reply via email to