On my relatively new Octopus cluster, I have one PG that has been perpetually stuck in the 'unknown' state. It appears to belong to the device_health_metrics pool, which I believe was created automatically by the mgr daemon.
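
If it matters, I'm inferring the pool's origin from its application tag, which (if I understand the tooling right) can be checked with:

# ceph osd pool application get device_health_metrics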

The OSDs that the PG maps to are all online and serving other PGs. But when I list the PGs on the OSDs that 'ceph pg map' reports, the offending PG is not among them.
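
For reference, I checked each of those OSDs with something along these lines (assuming 'ceph pg ls-by-osd' is the right tool for this):

# ceph pg ls-by-osd 41 | grep ^1.0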

# ceph pg dump pgs | grep ^1.0
dumped pgs
1.0 0 0 0 0 0 0 0 0 0 0 unknown 2020-08-08T09:30:33.251653-0500 0'0 0:0 [] -1 [] -1 0'0 2020-08-08T09:30:33.251653-0500 0'0 2020-08-08T09:30:33.251653-0500 0

# ceph osd pool stats device_health_metrics
pool device_health_metrics id 1
  nothing is going on

# ceph pg map 1.0
osdmap e7199 pg 1.0 (1.0) -> up [41,40,2] acting [41,0]

What can be done to fix this PG? I tried 'ceph pg repair 1.0', but it didn't seem to do anything.
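
If the output of a direct query of the PG would help, I can post that too; I assume the relevant command is:

# ceph pg 1.0 query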

Is it safe to try to update the crush_rule for this pool so that the PG gets mapped to a fresh set of OSDs?
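
Concretely, I was thinking of something like the following, where <new-rule-name> is just a placeholder for a rule I'd create first:

# ceph osd pool set device_health_metrics crush_rule <new-rule-name>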

--Mike