Hello Community!
I would appreciate any help/suggestions with the massive RGWs outage we are
facing.
The cluster's overall status is acceptable (HEALTH_WARN because of some pgs
not scrubbed in time), and the cluster is operational.
However, all RGWs fail to start with a core dump.
The only issue I see at the moment is the RGW GC queue (radosgs-admin gc
list) that contains 600K records.
I believe this could be the root cause of the issue. When I pause OSD iops
(ceph osd pause), all RGWs starting with no issues.
There are no large OMAPs or any other warnings in ceph -s output.

I would appreciate any help or suggestions you can provide.

Sincerely,
Vladimir.
_______________________________________________
ceph-users mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to