On 6/4/26 1:33 AM, C U via ceph-users wrote:
Enjoy! Lessons learned and demonstration of "AI enhanced" infrastructure.
TL:DR used ceph-kvstore-tool bluestore-k work around a data corruption
issue cause by machine operator.
https://github.com/tu503/coe/blob/main/2026-06-03-ceph-bulk-rbd-rm/COE.md
This is the fine piece of reading, thank you.
There are few things which are not clear to me:
1. If osd fails to start after abort, this is a bug. There is no link to
issue anywhere in the report. Was bug identified, or at least reported?
2. It says that controller was overloaded. At the same time, there was a
notice on hard drives behind controller. Are you sure it was controller
overloaded, and not a normal drive thrashing? Was you able to reproduce
this with synthetic load? Was was ratio between drives and ports (and of
what kind?) in your system?
_______________________________________________
ceph-users mailing list -- [email protected]
To unsubscribe send an email to [email protected]