On 6/4/26 1:33 AM, C U via ceph-users wrote:
Enjoy! Lessons learned and demonstration of "AI enhanced" infrastructure.

TL;DR: used ceph-kvstore-tool bluestore-k to work around a data corruption issue caused by a machine operator.

https://github.com/tu503/coe/blob/main/2026-06-03-ceph-bulk-rbd-rm/COE.md


This is a fine piece of reading, thank you.

There are a few things that are not clear to me:

1. If an OSD fails to start after an abort, that is a bug. There is no link to an issue anywhere in the report. Was the bug identified, or at least reported?

2. The report says that the controller was overloaded. At the same time, it mentions hard drives behind the controller. Are you sure the controller was overloaded, and that this was not ordinary drive thrashing? Were you able to reproduce this with a synthetic load? What was the ratio between drives and ports (and of what kind) in your system?
_______________________________________________
ceph-users mailing list -- [email protected]
To unsubscribe send an email to [email protected]
