Hope this helps anyone whose recovery is impacting client traffic. We have been migrating OSD hosts on Jewel and saw massive client timeouts caused by overwhelming recovery traffic (3-4 GB/s), to the point where the Areca HBAs would seize up and crash the hosts.
Setting osd_recovery_sleep = 0.5 immediately relieved the problem. I also tried a value of 1, but that slowed recovery too much. This seems like a very important operational parameter to note.

--
Alex Gorbachev
Storcium
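
For anyone who wants to try this without restarting OSDs, here is a minimal sketch of pushing the value at runtime through `ceph tell ... injectargs`. The helper names (`build_injectargs_cmd`, `apply_recovery_sleep`) are made up for illustration, and it assumes the `ceph` CLI and an admin keyring are available on the host and that the option can be changed at runtime in your release. To persist the value across restarts it would still need to go into the [osd] section of ceph.conf.

```python
import subprocess


def build_injectargs_cmd(sleep_seconds, target="osd.*"):
    """Build the argv for setting osd_recovery_sleep on running OSDs.

    `target` is an assumption: "osd.*" addresses every OSD, while
    something like "osd.12" would limit the change to a single daemon.
    """
    if sleep_seconds < 0:
        raise ValueError("sleep_seconds must be non-negative")
    # injectargs takes the option and its value as one quoted argument.
    return [
        "ceph", "tell", target, "injectargs",
        f"--osd_recovery_sleep {sleep_seconds}",
    ]


def apply_recovery_sleep(sleep_seconds, target="osd.*"):
    """Run the command; raises CalledProcessError if ceph exits non-zero."""
    subprocess.run(build_injectargs_cmd(sleep_seconds, target), check=True)
```

Calling `apply_recovery_sleep(0.5)` would apply the value that worked here. Which value is right depends on the hardware, so it is probably worth stepping it up gradually while watching both client latency and recovery progress.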
