Hi folks, We have two similar Ceph deployments, but one of them is having trouble: VMs running on Ceph-provided block devices see frequent spikes of high I/O wait every few minutes, usually 15-20% but sometimes as high as 60-70%. The problem is cluster-wide and not correlated with the VMs' I/O load. We turned on the RBD cache and enabled writeback caching in QEMU, but the problem persists. Disabling deep scrubbing (nodeep-scrub) doesn't help either.
We'll hold off on sharing our own, probably wrong, theories. Any ideas on how to troubleshoot this? Thanks. -Simon
