We have a single VM that is acting odd. We had 7 SSD OSDs (out of 40) go
down over a period of about 12 hours. These are a cache tier and have size
4, min_size 2. I'm not able to make heads or tails of the error and hoped
someone here could help.

2016-01-14 23:09:54.559121 osd.136 [ERR] 13.503 copy from
f8bedd03/rbd_data.48a6325f5e3f87.000000000000683d/head//13 to
f8bedd03/rbd_data.48a6325f5e3f87.000000000000683d/head//13 data digest
0x92bc163c != source 0x8fe2d0a9

The PG fully recovered then the error was

2016-01-15 00:39:25.321469 osd.12 [ERR] 13.503 copy from
f8bedd03/rbd_data.48a6325f5e3f87.000000000000683d/head//13 to
f8bedd03/rbd_data.48a6325f5e3f87.000000000000683d/head//13 data digest
0x92bc163c != source 0x8fe2d0a9

A deep scrub of the PG comes back clean and a hash of the files on all OSDs
match. The file system on this vm keeps going read only.

The osd file system is EXT4 and this is 0.94.5.

Thanks,

Robert LeBlanc

Sent from a mobile device please excuse any typos.
_______________________________________________
ceph-users mailing list
[email protected]
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to