Re: [ceph-users] mds crash

Peter Tiernan Fri, 29 May 2015 03:35:14 -0700

ok, thats interesting. I had issues before this crash where files werebeing garbled. I followed what I thought was the correct procedure forerasure coded pool with cache tier:


> ceph osd pool create ECpool 800 800 erasure default
> ceph osd pool create CachePool 4096 4096
> ceph osd tier add ECpool CachePool
> ceph osd tier cache-mode CachePool readonly
> ceph osd tier set-overlay ECpool CachePool
> ceph osd pool create cephfs_metadata 4096 4096
> ceph fs new cephfs cephfs_metadata ECpool

Is my mistake the last command above? should the ceph fs new be giventhe CachePool and not the ECpool?


thanks


On 29/05/15 11:17, John Spray wrote:

On 29/05/2015 09:46, Peter Tiernan wrote:
-16> 2015-05-29 09:28:23.106541 7f78c53a9700 10 mds.0.objecter inhandle_osd_op_reply-15> 2015-05-29 09:28:23.106543 7f78c53a9700 7 mds.0.objecterhandle_osd_op_reply 28 ondisk v 0'0 uv 0 in 11.5ce99960 attempt 1-14> 2015-05-29 09:28:23.106546 7f78c53a9700 10 mds.0.objecter op0 rval -95 len 0-13> 2015-05-29 09:28:23.106552 7f78c53a9700 5 mds.0.objecter 1unacked, 1 uncommitted-12> 2015-05-29 09:28:23.106958 7f78c52a8700 10 mds.0.objecterms_dispatch 0x3e2e000 osd_op_reply(29 10000000064.00000004 [trimtrunc2@0] v0'0 uv0 ondisk = -95 ((95) Operation not supported)) v6-11> 2015-05-29 09:28:23.106971 7f78c52a8700 10 mds.0.objecter inhandle_osd_op_reply-10> 2015-05-29 09:28:23.106973 7f78c52a8700 7 mds.0.objecterhandle_osd_op_reply 29 ondisk v 0'0 uv 0 in 11.50e84eb2 attempt 1-9> 2015-05-29 09:28:23.106976 7f78c52a8700 10 mds.0.objecter op0 rval -95 len 0-8> 2015-05-29 09:28:23.106980 7f78c52a8700 5 mds.0.objecter 1unacked, 0 uncommitted-7> 2015-05-29 09:28:23.107296 7f78c69bf700 10 mds.0.objecterms_dispatch 0x3e2e000 osd_op_reply(30 1.00000000 [omap-get-header0~0,omap-get-vals 0~16] v0'0 uv1 ondisk = 0) v6-6> 2015-05-29 09:28:23.107307 7f78c69bf700 10 mds.0.objecter inhandle_osd_op_reply-5> 2015-05-29 09:28:23.107309 7f78c69bf700 7 mds.0.objecterhandle_osd_op_reply 30 ondisk v 0'0 uv 1 in 13.6b2cdaff attempt 0-4> 2015-05-29 09:28:23.107311 7f78c69bf700 10 mds.0.objecter op0 rval 0 len 222-3> 2015-05-29 09:28:23.107313 7f78c69bf700 10 mds.0.objecter op1 rval 0 len 4-2> 2015-05-29 09:28:23.107315 7f78c69bf700 10 mds.0.objecter op1 handler 0x3e316b0-1> 2015-05-29 09:28:23.107321 7f78c69bf700 5 mds.0.objecter 0unacked, 0 uncommitted0> 2015-05-29 09:28:23.108478 7f78cb4d9700 -1 mds/MDCache.cc: Infunction 'virtual void C_IO_MDC_TruncateFinish::finish(int)' thread7f78cb4d9700 time 2015-05-29 09:28:23.107027
mds/MDCache.cc: 5974: FAILED assert(r == 0 || r == -2)
OK, so you have "Operation not supported" coming out of RADOS. Thatusually means you've got CephFS trying to use an erase coded pooldirectly (doesn't work) rather than via a replicated cache pool (doeswork).
You may have found that the filesystem appeared to work up to a pointif you were only writing and not modifying.
John

_______________________________________________
ceph-users mailing list
[email protected]
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Re: [ceph-users] mds crash

Reply via email to