On Tue, Dec 12, 2017 at 8:18 PM, fcid <f...@altavoz.net> wrote:
> Hello everyone,
>
> We had an incident regarding a client which reboot after experiencing some
> issues with a ceph cluster.
>
> The other clients who consume RBD images from the same ceph cluster showed
> and error at the time of the reboot in logs related to libceph.
>
> The errors looks like this:
>
> Dec 10 21:29:52 xxxx kernel: [5830277.680860] WARNING: CPU: 15 PID: 8113 at
> net/ceph/osd_client.c:490 request_reinit+0x141/0x180 [libceph]
> Dec 10 21:29:52 xxxx kernel: [5830277.691032] Modules linked in:
> nfnetlink_queue bluetooth ocfs2 quota_tree binfmt_misc tcp_diag inet_diag
> veth ip_set ip6table_filter ip6_tables xt_nat xt_tcpudp xt_multiport
> xt_conntrack xt_addrtype iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4
> nf_nat_ipv4 nf_nat nf_conntrack rbd libceph ocfs2_dlmfs ocfs2_stack_o2cb
> ocfs2_dlm ocfs2_nodemanager ocfs2_stackglue bonding softdog iptable_filter
> nfnetlink_log nfnetlink intel_rapl sb_edac edac_core x86_pkg_temp_thermal
> intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul
> crc32_pclmul ipmi_ssif ghash_clmulni_intel pcbc ast ttm aesni_intel
> aes_x86_64 drm_kms_helper crypto_simd glue_helper snd_pcm cryptd drm
> snd_timer snd fb_sys_fops intel_cstate syscopyarea input_leds soundcore
> joydev sysfillrect intel_rapl_perf sysimgblt mei_me pcspkr mei ioatdma
> Dec 10 21:29:52 xxxx kernel: [5830277.765547]  lpc_ich shpchp wmi ipmi_si
> ipmi_devintf ipmi_msghandler nfit acpi_pad acpi_power_meter mac_hid
> vhost_net vhost macvtap macvlan ib_iser rdma_cm iw_cm ib_cm ib_core configfs
> iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi sunrpc ip_tables
> x_tables autofs4 raid10 raid456 async_raid6_recov async_memcpy async_pq
> async_xor async_tx xor raid6_pq libcrc32c raid0 multipath linear raid1
> hid_generic usbkbd usbmouse usbhid hid igb ixgbe i2c_algo_bit dca ptp ahci
> pps_core i2c_i801 libahci mdio fjes [last unloaded: quota_tree]
> Dec 10 21:29:52 xxxx kernel: [5830277.816275] CPU: 15 PID: 8113 Comm:
> kworker/15:0 Tainted: G        W 4.10.17-1-pve #1
> Dec 10 21:29:52 xxxx kernel: [5830277.825564] Hardware name: Supermicro
> SYS-1028U-TR4T+/X10DRU-i+, BIOS 2.0c 04/21/2017
> Dec 10 21:29:52 xxxx kernel: [5830277.834272] Workqueue: events
> handle_timeout [libceph]
> Dec 10 21:29:52 xxxx kernel: [5830277.840307] Call Trace:
> Dec 10 21:29:52 xxxx kernel: [5830277.843620] dump_stack+0x63/0x81
> Dec 10 21:29:52 xxxx kernel: [5830277.847846] __warn+0xcb/0xf0
> Dec 10 21:29:52 xxxx kernel: [5830277.851758] warn_slowpath_null+0x1d/0x20
> Dec 10 21:29:52 xxxx kernel: [5830277.856798] request_reinit+0x141/0x180
> [libceph]
> Dec 10 21:29:52 xxxx kernel: [5830277.862403] handle_timeout+0x307/0x5b0
> [libceph]
> Dec 10 21:29:52 xxxx kernel: [5830277.868116] process_one_work+0x1fc/0x4b0
> Dec 10 21:29:52 xxxx kernel: [5830277.873069] worker_thread+0x4b/0x500
> Dec 10 21:29:52 xxxx kernel: [5830277.877561] kthread+0x109/0x140
> Dec 10 21:29:52 xxxx kernel: [5830277.881720]  ?
> process_one_work+0x4b0/0x4b0
> Dec 10 21:29:52 xxxx kernel: [5830277.886851]  ?
> kthread_create_on_node+0x60/0x60
> Dec 10 21:29:52 xxxx kernel: [5830277.892323] ret_from_fork+0x2c/0x40
> Dec 10 21:29:52 xxxx kernel: [5830277.896939] ---[ end trace
> afd30825d5ecd451 ]---
>
> I wonder if this is a bug in KRBD.

This warning indicates a fairly minor issue in the internals of the
kernel client that should be safe to ignore.  request_reinit function
isn't used anywhere in the data or other critical paths.

Thanks,

                Ilya
_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to