Hi, During some testing, I hit an issue during unmount that was seems quite similar to the problem I reported a few weeks ago (BUG during umount() after withdrawal). I am using 5.7.1 which includes the patches for the earlier issue. Here is the log:
[ 304.212698] connection1:0: ping timeout of 5 secs expired, recv timeout 5, last rx 4294965568, last ping 4294966848, now 4294968128 [ 304.212834] connection1:0: detected conn error (1022) [ 332.856419] sd 8:0:0:0: rejecting I/O to offline device [ 332.856473] blk_update_request: I/O error, dev sdb, sector 2168544 op 0x0:(READ) flags 0x3000 phys_seg 1 prio class 0 [ 332.856490] blk_update_request: I/O error, dev sdb, sector 47608 op 0x1:(WRITE) flags 0x0 phys_seg 2 prio class 0 [ 332.856500] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: Error 10 writing to journal, jid=0 [ 332.856525] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: fatal: I/O error(s) [ 332.856537] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: about to withdraw this file system [ 332.856539] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: dirty_inode: gfs2_trans_begin -30 [ 332.856571] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: The logd daemon is withdrawing. [ 332.856598] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: Requesting recovery of jid 0. [ 332.857124] sd 8:0:0:0: [sdb] Synchronizing SCSI cache [ 332.857157] sd 8:0:0:0: [sdb] Synchronize Cache(10) failed: Result: hostbyte=DID_TRANSPORT_FAILFAST driverbyte=DRIVER_OK [ 332.920535] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: No other mounters found. Trying to recover our own journal jid 0. [ 332.920540] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: jid=0: Recovery not attempted due to withdraw. [ 332.920641] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: recovery_result jid 0 ignored due to withdraw. [ 332.920683] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: Reprocessing of jid 0 failed with -5. [ 332.920684] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: Journal recovery complete for jid 0. [ 332.920684] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: Glock dequeues delayed: 1 [ 332.920706] ------------[ cut here ]------------ [ 332.920708] bdi-block not registered [ 332.920719] WARNING: CPU: 5 PID: 6236 at /local/scratch/rossla/src/linux/fs/fs-writeback.c:2323 __mark_inode_dirty+0x269/0x360 [ 332.920719] Modules linked in: gfs2 iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi rpcsec_gss_krb5 auth_rpcgss oid_registry nfsv4 nfs lockd grace md4 cmac nls_utf8 cifs libarc4 fscache libdes dlm bnx2fc cnic uio fcoe libfcoe libfc scsi_transport_fc 8021q garp mrp stp llc openvswitch nsh nf_conncount nf_nat ipt_REJECT nf_reject_ipv4 xt_tcpudp xt_multiport xt_conntrack nf_conntrack nf_defrag_ipv4 libcrc32c dm_multipath iptable_filter sunrpc dm_mod intel_powerclamp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd sg glue_helper igb mei_me video mei intel_rapl_perf ie31200_edac i2c_i801 acpi_power_meter intel_pch_thermal ipmi_si backlight ipmi_devintf ipmi_msghandler ip_tables x_tables hid_generic usbhid hid sd_mod t10_pi ahci libahci xhci_pci intel_ish_ipc libata xhci_hcd intel_ishtp scsi_dh_rdac scsi_dh_hp_sw scsi_dh_emc scsi_dh_alua scsi_mod xen_wdt ipv6 crc_ccitt nf_defrag_ipv6 [ 332.920763] CPU: 5 PID: 6236 Comm: python2 Not tainted 5.7.1.uup2 #2 [ 332.920764] Hardware name: Supermicro MBI-6119G-T4/B2SS1-F, BIOS 2.0a 06/10/2017 [ 332.920765] RIP: e030:__mark_inode_dirty+0x269/0x360 [ 332.920766] Code: f6 ff ff 48 89 c5 48 8b 00 f6 40 5c 02 75 1d 48 8b 55 08 83 e2 01 75 14 48 8b 70 50 48 c7 c7 1c 83 e8 81 31 c0 e8 17 26 df ff <0f> 0b 48 8b 05 5e bf d7 00 45 85 f6 49 89 84 24 c8 00 00 00 74 0f [ 332.920767] RSP: e02b:ffffc90041653d28 EFLAGS: 00010282 [ 332.920768] RAX: 0000000000000000 RBX: 0000000000000001 RCX: 0000000000000001 [ 332.920768] RDX: 0000000000000001 RSI: 0000000000000001 RDI: ffff88811a159840 [ 332.920769] RBP: ffff888109674878 R08: 0000000000000422 R09: 0000000000000422 [ 332.920769] R10: 0000000000000028 R11: ffffffff82602544 R12: ffff888071c99ec0 [ 332.920769] R13: ffff888071c99f48 R14: 0000000000000000 R15: 0000000000000000 [ 332.920777] FS: 00007f77e53eb740(0000) GS:ffff88811a140000(0000) knlGS:0000000000000000 [ 332.920778] CS: e030 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 332.920778] CR2: 00007fab6c9885b0 CR3: 000000007690a000 CR4: 0000000000040660 [ 332.920782] Call Trace: [ 332.920786] generic_update_time+0x6b/0xd0 [ 332.920787] file_update_time+0xaa/0xf0 [ 332.920792] gfs2_set_aops+0x1ed7/0x2130 [gfs2] [ 332.920794] ? get_signal+0x479/0x8f0 [ 332.920796] new_sync_write+0x11e/0x1a0 [ 332.920798] vfs_write+0xad/0x1a0 [ 332.920799] ? kern_select+0xc9/0x100 [ 332.920800] ksys_write+0x55/0xd0 [ 332.920802] do_syscall_64+0x4e/0x140 [ 332.920804] entry_SYSCALL_64_after_hwframe+0x44/0xa9 [ 332.920805] RIP: 0033:0x7f77e4209cd0 [ 332.920806] Code: 73 01 c3 48 8b 0d c0 61 2d 00 f7 d8 64 89 01 48 83 c8 ff c3 66 0f 1f 44 00 00 83 3d cd c2 2d 00 00 75 10 b8 01 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 31 c3 48 83 ec 08 e8 ee cb 01 00 48 89 04 24 [ 332.920807] RSP: 002b:00007ffe316f5108 EFLAGS: 00000246 ORIG_RAX: 0000000000000001 [ 332.920808] RAX: ffffffffffffffda RBX: 0000000000000005 RCX: 00007f77e4209cd0 [ 332.920808] RDX: 0000000000000005 RSI: 00007f77e53f7000 RDI: 0000000000000003 [ 332.920808] RBP: 00007f77e53f7000 R08: 00007f77e53eb740 R09: 000000000073c0a0 [ 332.920822] R10: 000000000000006a R11: 0000000000000246 R12: 0000000000775250 [ 332.920823] R13: 0000000000000005 R14: 00007f77e5393050 R15: 00007f77e5295830 [ 332.920824] ---[ end trace 3cb072c1f481ae5c ]--- [ 332.921511] Buffer I/O error on dev sdb, logical block 262833, lost async page write [ 332.921537] ------------[ cut here ]------------ [ 332.921544] WARNING: CPU: 5 PID: 421 at /local/scratch/rossla/src/linux/fs/gfs2/glops.c:189 gfs2_unregister_debugfs+0xafc/0xb10 [gfs2] [ 332.921544] Modules linked in: gfs2 iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi rpcsec_gss_krb5 auth_rpcgss oid_registry nfsv4 nfs lockd grace md4 cmac nls_utf8 cifs libarc4 fscache libdes dlm bnx2fc cnic uio fcoe libfcoe libfc scsi_transport_fc 8021q garp mrp stp llc openvswitch nsh nf_conncount nf_nat ipt_REJECT nf_reject_ipv4 xt_tcpudp xt_multiport xt_conntrack nf_conntrack nf_defrag_ipv4 libcrc32c dm_multipath iptable_filter sunrpc dm_mod intel_powerclamp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd sg glue_helper igb mei_me video mei intel_rapl_perf ie31200_edac i2c_i801 acpi_power_meter intel_pch_thermal ipmi_si backlight ipmi_devintf ipmi_msghandler ip_tables x_tables hid_generic usbhid hid sd_mod t10_pi ahci libahci xhci_pci intel_ish_ipc libata xhci_hcd intel_ishtp scsi_dh_rdac scsi_dh_hp_sw scsi_dh_emc scsi_dh_alua scsi_mod xen_wdt ipv6 crc_ccitt nf_defrag_ipv6 [ 332.921565] CPU: 5 PID: 421 Comm: kworker/5:1H Tainted: G W 5.7.1.uup2 #2 [ 332.921566] Hardware name: Supermicro MBI-6119G-T4/B2SS1-F, BIOS 2.0a 06/10/2017 [ 332.921571] Workqueue: glock_workqueue gfs2_glock_dq_uninit [gfs2] [ 332.921573] RIP: e030:gfs2_unregister_debugfs+0xafc/0xb10 [gfs2] [ 332.921574] Code: e8 59 66 01 00 48 89 ef e8 31 75 86 c0 66 90 5b 5d 44 89 e0 41 5c 41 5d c3 ba 01 00 00 00 48 89 de 31 ff e8 96 d0 ff ff 0f 0b <0f> 0b 89 c6 4c 89 ef e8 31 00 02 00 eb b4 66 0f 1f 44 00 00 0f 1f [ 332.921575] RSP: e02b:ffffc9004050bde0 EFLAGS: 00010282 [ 332.921575] RAX: 00000000fffffffb RBX: ffff888108c6d570 RCX: 0000000000ffffff [ 332.921576] RDX: 0000000000000001 RSI: 00000000000402b5 RDI: ffff888111574748 [ 332.921578] RBP: ffff888108c6d590 R08: ffff88807e5e6fe8 R09: 0000000001000000 [ 332.921579] R10: 0000000001000000 R11: 0000000000000228 R12: 00000000fffffffb [ 332.921580] R13: ffff888111574748 R14: ffffffffc088c3a0 R15: ffff888108c6d590 [ 332.921590] FS: 00007f77e53eb740(0000) GS:ffff88811a140000(0000) knlGS:0000000000000000 [ 332.921590] CS: e030 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 332.921591] CR2: 00007fab6c9885b0 CR3: 0000000002008000 CR4: 0000000000040660 [ 332.921598] Call Trace: [ 332.921601] gfs2_glock_dq_uninit+0x4c6/0x9c0 [gfs2] [ 332.921603] gfs2_glock_dq_uninit+0x908/0x9c0 [gfs2] [ 332.921607] process_one_work+0x166/0x370 [ 332.921609] worker_thread+0x49/0x3e0 [ 332.921610] kthread+0xf8/0x130 [ 332.921611] ? rescuer_thread+0x330/0x330 [ 332.921612] ? kthread_bind+0x10/0x10 [ 332.921614] ret_from_fork+0x35/0x40 [ 332.921617] ---[ end trace 3cb072c1f481ae5d ]--- [ 332.921620] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: Error -5 syncing glock [ 332.921631] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: G: s:EX n:3/402b1 f:lDpiIqo t:UN d:UN/0 a:0 v:0 r:3 m:50 [ 332.921646] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: R: n:262833 f:b0000000 b:57279/57279 i:107 r:0 e:57311 [ 332.921733] ------------[ cut here ]------------ [ 332.921734] kernel BUG at /local/scratch/rossla/src/linux/fs/inode.c:1588! [ 332.921746] invalid opcode: 0000 [#1] SMP NOPTI [ 332.921751] CPU: 5 PID: 6236 Comm: python2 Tainted: G W 5.7.1.uup2 #2 [ 332.921758] Hardware name: Supermicro MBI-6119G-T4/B2SS1-F, BIOS 2.0a 06/10/2017 [ 332.921767] RIP: e030:iput+0x1a5/0x1b0 [ 332.921771] Code: 8b 85 98 00 00 00 eb a5 48 89 ef e8 15 fe ff ff 4c 89 e7 e8 dd d8 e5 ff 66 90 e9 d9 fe ff ff 0f 0b eb 86 0f 0b e9 26 ff ff ff <0f> 0b 66 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 41 57 41 56 49 89 [ 332.921786] RSP: e02b:ffffc90041653c30 EFLAGS: 00010202 [ 332.921791] RAX: dead000000000122 RBX: ffff888004190240 RCX: 0000000080400001 [ 332.921798] RDX: ffff888004190e40 RSI: 0000000000000001 RDI: ffff888071c75c40 [ 332.921805] RBP: ffff888071c75c40 R08: 0000000000000001 R09: 0000000000000000 [ 332.921811] R10: ffff888004548140 R11: 0000000000000000 R12: ffff888111574558 [ 332.921818] R13: ffff888111574558 R14: ffff88810fe92000 R15: ffff88806fb9d7c0 [ 332.921830] FS: 00007f77e53eb740(0000) GS:ffff88811a140000(0000) knlGS:0000000000000000 [ 332.921837] CS: e030 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 332.921843] CR2: 00007fab6c9885b0 CR3: 0000000002008000 CR4: 0000000000040660 [ 332.921853] Call Trace: [ 332.921859] gfs2_jindex_free+0xd5/0x110 [gfs2] [ 332.921866] gfs2_make_fs_ro+0x57e/0x5e0 [gfs2] [ 332.921872] generic_shutdown_super+0x6c/0x120 [ 332.921878] kill_block_super+0x21/0x50 [ 332.921882] deactivate_locked_super+0x3a/0x70 [ 332.921891] cleanup_mnt+0xc7/0x100 [ 332.921896] task_work_run+0x6f/0xa0 [ 332.921901] do_exit+0x2a8/0xb60 [ 332.921906] do_group_exit+0x39/0xa0 [ 332.921910] get_signal+0x17f/0x8f0 [ 332.921915] do_signal+0x30/0x690 [ 332.921921] exit_to_usermode_loop+0x5e/0xba [ 332.921927] do_syscall_64+0x105/0x140 [ 332.921931] entry_SYSCALL_64_after_hwframe+0x44/0xa9 [ 332.921936] RIP: 0033:0x7f77e4209cd0 [ 332.921945] Code: Bad RIP value. [ 332.921948] RSP: 002b:00007ffe316f5108 EFLAGS: 00000246 ORIG_RAX: 0000000000000001 [ 332.921955] RAX: fffffffffffffffb RBX: 0000000000000005 RCX: 00007f77e4209cd0 [ 332.921962] RDX: 0000000000000005 RSI: 00007f77e53f7000 RDI: 0000000000000003 [ 332.921969] RBP: 00007f77e53f7000 R08: 00007f77e53eb740 R09: 000000000073c0a0 [ 332.921975] R10: 000000000000006a R11: 0000000000000246 R12: 0000000000775250 [ 332.921982] R13: 0000000000000005 R14: 00007f77e5393050 R15: 00007f77e5295830 [ 332.921989] Modules linked in: gfs2 iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi rpcsec_gss_krb5 auth_rpcgss oid_registry nfsv4 nfs lockd grace md4 cmac nls_utf8 cifs libarc4 fscache libdes dlm bnx2fc cnic uio fcoe libfcoe libfc scsi_transport_fc 8021q garp mrp stp llc openvswitch nsh nf_conncount nf_nat ipt_REJECT nf_reject_ipv4 xt_tcpudp xt_multiport xt_conntrack nf_conntrack nf_defrag_ipv4 libcrc32c dm_multipath iptable_filter sunrpc dm_mod intel_powerclamp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd sg glue_helper igb mei_me video mei intel_rapl_perf ie31200_edac i2c_i801 acpi_power_meter intel_pch_thermal ipmi_si backlight ipmi_devintf ipmi_msghandler ip_tables x_tables hid_generic usbhid hid sd_mod t10_pi ahci libahci xhci_pci intel_ish_ipc libata xhci_hcd intel_ishtp scsi_dh_rdac scsi_dh_hp_sw scsi_dh_emc scsi_dh_alua scsi_mod xen_wdt ipv6 crc_ccitt nf_defrag_ipv6 [ 332.922067] ---[ end trace 3cb072c1f481ae5e ]--- [ 334.650814] RIP: e030:iput+0x1a5/0x1b0 [ 334.650865] Code: 8b 85 98 00 00 00 eb a5 48 89 ef e8 15 fe ff ff 4c 89 e7 e8 dd d8 e5 ff 66 90 e9 d9 fe ff ff 0f 0b eb 86 0f 0b e9 26 ff ff ff <0f> 0b 66 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 41 57 41 56 49 89 [ 334.650880] RSP: e02b:ffffc90041653c30 EFLAGS: 00010202 [ 334.650886] RAX: dead000000000122 RBX: ffff888004190240 RCX: 0000000080400001 [ 334.650892] RDX: ffff888004190e40 RSI: 0000000000000001 RDI: ffff888071c75c40 [ 334.650899] RBP: ffff888071c75c40 R08: 0000000000000001 R09: 0000000000000000 [ 334.650909] R10: ffff888004548140 R11: 0000000000000000 R12: ffff888111574558 [ 334.650916] R13: ffff888111574558 R14: ffff88810fe92000 R15: ffff88806fb9d7c0 [ 334.650928] FS: 00007f77e53eb740(0000) GS:ffff88811a140000(0000) knlGS:0000000000000000 [ 334.650935] CS: e030 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 334.650941] CR2: 00007f77e4209ca6 CR3: 0000000002008000 CR4: 0000000000040660 [ 334.650952] Fixing recursive fault but reboot is needed! [ 334.679677] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: telling LM to unmount [ 334.679701] dlm: 16d3f769-afce-7d: leaving the lockspace group... [ 334.680051] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: recover_prep ignored due to withdraw. [ 334.680152] dlm: 16d3f769-afce-7d: group event done 0 0 [ 334.680280] dlm: 16d3f769-afce-7d: release_lockspace final free [ 334.680300] gfs2: fsid=xapi-clusterd:16d3f769-afce-7d.0: File system withdrawn [ 334.680308] CPU: 6 PID: 6098 Comm: gfs2_logd Tainted: G D W 5.7.1.uup2 #2 [ 334.680315] Hardware name: Supermicro MBI-6119G-T4/B2SS1-F, BIOS 2.0a 06/10/2017 [ 334.680322] Call Trace: [ 334.680332] dump_stack+0x64/0x83 [ 334.680340] gfs2_withdraw+0x147/0x450 [gfs2] [ 334.680350] gfs2_add_revoke+0x304/0x320 [gfs2] [ 334.680361] gfs2_logd+0x334/0x14f0 [gfs2] [ 334.680368] ? wait_woken+0x80/0x80 [ 334.680374] kthread+0xf8/0x130 [ 334.680380] ? gfs2_log_commit+0x350/0x350 [gfs2] [ 334.680386] ? kthread_bind+0x10/0x10 [ 334.680395] ret_from_fork+0x35/0x40 [ 334.680416] BUG: kernel NULL pointer dereference, address: 000000000000005c [ 334.680423] #PF: supervisor read access in kernel mode [ 334.680428] #PF: error_code(0x0000) - not-present page [ 334.680438] PGD 0 P4D 0 [ 334.680445] Oops: 0000 [#2] SMP NOPTI [ 334.680452] CPU: 6 PID: 6098 Comm: gfs2_logd Tainted: G D W 5.7.1.uup2 #2 [ 334.680460] Hardware name: Supermicro MBI-6119G-T4/B2SS1-F, BIOS 2.0a 06/10/2017 [ 334.680469] RIP: e030:gfs2_logd+0x132/0x14f0 [gfs2] [ 334.680475] Code: 00 00 41 8b be 38 08 00 00 41 8b b6 30 08 00 00 01 fa 29 ca 39 f2 0f 83 f6 01 00 00 49 8b 96 88 05 00 00 41 8b 8e 34 08 00 00 <8b> 52 5c f0 49 0f ba 76 50 09 0f 82 a5 01 00 00 41 8b be 38 08 00 [ 334.680493] RSP: e02b:ffffc90041013ea8 EFLAGS: 00010282 [ 334.680499] RAX: 0000000000000001 RBX: ffff888111574050 RCX: 0000000000001ff1 [ 334.680509] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000 [ 334.680518] RBP: 0000000000000000 R08: ffffffff82263880 R09: ffffffff8226387c [ 334.680525] R10: ffffffff818001fa R11: 0000000000000000 R12: ffff888111574858 [ 334.680532] R13: ffff888111574280 R14: ffff888111574000 R15: ffff888108cf1d40 [ 334.680546] FS: 00007f838bf868c0(0000) GS:ffff88811a180000(0000) knlGS:0000000000000000 [ 334.680553] CS: e030 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 334.680559] CR2: 000000000000005c CR3: 0000000113c78000 CR4: 0000000000040660 [ 334.680570] Call Trace: [ 334.680575] ? wait_woken+0x80/0x80 [ 334.680580] kthread+0xf8/0x130 [ 334.680585] ? gfs2_log_commit+0x350/0x350 [gfs2] [ 334.680591] ? kthread_bind+0x10/0x10 [ 334.680596] ret_from_fork+0x35/0x40 [ 334.680601] Modules linked in: gfs2 iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi rpcsec_gss_krb5 auth_rpcgss oid_registry nfsv4 nfs lockd grace md4 cmac nls_utf8 cifs libarc4 fscache libdes dlm bnx2fc cnic uio fcoe libfcoe libfc scsi_transport_fc 8021q garp mrp stp llc openvswitch nsh nf_conncount nf_nat ipt_REJECT nf_reject_ipv4 xt_tcpudp xt_multiport xt_conntrack nf_conntrack nf_defrag_ipv4 libcrc32c dm_multipath iptable_filter sunrpc dm_mod intel_powerclamp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd sg glue_helper igb mei_me video mei intel_rapl_perf ie31200_edac i2c_i801 acpi_power_meter intel_pch_thermal ipmi_si backlight ipmi_devintf ipmi_msghandler ip_tables x_tables hid_generic usbhid hid sd_mod t10_pi ahci libahci xhci_pci intel_ish_ipc libata xhci_hcd intel_ishtp scsi_dh_rdac scsi_dh_hp_sw scsi_dh_emc scsi_dh_alua scsi_mod xen_wdt ipv6 crc_ccitt nf_defrag_ipv6 [ 334.680681] CR2: 000000000000005c [ 334.680685] ---[ end trace 3cb072c1f481ae5f ]--- [ 334.684212] RIP: e030:iput+0x1a5/0x1b0 [ 334.684261] Code: 8b 85 98 00 00 00 eb a5 48 89 ef e8 15 fe ff ff 4c 89 e7 e8 dd d8 e5 ff 66 90 e9 d9 fe ff ff 0f 0b eb 86 0f 0b e9 26 ff ff ff <0f> 0b 66 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 41 57 41 56 49 89 [ 334.684278] RSP: e02b:ffffc90041653c30 EFLAGS: 00010202 [ 334.684283] RAX: dead000000000122 RBX: ffff888004190240 RCX: 0000000080400001 [ 334.684290] RDX: ffff888004190e40 RSI: 0000000000000001 RDI: ffff888071c75c40 [ 334.684296] RBP: ffff888071c75c40 R08: 0000000000000001 R09: 0000000000000000 [ 334.684303] R10: ffff888004548140 R11: 0000000000000000 R12: ffff888111574558 [ 334.684310] R13: ffff888111574558 R14: ffff88810fe92000 R15: ffff88806fb9d7c0 [ 334.684322] FS: 00007f838bf868c0(0000) GS:ffff88811a180000(0000) knlGS:0000000000000000 [ 334.684329] CS: e030 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 334.684335] CR2: 000000000000005c CR3: 0000000113c78000 CR4: 0000000000040660 Steps to reproduce this: Log into an iSCSI target: $ iscsiadm -m discovery -t sendtargets -p 10.71.57.19 $ iscsiadm -m node -T iqn.2014-08.com.example:t1 -p 10.71.57.19 -l Mount the GFS2 fs somewhere: $ mount /dev/sdb /mnt Run a program to write to disk slowly: $ cat wedge.py import time import sys f = open(sys.argv[1], 'w') while True: f.write('blah\n') f.flush() time.sleep(1) $ python2 wedge.py /mnt/test.txt & [1] 6872 Check that it is working: $ cat /mnt/test2.txt blah $ cat /mnt/test2.txt blah blah $ cat /mnt/test2.txt blah blah blah Kill the iSCSI connection: $ iptables -I RH-Firewall-1-INPUT -s 10.71.57.19 -j DROP Wait for the Python process to become blocked: $ ~]# cat /proc/6872/stack [<0>] gfs2_log_reserve+0x13d/0x2e0 [gfs2] [<0>] gfs2_trans_begin+0xc5/0x140 [gfs2] [<0>] gfs2_dinode_out+0x324/0x3c0 [gfs2] [<0>] __mark_inode_dirty+0x17c/0x360 [<0>] generic_update_time+0x6b/0xd0 [<0>] file_update_time+0xaa/0xf0 [<0>] gfs2_set_aops+0x1ed7/0x2130 [gfs2] [<0>] new_sync_write+0x11e/0x1a0 [<0>] vfs_write+0xad/0x1a0 [<0>] ksys_write+0x55/0xd0 [<0>] do_syscall_64+0x4e/0x140 [<0>] entry_SYSCALL_64_after_hwframe+0x44/0xa9 Do a force, lazy unmount and logout of the iSCSI target: $ umount -f -l /mnt; iscsiadm -m node -T iqn.2014-08.com.example:t1 -p 10.71.57.19 --logout Logging out of session [sid: 1, target: iqn.2014-08.com.example:t1, portal: 10.71.57.19,3260] Logout of [sid: 1, target: iqn.2014-08.com.example:t1, portal: 10.71.57.19,3260] successful. Most of the time this results in the above errors (BUG() in iput() and NULL pointer dereference in gfs2_logd()). I think this may be happening because gfs2_logd does a withdraw which does iput() on the journal inode which evicts it. It then tries to reload it and fails (because of IO errors?). When gfs2_put_super() is called, it does iput() on the same inode again and hits the BUG(). Any ideas on this? Thanks, Ross