Hi everyone, I am running a lot of CTs with their roots located on an nfs share. Once in a while it happens that a process gets stuck which I fear has something to do with the nfs mount. See the dmesg out below. The problem now is that I can't kill this process anymore. This results into beeing unable to stop the CT running this process. vzctl stop <CTID> runs into a timeout. It is totally impossible to kill the process - The only solution is a reboot of the Host-System.
Is there a way to forcefully kill the CT ? In this case I don't care if the process remains running. I just want the rest of the CT to be stopped so I can start the CT again. Here is the dmes output: [194043.649945] INFO: task which:810615 blocked for more than 120 seconds. [194043.650077] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [194043.650274] which D ffff882f74146d50 0 810615 682640 125 0x00000084 [194043.650281] ffff8825ba732f98 0000000000000086 0000000000000017 ffff8825ba732fc0 [194043.650293] 0000000000000000 0000000000000000 ffff8824d0186d70 0000000000000007 [194043.650301] ffff8825ba732f88 ffff882f74147308 ffff8825ba733fd8 ffff8825ba733fd8 [194043.650308] Call Trace: [194043.650318] [<ffffffff81122480>] ? sync_page+0x0/0x50 [194043.650325] [<ffffffff814e6b73>] io_schedule+0x73/0xc0 [194043.650330] [<ffffffff811224bd>] sync_page+0x3d/0x50 [194043.650334] [<ffffffff814e73da>] __wait_on_bit_lock+0x5a/0xc0 [194043.650339] [<ffffffff81122457>] __lock_page+0x67/0x70 [194043.650345] [<ffffffff81095550>] ? wake_bit_function+0x0/0x50 [194043.650351] [<ffffffff8113ac72>] ? pagevec_lookup+0x22/0x30 [194043.650357] [<ffffffff8113cc8e>] truncate_inode_pages_range+0x43e/0x450 [194043.650397] [<ffffffffa0314b80>] ? nfs_dq_delete_inode+0x0/0xd0 [nfs] [194043.650406] [<ffffffffa03a7c7f>] ? vzquota_data_unlock+0x2f/0x40 [vzdquota] [194043.650421] [<ffffffffa0314b80>] ? nfs_dq_delete_inode+0x0/0xd0 [nfs] [194043.650426] [<ffffffff8113ccb5>] truncate_inode_pages+0x15/0x20 [194043.650441] [<ffffffffa0314b9f>] nfs_dq_delete_inode+0x1f/0xd0 [nfs] [194043.650447] [<ffffffff811ac736>] generic_delete_inode+0xd6/0x1c0 [194043.650451] [<ffffffff811ac885>] generic_drop_inode+0x65/0x80 [194043.650456] [<ffffffff811ab3e2>] iput+0x62/0x70 [194043.650467] [<ffffffffa02fa48e>] nfs_dentry_iput+0x3e/0x60 [nfs] [194043.650472] [<ffffffff811a7c1b>] dentry_iput+0x8b/0x110 [194043.650476] [<ffffffff811a7d9c>] d_kill+0x3c/0x70 [194043.650480] [<ffffffff811a9533>] dput+0xa3/0x1d0 [194043.650485] [<ffffffff8119e30a>] path_put+0x1a/0x40 [194043.650497] [<ffffffffa0301982>] __put_nfs_open_context+0xc2/0xf0 [nfs] [194043.650510] [<ffffffffa0301a90>] put_nfs_open_context+0x10/0x20 [nfs] [194043.650524] [<ffffffffa0311029>] nfs_commitdata_release+0x29/0x40 [nfs] [194043.650537] [<ffffffffa03116c1>] nfs_commit_release+0x31/0x40 [nfs] [194043.650564] [<ffffffffa029dde7>] rpc_release_calldata+0x17/0x20 [sunrpc] [194043.650576] [<ffffffffa029e090>] rpc_free_task+0x50/0x80 [sunrpc] [194043.650588] [<ffffffffa029e115>] rpc_final_put_task+0x55/0x60 [sunrpc] [194043.650600] [<ffffffffa029e150>] rpc_do_put_task+0x30/0x40 [sunrpc] [194043.650612] [<ffffffffa029e190>] rpc_put_task+0x10/0x20 [sunrpc] [194043.650626] [<ffffffffa03105c1>] nfs_initiate_commit+0x131/0x190 [nfs] [194043.650640] [<ffffffffa0311a89>] nfs_commit_inode+0x199/0x250 [nfs] [194043.650646] [<ffffffff8100bb0e>] ? common_interrupt+0xe/0x13 [194043.650658] [<ffffffffa02fe426>] nfs_release_page+0x86/0xa0 [nfs] [194043.650662] [<ffffffff81121800>] try_to_release_page+0x30/0x60 [194043.650668] [<ffffffff8113fc77>] shrink_page_list+0x817/0x9f0 [194043.650673] [<ffffffff81140227>] shrink_inactive_list+0x3d7/0xa40 [194043.650678] [<ffffffff81141308>] shrink_zone+0x5d8/0x9d0 [194043.650684] [<ffffffff81063c4b>] ? dequeue_task_fair+0x12b/0x130 [194043.650689] [<ffffffff8114240d>] __zone_reclaim+0x22d/0x2f0 [194043.650694] [<ffffffff8113eb30>] ? isolate_pages_global+0x0/0x520 [194043.650698] [<ffffffff811425e7>] zone_reclaim+0x117/0x150 [194043.650703] [<ffffffff8113261c>] get_page_from_freelist+0x6ac/0x840 [194043.650709] [<ffffffff814e8eab>] ? _spin_unlock_bh+0x1b/0x20 [194043.650714] [<ffffffff81125177>] ? mempool_free_slab+0x17/0x20 [194043.650720] [<ffffffff81134266>] __alloc_pages_nodemask+0x116/0xb40 [194043.650734] [<ffffffffa03148a9>] ? nfs_dq_update_shrink+0x29/0x120 [nfs] [194043.650739] [<ffffffff8112228e>] ? find_get_page+0x1e/0xa0 [194043.650743] [<ffffffff81123bbc>] ? filemap_fault+0xfc/0x5d0 [194043.650750] [<ffffffff81174e6a>] alloc_pages_vma+0x9a/0x150 [194043.650755] [<ffffffff81155a67>] handle_pte_fault+0xa87/0xf60 [194043.650759] [<ffffffff81156124>] handle_mm_fault+0x1e4/0x2b0 [194043.650765] [<ffffffff811904ea>] ? do_sync_read+0xfa/0x140 [194043.650770] [<ffffffff81042aa9>] __do_page_fault+0x139/0x480 [194043.650776] [<ffffffff814ebe2e>] do_page_fault+0x3e/0xa0 [194043.650780] [<ffffffff814e91d5>] page_fault+0x25/0x30 many thanks and best regards, Sirk -- -- _______________________________________________ Users mailing list [email protected] https://openvz.org/mailman/listinfo/users
