Hi,

One of the jobs running on our cluster died. After investigating logs I can clearly see that job died because client that was running that job was evicted by MDS Nov 14 13:31:27 node-i06 kernel: LustreError: 167-0: This client was evicted by ddn_home-MDT0000; in progress operations using this service will fail.

On MDS I can not see any messages about this client being evicted. There is only one message about this client: Nov 14 13:31:27 mds01.beowulf.cluster kernel: LustreError: 22512:0: (handler.c:1498:mds_handle()) operation 400 on unconnected MDS from [EMAIL PROTECTED]

Can some one explain me what exactly had happened? Is this problem can be related to https://bugzilla.lustre.org/show_bug.cgi?id=13682

Best regards,

Wojciech Turek


_______________________________________________
Lustre-discuss mailing list
[email protected]
https://mail.clusterfs.com/mailman/listinfo/lustre-discuss

Reply via email to