Hi, I am getting these errors on all our MDS and OSS servers (Lustre 2.10.1):

Aug 11 11:45:52 ndc-oss5b kernel: LNet: 24727:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119@o2ib version 12/12 incarnation 1533927051163335/1533998625080752
Aug 11 11:55:52 ndc-oss5b kernel: LNet: 105990:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119@o2ib version 12/12 incarnation 1533927051163335/1533998625080752
Aug 11 12:05:52 ndc-oss5b kernel: LNet: 105990:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119@o2ib version 12/12 incarnation 1533927051163335/1533998625080752
Aug 11 12:15:52 ndc-oss5b kernel: LNet: 105990:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119@o2ib version 12/12 incarnation 1533927051163335/1533998625080752
Aug 11 12:25:52 ndc-oss5b kernel: LNet: 105990:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119@o2ib version 12/12 incarnation 1533927051163335/1533998625080752

This is a new node we brought online recently. Does this indicate a problem with the OPA interface on the node? The machine has an 8160F CPU (OPA interface on chip).

Thanks,

Lixin Liu
High Performance Computing
Simon Fraser University
_______________________________________________
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org