Dear all,

in the logs of our 3 MDS (Lustre 2.12.5), on NID is continously reported as 
missing, e.g.

> LNet: 3571:0:(o2iblnd_cb.c:3397:kiblnd_check_conns()) Timed out tx for 
10.20.1.237@o2ib5: 1 seconds

Once upon a time, there was indeed a batch node with that IP. It has been decomissioned and switched off some time last year, but still crops up in the logs.

Recently I managed to revive the old hardware, connect it to Lustre. The MDS 
recognized the node. Then I umounted, which worked without a problem.
Nevertheless, the NID keeps reappearing in the logs.

Any way to make the MDSes understand that this box is gone for good? Perhaps 
install a new one with the old IP?

Regards,
Thomas

--
--------------------------------------------------------------------
Thomas Roth
Department: IT
Location: SB3 2.291
Phone: +49-6159-71 1453  Fax: +49-6159-71 2986

GSI Helmholtzzentrum für Schwerionenforschung GmbH
Planckstraße 1, 64291 Darmstadt, Germany, www.gsi.de

Commercial Register / Handelsregister: Amtsgericht Darmstadt, HRB 1528
Managing Directors / Geschäftsführung:
Professor Dr. Paolo Giubellino, Dr. Ulrich Breuer, Jörg Blaurock
Chairman of the Supervisory Board / Vorsitzender des GSI-Aufsichtsrats:
State Secretary / Staatssekretär Dr. Volkmar Dietz

_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Reply via email to