I'm seeing this error show up on many of my nodes, many times over (usually in big spurts). I suspect I'm having some network congestion issues, but I haven't narrowed it down yet. I'm also unclear on what the error actually signifies:
LNetError: 10164:0:(peer.c:3451:lnet_peer_ni_add_to_recoveryq_locked()) lpni <address> added to recovery queue. Health = 900
