i'm seeing this error show up on many of my nodes many times (usually
in big spurts).  i suspect i'm having some network congestion issues,
but i haven't narrowed it down yet.  but i'm unclear what the error
actually signifies

lneterror: 10164:0:(peer.c:3451:lnet_peer_ni_add_to_recoveryq_locked())
lpni <address> added to recovery queue.  Health = 900
_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Reply via email to