Alexey Lyashkov wrote:
> Hi Aurelien,
>
> You can see that message in two cases:
> 1) A low-level network error. That is bad, because the client will
> reconnect and resend its requests after the error, which adds extra
> load to the service nodes.
>
> 2) A service node (MDS, OSS) was restarted or hung; in that case the
> transfer is aborted.
In our case the nodes were not restarted, so the InfiniBand network seems to have issues. But these errors can be ignored as long as they do not appear too often.

--
Aurelien Degremont
CEA
_______________________________________________
Lustre-discuss mailing list
http://lists.lustre.org/mailman/listinfo/lustre-discuss
