On Wed, 2008-08-13 at 22:39 +0900, Alex Lee wrote:
> I have a system that's been spitting out OST disconnect messages under
> heavy load. I'm guessing the OST eventually reconnects.
> I want to say this happens when the OSS is extremely overloaded, but I
> did notice this happening even under light load. Only the OSS seems to
> spit out any error messages. I don't see anything on the client side.
>
> Should I be concerned? Or does this typically happen on other sites too?
>
> -Alex
>
> clip off one of the OSS:
>
> Aug 13 17:26:48 lustre-oss-0-1 kernel: LustreError: 137-5: UUID
> 'lfs-OST0004_UUID' is not available for connect (no target)

This means that the device the OSS is using for that OST has become
unavailable (i.e. ENODEV, "No such device"). The question becomes: why?

What kind of disk is the OST on? You might want to look in your storage
hardware's logs to see if there is any indication of trouble.

b.
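
To see whether the message is tied to one OST or spread across several, it can help to tally the occurrences per host and UUID before digging into the storage logs. The following is only a rough sketch in Python: the regular expression is modelled on the single log line quoted above, so other Lustre versions or syslog formats may need a different pattern, and the function name `count_unavailable` is made up for illustration.

```python
import re
from collections import Counter

# Pattern modelled on the one syslog line quoted in this thread (assumption:
# other Lustre releases or syslog setups may word or prefix it differently).
PATTERN = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+[\d:]{8})\s+(?P<host>\S+)\s+kernel:\s+"
    r"LustreError:\s+137-5:\s+UUID\s+'(?P<uuid>[^']+)'\s+"
    r"is not available for connect"
)


def count_unavailable(lines):
    """Count 137-5 'not available for connect' messages per (host, UUID).

    `lines` is any iterable of syslog lines, e.g. an open file object.
    Hypothetical helper, not part of any Lustre tool.
    """
    counts = Counter()
    for line in lines:
        match = PATTERN.search(line)
        if match:
            counts[(match.group("host"), match.group("uuid"))] += 1
    return counts


# The log line from the original message, joined back onto one line.
sample = [
    "Aug 13 17:26:48 lustre-oss-0-1 kernel: LustreError: 137-5: UUID "
    "'lfs-OST0004_UUID' is not available for connect (no target)",
    "Aug 13 17:26:49 lustre-oss-0-1 kernel: some unrelated message",
]
result = count_unavailable(sample)
for (host, uuid), n in result.most_common():
    print(host, uuid, n)
```

On a real OSS you would pass the open syslog file instead of `sample` (the path depends on your distribution). If one UUID dominates the tally, that points at a single backing device rather than at general load.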

_______________________________________________
Lustre-discuss mailing list
[email protected]
http://lists.lustre.org/mailman/listinfo/lustre-discuss
