On Fri, 2008-06-27 at 12:44 -0400, Brock Palen wrote: > > All of them are stuck in un-interruptible sleep. > Has anyone seen this happen before? Is this caused by a pending disk > failure?
Well, they are certainly stuck because of some blocking I/O. That could be disk failure, indeed. > mptscsi: ioc1: attempting task abort! (sc=0000010038904c40) > scsi1 : destination target 0, lun 0 > command = Read (10) 00 75 94 40 00 00 10 00 00 > mptscsi: ioc1: task abort: SUCCESS (sc=0000010038904c40) That does not look like a picture of happiness, indeed, no. You have SCSI commands aborting. > Lustre: 6698:0:(lustre_fsfilt.h:306:fsfilt_setattr()) nobackup- > OST0001: slow setattr 100s > Lustre: 6698:0:(watchdog.c:312:lcw_update_time()) Expired watchdog > for pid 6698 disabled after 103.1261s Those are just fallout from the above disk situation. b.
signature.asc
Description: This is a digitally signed message part
_______________________________________________ Lustre-discuss mailing list [email protected] http://lists.lustre.org/mailman/listinfo/lustre-discuss
