-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Jun 27, 2008, at 1:07 PM, Brian J. Murrell wrote:

> On Fri, 2008-06-27 at 12:44 -0400, Brock Palen wrote:
>>
>> All of them are stuck in un-interruptible sleep.
>> Has anyone seen this happen before? Is this caused by a pending disk
>> failure?
>
> Well, they are certainly stuck because of some blocking I/O. That could
> be disk failure, indeed.
>
>> mptscsi: ioc1: attempting task abort! (sc=0000010038904c40)
>> scsi1 : destination target 0, lun 0
>> command = Read (10) 00 75 94 40 00 00 10 00 00
>> mptscsi: ioc1: task abort: SUCCESS (sc=0000010038904c40)
>
> That does not look like a picture of happiness, indeed, no. You have
> SCSI commands aborting.
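
For anyone finding this thread later, here is a rough sketch of how the
abort attempts quoted above could be counted per controller in saved
kernel log output. It is only an illustration, not something that was
run as part of this thread: the function name, the regular expression,
and the sample list are assumptions based solely on the mptscsi lines
quoted above, and other driver versions may format the message
differently.

```python
import re
from collections import Counter

# Assumed pattern, derived only from the log lines quoted in this thread, e.g.
#   mptscsi: ioc1: attempting task abort! (sc=0000010038904c40)
ABORT_RE = re.compile(r"mptscsi: (ioc\d+): attempting task abort!")


def count_task_aborts(lines):
    """Count abort attempts per controller name (e.g. 'ioc1')."""
    counts = Counter()
    for line in lines:
        match = ABORT_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


# Sample input: the lines quoted earlier in this message.
sample = [
    "mptscsi: ioc1: attempting task abort! (sc=0000010038904c40)",
    "scsi1 : destination target 0, lun 0",
    "command = Read (10) 00 75 94 40 00 00 10 00 00",
    "mptscsi: ioc1: task abort: SUCCESS (sc=0000010038904c40)",
]

for controller, n in count_task_aborts(sample).items():
    print(f"{controller}: {n} abort attempt(s)")
```

A controller whose count keeps climbing while the array itself reports
no faults would be a hint to look at the individual disks behind it.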

While the array was reporting no problems, one of the disks was really
lagging the others. We have swapped it out. Thanks for the feedback,
everyone.

>
>> Lustre: 6698:0:(lustre_fsfilt.h:306:fsfilt_setattr()) nobackup-OST0001: slow setattr 100s
>> Lustre: 6698:0:(watchdog.c:312:lcw_update_time()) Expired watchdog for pid 6698 disabled after 103.1261s
>
> Those are just fallout from the above disk situation.
>
> b.
>
> _______________________________________________
> Lustre-discuss mailing list
> Lustre-discuss@lists.lustre.org
> http://lists.lustre.org/mailman/listinfo/lustre-discuss

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (Darwin)

iD8DBQFIZUq/MFCQB4Bvz5QRAvacAJ9jkhi+2KgfbJ7bUI/KfHJ0Hnq1wQCeNgHO
d6+tzscwCqwYtuHXmzT2kFI=
=5p1N
-----END PGP SIGNATURE-----