Re: [Lustre-discuss] OSS load in the roof

Brian J. Murrell Fri, 27 Jun 2008 10:07:45 -0700

On Fri, 2008-06-27 at 12:44 -0400, Brock Palen wrote:
> 
> All of them are stuck in un-interruptible sleep.
> Has anyone seen this happen before?  Is this caused by a pending disk  
> failure?


Well, they are certainly stuck because of some blocking I/O.  That could
be disk failure, indeed.

> mptscsi: ioc1: attempting task abort! (sc=0000010038904c40)
> scsi1 : destination target 0, lun 0
>          command = Read (10) 00 75 94 40 00 00 10 00 00
> mptscsi: ioc1: task abort: SUCCESS (sc=0000010038904c40)

That does not look like a picture of happiness, indeed, no.  You have
SCSI commands aborting.

> Lustre: 6698:0:(lustre_fsfilt.h:306:fsfilt_setattr()) nobackup- 
> OST0001: slow setattr 100s
> Lustre: 6698:0:(watchdog.c:312:lcw_update_time()) Expired watchdog  
> for pid 6698 disabled after 103.1261s

Those are just fallout from the above disk situation.

b.

signature.asc
Description: This is a digitally signed message part

_______________________________________________
Lustre-discuss mailing list
[email protected]
http://lists.lustre.org/mailman/listinfo/lustre-discuss

Re: [Lustre-discuss] OSS load in the roof

Reply via email to