Marc Bevand wrote: > Paul Raines <raines <at> nmr.mgh.harvard.edu> writes: > >> Mar 9 03:22:16 raidsrv03 sata: NOTICE: >> /pci <at> 0,0/pci1022,7458 <at> 1/pci11ab,11ab <at> 1: >> Mar 9 03:22:16 raidsrv03 port 6: device reset >> [...] >> >> The above repeated a few times but now seems to have stopped. >> Running 'hd -c' shows all disks as ok. But it seems like I do have >> a disk problem. But since everything is redundant (zraid) why a >> failed disk should lock up the machine like I saw I don't understand >> unless there is a some bigger issue. >> > > It looks like your Solaris 10U4 install on a Thumper is affected by: > http://bugs.opensolaris.org/view_bug.do?bug_id=6587133 > Which was discussed here: > http://opensolaris.org/jive/thread.jspa?messageID=189256 > http://opensolaris.org/jive/thread.jspa?messageID=163460 > > Apply T-PATCH 127871-02, or upgrade to snv_73, or wait for 10U5. > > I think you jumped to a conclusion that is probably not warranted. First he said that the machine was hung and there were no messages associated with the hang. Later, after rebooting he saw a few messages about a (apparently) single bad sector and the system was not hung and recovered from the error in a reasonable amount of time. When asked, he replied that he had no evidence to connect the two events. At no time did he report anything about DMA timeouts. Please don't jump to conclusions.
Regards, Lida _______________________________________________ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss