Re: [zfs-discuss] Re: Re[2]: ZFS in a SAN environment

2006-12-20 Thread Jonathan Edwards


On Dec 20, 2006, at 00:37, Anton B. Rang wrote:

INFORMATION: If a member of this striped zpool becomes  
unavailable or
develops corruption, Solaris will kernel panic and reboot to  
protect your data.


OK, I'm puzzled.

Am I the only one on this list who believes that a kernel panic,  
instead of EIO, represents a bug?


I agree as well - did you file a bug on this yet?

Inducing kernel panics (like we also do on certain sun cluster  
failure types) to prevent corruption can often lead to more  
corruption elsewhere, and usually ripples to throw admins, managers,  
and users in a panic as well - typically resulting in  more corrupted  
opinions and perceptions of reliability and usability.  :)


---
.je
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Re: Re[2]: ZFS in a SAN environment

2006-12-20 Thread Richard Elling

Dennis Clarke wrote:

Anton B. Rang wrote:

INFORMATION: If a member of this striped zpool becomes unavailable or
develops corruption, Solaris will kernel panic and reboot to protect your
data.


Is this the official, long-term stance?  I don't think it is.  I think this
is an interpretation of current state.


OK, I'm puzzled.

Am I the only one on this list who believes that a kernel panic, instead
of EIO, represents a bug?



Nope.  I'm with you.


no no .. its a feature.  :-P

If it walks like a duck and quacks like a duck then its a duck.

a kernel panic that brings down a system is a bug.  Plain and simple.


I disagree (nit).  A hardware fault can also cause a panic.  Faults != bugs.
I do agree in principle, though.  Panics should be avoided whenever possible.

Incidentally, we do track the panic rate and collect panic strings.  The
last detailed analysis I saw on the data showed that the vast majority were
hardware induced.  This was a bit of a bummer because we were hoping that
the tracking data would lead to identifying software bugs.
 -- richard

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Re: Re[2]: ZFS in a SAN environment

2006-12-20 Thread Dennis Clarke


 no no .. its a feature.  :-P

 If it walks like a duck and quacks like a duck then its a duck.

 a kernel panic that brings down a system is a bug.  Plain and simple.

 I disagree (nit).  A hardware fault can also cause a panic.  Faults != bugs.

  ha ha .. yeah.  If the sysadm walks over to a machine an pour coffee in
it then I guess it will fault all over the place.  No appreciation for
coffee I guess.

however ... when it comes to storage I expect that a disk failure or
hot swap will not cause a fault if and only if there still remains some
other storage device that holds the bits in a redundant fashion.

so .. disks can fail.  That should be okay.

even memory and processors can fail.  within reason.

 I do agree in principle, though.  Panics should be avoided whenever
 possible.

 coffee spillage also ..

 Incidentally, we do track the panic rate and collect panic strings.  The
 last detailed analysis I saw on the data showed that the vast majority were
 hardware induced.  This was a bit of a bummer because we were hoping that
 the tracking data would lead to identifying software bugs.

but it does imply that the software is way better than the hardware eh ?

-- 
Dennis Clarke

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Re: Re[2]: ZFS in a SAN environment

2006-12-19 Thread Torrey McMahon

Anton B. Rang wrote:

INFORMATION: If a member of this striped zpool becomes unavailable or
develops corruption, Solaris will kernel panic and reboot to protect your data.



OK, I'm puzzled.

Am I the only one on this list who believes that a kernel panic, instead of 
EIO, represents a bug?
 
  



Nope.  I'm with you.


___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Re: Re[2]: ZFS in a SAN environment

2006-12-19 Thread Dennis Clarke

 Anton B. Rang wrote:
 INFORMATION: If a member of this striped zpool becomes unavailable or
 develops corruption, Solaris will kernel panic and reboot to protect your
 data.


 OK, I'm puzzled.

 Am I the only one on this list who believes that a kernel panic, instead
 of EIO, represents a bug?


 Nope.  I'm with you.

no no .. its a feature.  :-P

If it walks like a duck and quacks like a duck then its a duck.

a kernel panic that brings down a system is a bug.  Plain and simple.

Dennis

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss