Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-07 Thread Jeremy Teo
The whole raid does not fail -- we are talking about corruption here. If you lose some inodes your whole partition is not gone. My ZFS pool would not salvage -- poof, whole thing was gone (granted it was a test one and not a raidz or mirror yet). But still, for what happened, I cannot believe

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-02 Thread Casper . Dik
While other file systems, when they become corrupt, allow you to salvage data :-) They allow you to salvage what you *think* is your data. But in reality, you have no clue what the disks are giving you. Casper ___ zfs-discuss mailing list

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-02 Thread Al Hopper
On Sat, 2 Dec 2006, Chad Leigh -- Shire.Net LLC wrote: On Dec 2, 2006, at 12:06 AM, Ian Collins wrote: Chad Leigh -- Shire.Net LLC wrote: On Dec 1, 2006, at 10:17 PM, Ian Collins wrote: Chad Leigh -- Shire.Net LLC wrote: There is not? People buy disk drives and expect them to

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-02 Thread Al Hopper
On Sat, 2 Dec 2006, Chad Leigh -- Shire.Net LLC wrote: On Dec 2, 2006, at 6:01 AM, [EMAIL PROTECTED] wrote: While other file systems, when they become corrupt, allow you to salvage data :-) They allow you to salvage what you *think* is your data. But in reality, you have no

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-02 Thread Toby Thain
On 2-Dec-06, at 12:56 PM, Al Hopper wrote: On Sat, 2 Dec 2006, Chad Leigh -- Shire.Net LLC wrote: On Dec 2, 2006, at 6:01 AM, [EMAIL PROTECTED] wrote: While other file systems, when they become corrupt, allow you to salvage data :-) They allow you to salvage what you *think* is your

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-02 Thread Chad Leigh -- Shire.Net LLC
On Dec 2, 2006, at 10:56 AM, Al Hopper wrote: On Sat, 2 Dec 2006, Chad Leigh -- Shire.Net LLC wrote: On Dec 2, 2006, at 6:01 AM, [EMAIL PROTECTED] wrote: While other file systems, when they become corrupt, allow you to salvage data :-) They allow you to salvage what you *think* is

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-02 Thread Jeff Victor
Chad Leigh -- Shire.Net LLC wrote: On Dec 2, 2006, at 10:56 AM, Al Hopper wrote: On Sat, 2 Dec 2006, Chad Leigh -- Shire.Net LLC wrote: On Dec 2, 2006, at 6:01 AM, [EMAIL PROTECTED] wrote: While other file systems, when they become corrupt, allow you to salvage data :-) They allow

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-02 Thread Chad Leigh -- Shire.Net LLC
On Dec 2, 2006, at 12:29 PM, Jeff Victor wrote: Chad Leigh -- Shire.Net LLC wrote: On Dec 2, 2006, at 10:56 AM, Al Hopper wrote: On Sat, 2 Dec 2006, Chad Leigh -- Shire.Net LLC wrote: On Dec 2, 2006, at 6:01 AM, [EMAIL PROTECTED] wrote: While other file systems, when they become

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-02 Thread Dick Davies
On 02/12/06, Chad Leigh -- Shire.Net LLC [EMAIL PROTECTED] wrote: On Dec 2, 2006, at 10:56 AM, Al Hopper wrote: On Sat, 2 Dec 2006, Chad Leigh -- Shire.Net LLC wrote: On Dec 2, 2006, at 6:01 AM, [EMAIL PROTECTED] wrote: When you have subtle corruption, some of the data and meta data

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-02 Thread Rich Teer
On Sat, 2 Dec 2006, Al Hopper wrote: Some people on this list think that the RAID arrays are more likely to corrupt your data than JBOD (both with ZFS on top, for example, a ZFS mirror of 2 raid arrays or a JBOD mirror or raidz). There is no Can you present a cut/paste where that

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-01 Thread Al Hopper
On Tue, 28 Nov 2006, Elizabeth Schwartz wrote: Well, I fixed the HW but I had one bad file, and the problem was that ZFS Hi Elizabeth, Followup: When you say you fixed the HW, I'm curious as to what you found and if this experience with ZFS convinced you that your trusted RAID H/W did, in

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-01 Thread Chad Leigh -- Shire.Net LLC
On Dec 1, 2006, at 9:50 AM, Al Hopper wrote: Followup: When you say you fixed the HW, I'm curious as to what you found and if this experience with ZFS convinced you that your trusted RAID H/W did, in fact, have issues? Do you think that it's likely that there are others running production

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-01 Thread Dana H. Myers
Chad Leigh -- Shire.Net LLC wrote: On Dec 1, 2006, at 9:50 AM, Al Hopper wrote: Followup: When you say you fixed the HW, I'm curious as to what you found and if this experience with ZFS convinced you that your trusted RAID H/W did, in fact, have issues? Do you think that it's likely

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-01 Thread Chad Leigh -- Shire.Net LLC
On Dec 1, 2006, at 4:34 PM, Dana H. Myers wrote: Chad Leigh -- Shire.Net LLC wrote: On Dec 1, 2006, at 9:50 AM, Al Hopper wrote: Followup: When you say you fixed the HW, I'm curious as to what you found and if this experience with ZFS convinced you that your trusted RAID H/W did, in

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-01 Thread Dana H. Myers
Chad Leigh -- Shire.Net LLC wrote: On Dec 1, 2006, at 4:34 PM, Dana H. Myers wrote: Chad Leigh -- Shire.Net LLC wrote: On Dec 1, 2006, at 9:50 AM, Al Hopper wrote: Followup: When you say you fixed the HW, I'm curious as to what you found and if this experience with ZFS convinced you

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-01 Thread Ian Collins
Chad Leigh -- Shire.Net LLC wrote: On Dec 1, 2006, at 4:34 PM, Dana H. Myers wrote: Chad Leigh -- Shire.Net LLC wrote: And this is different from any other storage system, how? (ie, JBOD controllers and disks can also have subtle bugs that corrupt data) Of course, but there isn't the

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-01 Thread Toby Thain
On 1-Dec-06, at 6:29 PM, Chad Leigh -- Shire.Net LLC wrote: On Dec 1, 2006, at 9:50 AM, Al Hopper wrote: Followup: When you say you fixed the HW, I'm curious as to what you found and if this experience with ZFS convinced you that your trusted RAID H/W did, in fact, have issues? Do you

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-01 Thread Toby Thain
On 1-Dec-06, at 6:36 PM, Chad Leigh -- Shire.Net LLC wrote: On Dec 1, 2006, at 4:34 PM, Dana H. Myers wrote: Chad Leigh -- Shire.Net LLC wrote: On Dec 1, 2006, at 9:50 AM, Al Hopper wrote: Followup: When you say you fixed the HW, I'm curious as to what you found and if this experience

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-01 Thread Chad Leigh -- Shire.Net LLC
On Dec 1, 2006, at 10:17 PM, Ian Collins wrote: Chad Leigh -- Shire.Net LLC wrote: On Dec 1, 2006, at 4:34 PM, Dana H. Myers wrote: Chad Leigh -- Shire.Net LLC wrote: And this is different from any other storage system, how? (ie, JBOD controllers and disks can also have subtle bugs

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-01 Thread Chad Leigh -- Shire.Net LLC
On Dec 1, 2006, at 10:42 PM, Toby Thain wrote: On 1-Dec-06, at 6:36 PM, Chad Leigh -- Shire.Net LLC wrote: On Dec 1, 2006, at 4:34 PM, Dana H. Myers wrote: Chad Leigh -- Shire.Net LLC wrote: On Dec 1, 2006, at 9:50 AM, Al Hopper wrote: Followup: When you say you fixed the HW, I'm

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-01 Thread Ian Collins
Chad Leigh -- Shire.Net LLC wrote: On Dec 2, 2006, at 12:06 AM, Ian Collins wrote: But people expect RAID to protect them from the corruption caused by a partial failure, say a bad block, which is a common failure mode. They do? I must admit no experience with the big standalone raid

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-12-01 Thread Dana H. Myers
Chad Leigh -- Shire.Net LLC wrote: On Dec 2, 2006, at 12:06 AM, Ian Collins wrote: [...] I don't think that the issue here, it's more one of perceived data integrity. People who have been happily using a single RAID 5 are now finding that the array has been silently corrupting their data.

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-11-29 Thread Cindy Swearingen
Hi Betsy, Yes, part of this is a documentation problem. I recently documented the find -inum scenario in the community version of the admin guide. Please see page 156, (well, for next time) here: http://opensolaris.org/os/community/zfs/docs/ We're working on the larger issue as well. Cindy

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-11-28 Thread Elizabeth Schwartz
Well, I fixed the HW but I had one bad file, and the problem was that ZFS was saying delete the pool and restore from tape when, it turns out, the answer is just find the file with the bad inode, delete it, clear the device and scrub. Maybe more of a documentation problme, but it sure is

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-11-28 Thread Toby Thain
On 28-Nov-06, at 10:01 PM, Elizabeth Schwartz wrote: Well, I fixed the HW but I had one bad file, and the problem was that ZFS was saying delete the pool and restore from tape when, it turns out, the answer is just find the file with the bad inode, delete it, clear the device and scrub.

Re: [zfs-discuss] Re: Production ZFS Server Death (06/06)

2006-11-28 Thread David Dyer-Bennet
On 11/28/06, Elizabeth Schwartz [EMAIL PROTECTED] wrote: Well, I fixed the HW but I had one bad file, and the problem was that ZFS was saying delete the pool and restore from tape when, it turns out, the answer is just find the file with the bad inode, delete it, clear the device and scrub.