Re: [zfs-discuss] File contents changed with no ZFS error

2011-10-27 Thread Robert Watzlavick
Just to close out the discussion, I wasn't able to prove any issues with 
ZFS.  The files that were changed all seem to have plausible scenarios.


I've moved my external USB drive backups over to ZFS directly connected 
to the file server and it's all working fine.


Thanks for everyone's help!

-Bob


___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] File contents changed with no ZFS error

2011-10-24 Thread Garrett D'Amore
You're using an *old* version of both OpenSolaris and zpool.  There have been a 
few corruption bugs fixed since then.  I'd recommend updating.

- Garrett

On Oct 22, 2011, at 9:27 AM, Robert Watzlavick wrote:

 I've noticed something strange over the past few months with four files on my 
 raidz.  Here's the setup:
 OpenSolaris snv_111b
 ZFS Pool version 14
 AMD-based server with ECC RAM.
 5 ST3500630AS 500 GB SATA drives (4 active plus spare) in raidz1
 
 The other day, I observed what appears to be undetected file corruption in 4 
 of the files on the raidz.  I have two external USB hard drives that I use to 
 back up the contents of the ZFS raidz on alternating months.  The USB hard 
 drives use EXT3 so they are connected to a Linux box which in turn connects 
 to the raidz over NFS.  Occasionally, I use the checksum option on rsync 
 (rsync -ainc) to make sure everything on the USB hard drives match before I 
 perform the real rsync back from the raid to the USB disk and that's when I 
 noticed the changes.  In each file, there was a single byte changed.  Running 
 zpool status doesn't show any errors and running zpool scrub doesn't show any 
 problems either.
 
 One of the changed files was a .ppt file that I downloaded from the web over 
 a year ago and the other 3 were Acronis incremental Backup files from my XP 
 machine that get stored on the raidz.  Since ZFS files aren't supposed to be 
 corrupted without notification (right?), I initially assumed the problem was 
 with the USB drive.  For the 3 Acronis backup files, I had no way of knowing 
 which version was the correct one because Acronis shows all of them to be 
 valid.  The .ppt file was not on the web anymore but with the help of the 
 Wayback machine, I was able to re-download it and that's when I confirmed the 
 good copy from the web matches the copy on my USB hard drive, not the copy 
 on the raidz.  I know I haven't modified the .ppt file because the date still 
 matches the date I downloaded it, 2010-01-12.
 
 What failure scenario could have caused this?  The file was obviously 
 initially good on the raidz because it got backed up to the USB drive and 
 that matches the good version from the web.
 
 Thanks in advance,
 -Bob
 
 ___
 zfs-discuss mailing list
 zfs-discuss@opensolaris.org
 http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] File contents changed with no ZFS error

2011-10-24 Thread Edward Ned Harvey
 From: Robert Watzlavick [mailto:rob...@watzlavick.com]
 Sent: Sunday, October 23, 2011 4:36 PM
 
 Now on to find out why the 3 Acronis Backup files got modified.  This is
 good news so far...

I expect you'll find the same thing for Acronis.  Acronis updates those
individual files to make them aware of each other.  When you open file1.tib,
it knows it's part of a backup set (version chain) that includes file0,
file1, file2, file3.  Of course, this is probably done by Acronis writing a
little bit to file1.tib.

Heaven forefend you should want to backup a truecrypt file.

There is a reason why backups stopped depending on timestamps years ago.
Welcome and praise zfs send.

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] File contents changed with no ZFS error

2011-10-24 Thread Paul Kraus
On Sat, Oct 22, 2011 at 12:27 PM, Robert Watzlavick
rob...@watzlavick.com wrote:

 What failure scenario could have caused this?  The file was obviously
 initially good on the raidz because it got backed up to the USB drive and
 that matches the good version from the web.

I ran into a similar failure with a ZFS shared via SAMBA. The
data has ACLs on them to permit new data to be added, but nothing
modified or removed. When testing the configuration the end users had
no problems using Windows to copy data into the share. When they used
a specific tool to copy (and then verify via checksum) the data, it
was occasionally flagging a bad copy. Turns out that the tool was
actually copying _more_ data than was in the original and then going
back and removing the extra white space at the end of the file (the
files matched byte for byte up until the original ended and the copy
did not). The ZFS ACL was doing what the end user needed (no
modifications).

I reported the problem to the folks who wrote the tool and never
heard anything back. My end users have stopped using that tool for
copies, they still use it for verifications (and for that it is fine
as it does not try to change the data).

This was originally reported to me as a problem with ZFS, SAMBA,
or the ACLs I had set up. It is amazing how much _changing_ of data
goes on with no knowledge by the end users.

-- 
{1-2-3-4-5-6-7-}
Paul Kraus
- Senior Systems Architect, Garnet River ( http://www.garnetriver.com/ )
- Sound Coordinator, Schenectady Light Opera Company (
http://www.sloctheater.org/ )
- Technical Advisor, RPI Players
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] File contents changed with no ZFS error

2011-10-24 Thread Edward Ned Harvey
 From: zfs-discuss-boun...@opensolaris.org [mailto:zfs-discuss-
 boun...@opensolaris.org] On Behalf Of Robert Watzlavick
 
 I have two external USB hard drives
 that I use to back up the contents of the ZFS raidz on alternating
 months.  The USB hard drives use EXT3 so they are connected to a Linux
 box which in turn connects to the raidz over NFS.  Occasionally, I use
 the checksum option on rsync (rsync -ainc) to make sure everything on
 the USB hard drives match before I perform the real rsync back from the
 raid to the USB disk and that's when I noticed the changes.  

I would suggest finding a way to connect the external disks directly to the
ZFS server, and start using zfs send instead.

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] File contents changed with no ZFS error

2011-10-24 Thread Robert Watzlavick


On Oct 24, 2011, at 9:42, Edward Ned Harvey 
opensolarisisdeadlongliveopensola...@nedharvey.com wrote:

 
 I would suggest finding a way to connect the external disks directly to the
 ZFS server, and start using zfs send instead.
 
Since these were my offsite backups I was using Truecrypt which drove the use 
of ext3 and Linux. Also I wanted to be sure in the event of a disaster I could 
easily find a machine to read them. I had reservations about just any machine 
being able to boot the 2009.06  live cd and read them.   But now that more 
distros are starting to support ZFS, I'll probably switch over to it for the 
external USB drive backups. 

-Bob
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


[zfs-discuss] File contents changed with no ZFS error

2011-10-22 Thread Robert Watzlavick
I've noticed something strange over the past few months with four files 
on my raidz.  Here's the setup:

OpenSolaris snv_111b
ZFS Pool version 14
AMD-based server with ECC RAM.
5 ST3500630AS 500 GB SATA drives (4 active plus spare) in raidz1

The other day, I observed what appears to be undetected file corruption 
in 4 of the files on the raidz.  I have two external USB hard drives 
that I use to back up the contents of the ZFS raidz on alternating 
months.  The USB hard drives use EXT3 so they are connected to a Linux 
box which in turn connects to the raidz over NFS.  Occasionally, I use 
the checksum option on rsync (rsync -ainc) to make sure everything on 
the USB hard drives match before I perform the real rsync back from the 
raid to the USB disk and that's when I noticed the changes.  In each 
file, there was a single byte changed.  Running zpool status doesn't 
show any errors and running zpool scrub doesn't show any problems either.


One of the changed files was a .ppt file that I downloaded from the web 
over a year ago and the other 3 were Acronis incremental Backup files 
from my XP machine that get stored on the raidz.  Since ZFS files aren't 
supposed to be corrupted without notification (right?), I initially 
assumed the problem was with the USB drive.  For the 3 Acronis backup 
files, I had no way of knowing which version was the correct one because 
Acronis shows all of them to be valid.  The .ppt file was not on the web 
anymore but with the help of the Wayback machine, I was able to 
re-download it and that's when I confirmed the good copy from the web 
matches the copy on my USB hard drive, not the copy on the raidz.  I 
know I haven't modified the .ppt file because the date still matches the 
date I downloaded it, 2010-01-12.


What failure scenario could have caused this?  The file was obviously 
initially good on the raidz because it got backed up to the USB drive 
and that matches the good version from the web.


Thanks in advance,
-Bob

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] File contents changed with no ZFS error

2011-10-22 Thread Edward Ned Harvey
 From: zfs-discuss-boun...@opensolaris.org [mailto:zfs-discuss-
 boun...@opensolaris.org] On Behalf Of Robert Watzlavick
  
 What failure scenario could have caused this?  The file was obviously
 initially good on the raidz because it got backed up to the USB drive
 and that matches the good version from the web.

How can you outrule the possibility of something changed the file.
Intentionally, not as a form of filesystem corruption. 

If you have snapshots on your ZFS filesystem, you can use zhist (or whatever
technique you want) to see in which snapshot(s) it changed, and find all the
unique versions of it.  'Course that will only give you any valuable
information if you have different versions of the file in different
snapshots.

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] File contents changed with no ZFS error

2011-10-22 Thread Robert Watzlavick
On Oct 22, 2011, at 13:14, Edward Ned Harvey 
opensolarisisdeadlongliveopensola...@nedharvey.com wrote:
 
 How can you outrule the possibility of something changed the file.
 Intentionally, not as a form of filesystem corruption.

I suppose that's possible but seems unlikely. One byte on a file changed on the 
disk with no corresponding change in the mod time seems unlikely. I did access 
that file for read sometime I'm the past few months but again, if it had 
accidentally been written to, the time would have been updated. 
 
 If you have snapshots on your ZFS filesystem, you can use zhist (or whatever
 technique you want) to see in which snapshot(s) it changed, and find all the
 unique versions of it.  'Course that will only give you any valuable
 information if you have different versions of the file in different
 snapshots.
 
I only have one or two snapshots but I'll look. 

Thanks,
-Bob
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] File contents changed with no ZFS error

2011-10-22 Thread Robert Watzlavick

On Oct 22, 2011, at 12:55, Garrett D'Amore garrett.dam...@nexenta.com wrote:

 You're using an *old* version of both OpenSolaris and zpool.  There have been 
 a few corruption bugs fixed since then.  I'd recommend updating.
 
- Garrett
 
I was looking for the changelist to see if any have been fixed but couldnt find 
it. This is  2009.06 with some updates but not the final set.  It's just a 
fileserver so I was tempted to freeze the config since it's not on the net and 
it has been working so well (then again maybe not). Do you know if the final 
updates to 2009.06 can still be applied?

-Bob

 On Oct 22, 2011, at 9:27 AM, Robert Watzlavick wrote:
 
 I've noticed something strange over the past few months with four files on 
 my raidz.  Here's the setup:
 OpenSolaris snv_111b
 ZFS Pool version 14
 AMD-based server with ECC RAM.
 5 ST3500630AS 500 GB SATA drives (4 active plus spare) in raidz1
 
 The other day, I observed what appears to be undetected file corruption in 4 
 of the files on the raidz.  I have two external USB hard drives that I use 
 to back up the contents of the ZFS raidz on alternating months.  The USB 
 hard drives use EXT3 so they are connected to a Linux box which in turn 
 connects to the raidz over NFS.  Occasionally, I use the checksum option on 
 rsync (rsync -ainc) to make sure everything on the USB hard drives match 
 before I perform the real rsync back from the raid to the USB disk and 
 that's when I noticed the changes.  In each file, there was a single byte 
 changed.  Running zpool status doesn't show any errors and running zpool 
 scrub doesn't show any problems either.
 
 One of the changed files was a .ppt file that I downloaded from the web over 
 a year ago and the other 3 were Acronis incremental Backup files from my XP 
 machine that get stored on the raidz.  Since ZFS files aren't supposed to be 
 corrupted without notification (right?), I initially assumed the problem was 
 with the USB drive.  For the 3 Acronis backup files, I had no way of knowing 
 which version was the correct one because Acronis shows all of them to be 
 valid.  The .ppt file was not on the web anymore but with the help of the 
 Wayback machine, I was able to re-download it and that's when I confirmed 
 the good copy from the web matches the copy on my USB hard drive, not the 
 copy on the raidz.  I know I haven't modified the .ppt file because the date 
 still matches the date I downloaded it, 2010-01-12.
 
 What failure scenario could have caused this?  The file was obviously 
 initially good on the raidz because it got backed up to the USB drive and 
 that matches the good version from the web.
 
 Thanks in advance,
 -Bob
 
 ___
 zfs-discuss mailing list
 zfs-discuss@opensolaris.org
 http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
 
 
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] File contents changed with no ZFS error

2011-10-22 Thread Mark Sandrock
Why don't you see which byte differs, and how it does?
Maybe that would suggest the failure mode. Is it the
same byte data in all affected files, for instance?

Mark

Sent from my iPhone

On Oct 22, 2011, at 2:08 PM, Robert Watzlavick rob...@watzlavick.com wrote:

 On Oct 22, 2011, at 13:14, Edward Ned Harvey 
 opensolarisisdeadlongliveopensola...@nedharvey.com wrote:
 
 How can you outrule the possibility of something changed the file.
 Intentionally, not as a form of filesystem corruption.
 
 I suppose that's possible but seems unlikely. One byte on a file changed on 
 the disk with no corresponding change in the mod time seems unlikely. I did 
 access that file for read sometime I'm the past few months but again, if it 
 had accidentally been written to, the time would have been updated. 
 
 If you have snapshots on your ZFS filesystem, you can use zhist (or whatever
 technique you want) to see in which snapshot(s) it changed, and find all the
 unique versions of it.  'Course that will only give you any valuable
 information if you have different versions of the file in different
 snapshots.
 
 I only have one or two snapshots but I'll look. 
 
 Thanks,
 -Bob
 ___
 zfs-discuss mailing list
 zfs-discuss@opensolaris.org
 http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss