Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Ray Arachelian
On 11/16/2012 07:15 PM, Peter Jeremy wrote: I have been tracking down a problem with zfs diff that reveals itself variously as a hang (unkillable process), panic or error, depending on the ZFS kernel version but seems to be caused by corruption within the pool. I am using FreeBSD but the issue

Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Peter Jeremy
On 2012-Nov-19 11:02:06 -0500, Ray Arachelian r...@arachelian.com wrote: Is the pool importing properly at least? Maybe you can create another volume and transfer the data over for that volume, then destroy it? The pool is imported and passes all tests except zfs diff. Creating another pool

Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Freddie Cash
On Mon, Nov 19, 2012 at 9:03 AM, Peter Jeremy pe...@rulingia.com wrote: On 2012-Nov-19 11:02:06 -0500, Ray Arachelian r...@arachelian.com wrote: Is the pool importing properly at least? Maybe you can create another volume and transfer the data over for that volume, then destroy it? The pool is

Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Ray Arachelian
On 11/19/2012 12:03 PM, Peter Jeremy wrote: On 2012-Nov-19 11:02:06 -0500, Ray Arachelian r...@arachelian.com wrote: The damage exists in the oldest snapshot for that filesystem. Are you able to delete that snapshot? ___ zfs-discuss mailing list

Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Peter Jeremy
On 2012-Nov-19 13:47:01 -0500, Ray Arachelian r...@arachelian.com wrote: On 11/19/2012 12:03 PM, Peter Jeremy wrote: The damage exists in the oldest snapshot for that filesystem. Are you able to delete that snapshot? Yes but it has no effect - the corrupt object exists in the current pool so

Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Mark Shellenbaum
On 11/16/12 17:15, Peter Jeremy wrote: I have been tracking down a problem with zfs diff that reveals itself variously as a hang (unkillable process), panic or error, depending on the ZFS kernel version but seems to be caused by corruption within the pool. I am using FreeBSD but the issue looks

Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Jim Klimov
On 2012-11-19 20:28, Peter Jeremy wrote: Yep - that's the fallback solution. With 1874 snapshots spread over 54 filesystems (including a couple of clones), that's a major undertaking. (And it loses timestamp information). Well, as long as you have and know the base snapshots for the clones,

Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Jim Klimov
On 2012-11-19 20:58, Mark Shellenbaum wrote: There is probably nothing wrong with the snapshots. This is a bug in ZFS diff. The ZPL parent pointer is only guaranteed to be correct for directory objects. What you probably have is a file that was hard linked multiple times and the parent

Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Jim Klimov
Oh, and one more thing: rsync is only good if your filesystems don't really rely on ZFS/NFSv4-style ACLs. If you need those, you are stuck with Solaris tar or Solaris cpio to carry the files over, or you have to script up replication of ACLs after rsync somehow. You should also replicate the

Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Tomas Forsman
On 19 November, 2012 - Jim Klimov sent me these 1,1K bytes: Oh, and one more thing: rsync is only good if your filesystems don't really rely on ZFS/NFSv4-style ACLs. If you need those, you are stuck with Solaris tar or Solaris cpio to carry the files over, or you have to script up replication

Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Mark Shellenbaum
On 11/19/12 1:14 PM, Jim Klimov wrote: On 2012-11-19 20:58, Mark Shellenbaum wrote: There is probably nothing wrong with the snapshots. This is a bug in ZFS diff. The ZPL parent pointer is only guaranteed to be correct for directory objects. What you probably have is a file that was hard

Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Jim Klimov
On 2012-11-19 22:38, Mark Shellenbaum wrote: The parent pointer is a single 64 bit quantity that can't track all the possible parents a hard linked file could have. I believe it is inode number of the parent, or similar to that - and an available inode number can get recycled and used by newer

Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Peter Jeremy
On 2012-Nov-19 21:10:56 +0100, Jim Klimov jimkli...@cos.ru wrote: On 2012-11-19 20:28, Peter Jeremy wrote: Yep - that's the fallback solution. With 1874 snapshots spread over 54 filesystems (including a couple of clones), that's a major undertaking. (And it loses timestamp information). Well,

Re: [zfs-discuss] Repairing corrupted ZFS pool

2012-11-19 Thread Peter Jeremy
On 2012-Nov-19 14:38:30 -0700, Mark Shellenbaum mark.shellenb...@oracle.com wrote: On 11/19/12 1:14 PM, Jim Klimov wrote: On 2012-11-19 20:58, Mark Shellenbaum wrote: There is probably nothing wrong with the snapshots. This is a bug in ZFS diff. The ZPL parent pointer is only guaranteed to