Re: [Ocfs2-users] fsck.ocfs2 not fixing as it outputs errors when checking w/ no flag (-fn) but is clean with yes flag (-fy)

2016-04-01 Thread Jay V
On 3/31/2016 10:37 PM, Junxiao Bi wrote: > On 04/01/2016 11:20 AM, Jay Vasa wrote: >> On 3/31/2016 6:36 PM, Herbert van den Bergh wrote: >>> It seems to me that the reason fsck -fn is reporting errors is because >>> it isn't replaying the journal: >>> >>> ** Skipping journal replay because -n was

Re: [Ocfs2-users] fsck.ocfs2 not fixing as it outputs errors when checking w/ no flag (-fn) but is clean with yes flag (-fy)

2016-03-31 Thread Jay Vasa
On 3/31/2016 6:28 PM, Junxiao Bi wrote: > On 04/01/2016 09:21 AM, Jay Vasa wrote: >> I never did an fsck -fn with it being mounted. I understand that will >> cause cause errors. >> It has never been mounted whenever I did any fsck, either -fn -fy. I was >> trying to say that it sucks that I have

Re: [Ocfs2-users] fsck.ocfs2 not fixing as it outputs errors when checking w/ no flag (-fn) but is clean with yes flag (-fy)

2016-03-29 Thread Eric Ren
Hi, >> So, we have 2 problems now. >> What's the matter with fsck? >> It's very weired:-/ > Yes very weird. The main issue is that I need to fsck this filesystem. I > hope Junxiao can help. >> How this error happend in kernel? >> If there's not solution available right now, none of them is easy

Re: [Ocfs2-users] fsck.ocfs2 loops + hangs but does not check

2016-03-25 Thread Joseph Qi
Hi Michael, Yes, currently the best way is to copy out data as much as possible and recreate the ocfs2 volume, then restore back the data. I haven't encountered this issue before and don't know which case can lead to it, so I'm sorry I can't give you the advice which can avoid this issue. But I

Re: [Ocfs2-users] fsck.ocfs2 loops + hangs but does not check

2016-03-25 Thread Michael Ulbrich
Joseph, thanks again for your help! Currently I'm dumping out 4 TB of data from the broken ocfs2 device to an external disk. I have shut down the cluster and have the fs mounted read-only on a single node. It seems that the data structures are still intact and that the file system problems are

Re: [Ocfs2-users] fsck.ocfs2 loops + hangs but does not check

2016-03-24 Thread Joseph Qi
Hi Michael, On 2016/3/24 21:47, Michael Ulbrich wrote: > Hi Joseph, > > thanks for this information although this does not sound too optimistic ... > > So, if I understand you correctly, if we had a metadata backup from > o2image _before_ the crash we could have looked up the missing info to >

Re: [Ocfs2-users] fsck.ocfs2 loops + hangs but does not check

2016-03-24 Thread Michael Ulbrich
Hi Joseph, thanks for this information although this does not sound too optimistic ... So, if I understand you correctly, if we had a metadata backup from o2image _before_ the crash we could have looked up the missing info to remove the loop from group chain 73, right? But how could the loop

Re: [Ocfs2-users] fsck.ocfs2 loops + hangs but does not check

2016-03-24 Thread Joseph Qi
Hi Michael, So I think the block of record #153 goes wrong, which points next to block 4083643392 of record #19. But the problem is we don't know the right info of the block of record #153, otherwise we can dd out, edit it and then dd in to fix it. Thanks, Joseph On 2016/3/24 18:38, Michael

Re: [Ocfs2-users] fsck.ocfs2 loops + hangs but does not check

2016-03-24 Thread Michael Ulbrich
Hi Joseph, ok, got it! Here's the loop in chain 73: Group Chain: 73 Parent Inode: 13 Generation: 1172963971 CRC32: ECC: ## Block#TotalUsed Free Contig Size 0428077363215872114874385 1774 1984 1258326323215872

Re: [Ocfs2-users] fsck.ocfs2 loops + hangs but does not check

2016-03-24 Thread Joseph Qi
Hi Michael, It seems that dead loop happens in chain 73. You have formatted using 2K block and 4K cluster, so each chain should have 1522 or 1521 records. But at first glance, I cannot figure out which block goes wrong, because the output you pasted indicates all blocks are different. So I suggest

Re: [Ocfs2-users] fsck.ocfs2 loops + hangs but does not check

2016-03-24 Thread Michael Ulbrich
Hi Joseph, thanks a lot for your help. It is very much appreciated! I ran debugsfs.ocfs2 from ocfs2-tools 1.6.4 on the mounted file system: root@s1a:~# debugfs.ocfs2 -R 'stat //global_bitmap' /dev/drbd1 > debugfs_drbd1.log 2>&1 Inode: 13 Mode: 0644 Generation: 1172963971 (0x45ea0283) FS

Re: [Ocfs2-users] fsck.ocfs2 loops + hangs but does not check

2016-03-23 Thread Joseph Qi
Hi Michael, Could you please use debugfs to check the output? # debugfs.ocfs2 -R 'stat //global_bitmap' Thanks, Joseph On 2016/3/24 6:38, Michael Ulbrich wrote: > Hi ocfs2-users, > > my first post to this list from yesterday probably didn't get through. > > Anyway, I've made some progress in

[Ocfs2-users] fsck.ocfs2 loops + hangs but does not check

2016-03-23 Thread Michael Ulbrich
Hi ocfs2-users, my first post to this list from yesterday probably didn't get through. Anyway, I've made some progress in the meantime and may now ask more specific questions ... I'm having issues with an 11 TB ocfs2 shared filesystem on Debian Wheezy: Linux s1a 3.2.0-4-amd64 #1 SMP Debian

Re: [Ocfs2-users] fsck.ocfs2

2011-05-16 Thread Xavier Diumé
I don't know if is it possible, but kernel panic error is not in /var/log/kern.log. 2011/5/13 Sunil Mushran sunil.mush...@oracle.com Please do not remove the cc-s. Hard for me to comment without knowing anything about the panic. However, assuming that the panic message indicated that the

[Ocfs2-users] fsck.ocfs2

2011-05-13 Thread Xavier Diumé
Hello, Is it possible to fsck a mounted filesystem. When one of the cluster nodes reboots because a kernel panic, the device requires fsck.ocfs2 because in mounted.ocfs2 -f rebooted node is shown. -- Xavier Diumé http://socaqui.cat ___ Ocfs2-users

Re: [Ocfs2-users] fsck.ocfs2

2011-05-13 Thread Sunil Mushran
On 05/13/2011 11:44 AM, Xavier Diumé wrote: Hello, Is it possible to fsck a mounted filesystem. When one of the cluster nodes reboots because a kernel panic, the device requires fsck.ocfs2 because in mounted.ocfs2 -f rebooted node is shown. If mounted.ocfs2 -f shows the rebooted node, that

Re: [Ocfs2-users] fsck.ocfs2

2011-05-13 Thread Xavier Diumé
But initially the system had devices in /etf/fstab with _netdev option. When system starts mounting a kernel panic appears, sometimes after few minuts. The only way that I could start the system was mounting all devices one by one, with a previups fsck. I don't know if it is the better way, but is

[Ocfs2-users] fsck.ocfs2 using huge amount of memory?

2010-05-20 Thread Ulf Zimmermann
We are setting up 2 new EL5 U4 machines to replace our current database servers running our demo environment. We use 3Par SANs and their snap clone options. The current production system we snap clone from is EL4 U5 with ocfs2 1.2.9, the new servers have ocfs2 1.4.3 installed. Part of the

Re: [Ocfs2-users] fsck.ocfs2 using huge amount of memory?

2010-05-20 Thread Ulf Zimmermann
: [Ocfs2-users] fsck.ocfs2 using huge amount of memory? We are setting up 2 new EL5 U4 machines to replace our current database servers running our demo environment. We use 3Par SANs and their snap clone options. The current production system we snap clone from is EL4 U5 with ocfs2 1.2.9, the new

Re: [Ocfs2-users] fsck.ocfs2 using huge amount of memory?

2010-05-20 Thread Ulf Zimmermann
is 1.4.3. -Original Message- From: ocfs2-users-boun...@oss.oracle.com [mailto:ocfs2-users- boun...@oss.oracle.com] On Behalf Of Ulf Zimmermann Sent: Thursday, May 20, 2010 6:00 PM To: ocfs2-users@oss.oracle.com Subject: [Ocfs2-users] fsck.ocfs2 using huge amount of memory? We

Re: [Ocfs2-users] fsck.ocfs2 using huge amount of memory?

2010-05-20 Thread Joel Becker
On Thu, May 20, 2010 at 06:00:19PM -0700, Ulf Zimmermann wrote: We are setting up 2 new EL5 U4 machines to replace our current database servers running our demo environment. We use 3Par SANs and their snap clone options. The current production system we snap clone from is EL4 U5 with ocfs2

Re: [Ocfs2-users] fsck.ocfs2 using huge amount of memory?

2010-05-20 Thread Sunil Mushran
...@oss.oracle.com [mailto:ocfs2-users- boun...@oss.oracle.com] On Behalf Of Ulf Zimmermann Sent: Thursday, May 20, 2010 6:00 PM To: ocfs2-users@oss.oracle.com Subject: [Ocfs2-users] fsck.ocfs2 using huge amount of memory? We are setting up 2 new EL5 U4 machines to replace our current

[Ocfs2-users] fsck.ocfs2 question

2010-04-12 Thread Schildwachter, Xavier
We setup four nodes connected to an iSCSI SAN with 3 ocfs2 volumes. Last Friday, one of the volumes was set read-only because of the following error: Apr 9 01:44:43 fremont kernel: [44313.726447] OCFS2: ERROR (device dm-3): ocfs2_validate_gd_self: Group descriptor #56060928 has bit count

Re: [Ocfs2-users] fsck.ocfs2 can't fix an orphaned inode

2010-03-19 Thread Carl J. Benson
Sunil, Bug 1236. Thanks very much. -- Carl Benson, PHS Linux SysAdmin (206-667-4862, cben...@fhcrc.org) On 03/18/2010 11:32 AM, Sunil Mushran wrote: One option is to provide me with the o2image of the volume. # o2image -r /dev/sda1 - | bzip2 sda1.out.bz2 File a bugzilla and add the link

[Ocfs2-users] fsck.ocfs2 can't fix an orphaned inode

2010-03-18 Thread Carl J. Benson
Hello! I searched through the mailing list back to 07/2008, and didn't see this question answered before. I have 7 systems that use an ocfs2 filesystem. After many months of solid reliable use, they all crashed yesterday. 6 systems run openSUSE 11.1, kernel 2.627.29-0.1-default, with these

Re: [Ocfs2-users] fsck.ocfs2 can't fix an orphaned inode

2010-03-18 Thread Sunil Mushran
One option is to provide me with the o2image of the volume. # o2image -r /dev/sda1 - | bzip2 sda1.out.bz2 File a bugzilla and add the link to that image. (The bz cannot handle large files.) The other option is to file a bz and attach the stat_sysdir output.