On 3/31/2016 10:37 PM, Junxiao Bi wrote:
> On 04/01/2016 11:20 AM, Jay Vasa wrote:
>> On 3/31/2016 6:36 PM, Herbert van den Bergh wrote:
>>> It seems to me that the reason fsck -fn is reporting errors is because
>>> it isn't replaying the journal:
>>>
>>> ** Skipping journal replay because -n was
On 3/31/2016 6:28 PM, Junxiao Bi wrote:
> On 04/01/2016 09:21 AM, Jay Vasa wrote:
>> I never did an fsck -fn with it being mounted. I understand that will
>> cause cause errors.
>> It has never been mounted whenever I did any fsck, either -fn -fy. I was
>> trying to say that it sucks that I have
Hi,
>> So, we have 2 problems now.
>> What's the matter with fsck?
>> It's very weired:-/
> Yes very weird. The main issue is that I need to fsck this filesystem. I
> hope Junxiao can help.
>> How this error happend in kernel?
>> If there's not solution available right now, none of them is easy
Hi Michael,
Yes, currently the best way is to copy out data as much as possible and
recreate the ocfs2 volume, then restore back the data.
I haven't encountered this issue before and don't know which case can
lead to it, so I'm sorry I can't give you the advice which can avoid
this issue.
But I
Joseph,
thanks again for your help!
Currently I'm dumping out 4 TB of data from the broken ocfs2 device to
an external disk. I have shut down the cluster and have the fs mounted
read-only on a single node. It seems that the data structures are still
intact and that the file system problems are
Hi Michael,
On 2016/3/24 21:47, Michael Ulbrich wrote:
> Hi Joseph,
>
> thanks for this information although this does not sound too optimistic ...
>
> So, if I understand you correctly, if we had a metadata backup from
> o2image _before_ the crash we could have looked up the missing info to
>
Hi Joseph,
thanks for this information although this does not sound too optimistic ...
So, if I understand you correctly, if we had a metadata backup from
o2image _before_ the crash we could have looked up the missing info to
remove the loop from group chain 73, right?
But how could the loop
Hi Michael,
So I think the block of record #153 goes wrong, which points next to
block 4083643392 of record #19.
But the problem is we don't know the right info of the block of record
#153, otherwise we can dd out, edit it and then dd in to fix it.
Thanks,
Joseph
On 2016/3/24 18:38, Michael
Hi Joseph,
ok, got it! Here's the loop in chain 73:
Group Chain: 73 Parent Inode: 13 Generation: 1172963971
CRC32: ECC:
## Block#TotalUsed Free Contig Size
0428077363215872114874385 1774 1984
1258326323215872
Hi Michael,
It seems that dead loop happens in chain 73. You have formatted using 2K
block and 4K cluster, so each chain should have 1522 or 1521 records.
But at first glance, I cannot figure out which block goes wrong, because
the output you pasted indicates all blocks are different. So I suggest
Hi Joseph,
thanks a lot for your help. It is very much appreciated!
I ran debugsfs.ocfs2 from ocfs2-tools 1.6.4 on the mounted file system:
root@s1a:~# debugfs.ocfs2 -R 'stat //global_bitmap' /dev/drbd1 >
debugfs_drbd1.log 2>&1
Inode: 13 Mode: 0644 Generation: 1172963971 (0x45ea0283)
FS
Hi Michael,
Could you please use debugfs to check the output?
# debugfs.ocfs2 -R 'stat //global_bitmap'
Thanks,
Joseph
On 2016/3/24 6:38, Michael Ulbrich wrote:
> Hi ocfs2-users,
>
> my first post to this list from yesterday probably didn't get through.
>
> Anyway, I've made some progress in
Hi ocfs2-users,
my first post to this list from yesterday probably didn't get through.
Anyway, I've made some progress in the meantime and may now ask more
specific questions ...
I'm having issues with an 11 TB ocfs2 shared filesystem on Debian Wheezy:
Linux s1a 3.2.0-4-amd64 #1 SMP Debian
I don't know if is it possible, but kernel panic error is not in
/var/log/kern.log.
2011/5/13 Sunil Mushran sunil.mush...@oracle.com
Please do not remove the cc-s.
Hard for me to comment without knowing anything about the panic.
However, assuming that the panic message indicated that the
Hello,
Is it possible to fsck a mounted filesystem. When one of the cluster nodes
reboots because a kernel panic, the device requires fsck.ocfs2 because in
mounted.ocfs2 -f rebooted node is shown.
--
Xavier Diumé
http://socaqui.cat
___
Ocfs2-users
On 05/13/2011 11:44 AM, Xavier Diumé wrote:
Hello,
Is it possible to fsck a mounted filesystem. When one of the cluster nodes
reboots because a kernel panic, the device requires fsck.ocfs2 because in
mounted.ocfs2 -f rebooted node is shown.
If mounted.ocfs2 -f shows the rebooted node, that
But initially the system had devices in /etf/fstab with _netdev option. When
system starts mounting a kernel panic appears, sometimes after few minuts.
The only way that I could start the system was mounting all devices one by
one, with a previups fsck.
I don't know if it is the better way, but is
We are setting up 2 new EL5 U4 machines to replace our current database servers
running our demo environment. We use 3Par SANs and their snap clone options.
The current production system we snap clone from is EL4 U5 with ocfs2 1.2.9,
the new servers have ocfs2 1.4.3 installed. Part of the
: [Ocfs2-users] fsck.ocfs2 using huge amount of memory?
We are setting up 2 new EL5 U4 machines to replace our current database
servers running our demo environment. We use 3Par SANs and their snap
clone options. The current production system we snap clone from is EL4
U5 with ocfs2 1.2.9, the new
is 1.4.3.
-Original Message-
From: ocfs2-users-boun...@oss.oracle.com [mailto:ocfs2-users-
boun...@oss.oracle.com] On Behalf Of Ulf Zimmermann
Sent: Thursday, May 20, 2010 6:00 PM
To: ocfs2-users@oss.oracle.com
Subject: [Ocfs2-users] fsck.ocfs2 using huge amount of memory?
We
On Thu, May 20, 2010 at 06:00:19PM -0700, Ulf Zimmermann wrote:
We are setting up 2 new EL5 U4 machines to replace our current database
servers running our demo environment. We use 3Par SANs and their snap clone
options. The current production system we snap clone from is EL4 U5 with
ocfs2
...@oss.oracle.com [mailto:ocfs2-users-
boun...@oss.oracle.com] On Behalf Of Ulf Zimmermann
Sent: Thursday, May 20, 2010 6:00 PM
To: ocfs2-users@oss.oracle.com
Subject: [Ocfs2-users] fsck.ocfs2 using huge amount of memory?
We are setting up 2 new EL5 U4 machines to replace our current
We setup four nodes connected to an iSCSI SAN with 3 ocfs2 volumes.
Last Friday, one of the volumes was set read-only because of the following
error:
Apr 9 01:44:43 fremont kernel: [44313.726447] OCFS2: ERROR (device dm-3):
ocfs2_validate_gd_self: Group descriptor #56060928 has bit count
Sunil,
Bug 1236. Thanks very much.
--
Carl Benson, PHS Linux SysAdmin (206-667-4862, cben...@fhcrc.org)
On 03/18/2010 11:32 AM, Sunil Mushran wrote:
One option is to provide me with the o2image of the volume.
# o2image -r /dev/sda1 - | bzip2 sda1.out.bz2
File a bugzilla and add the link
Hello!
I searched through the mailing list back to 07/2008, and didn't see
this question answered before.
I have 7 systems that use an ocfs2 filesystem. After many months of
solid reliable use, they all crashed yesterday.
6 systems run openSUSE 11.1, kernel 2.627.29-0.1-default, with these
One option is to provide me with the o2image of the volume.
# o2image -r /dev/sda1 - | bzip2 sda1.out.bz2
File a bugzilla and add the link to that image. (The bz cannot
handle large files.)
The other option is to file a bz and attach the stat_sysdir output.
26 matches
Mail list logo