Re: [Ocfs2-users] Huge Problem ocfs2

2012-11-09 Thread Laurentiu Gosu
Hi Sunil, Do you ANY other idea to recover our data? Maybe you know same recovery tool that we could use? We would really need it. Thank you for your help. Laurentiu. On 11/10/2012 04:25, Marian Serban wrote: debugfs: ls / ls: Bad magic number in inode while checking directory at block 129

Re: [Ocfs2-users] ocfs2_unlink:953 ERROR: status = -39

2012-01-11 Thread Laurentiu Gosu
I keep getting the messages. Anybody, any idea? lg. On 12/21/2011 11:21, Laurentiu Gosu wrote: No idea? I found this http://oss.oracle.com/pipermail/ocfs2-devel/2011-May/008077.html but i'm not sure it's related. br, lg. On 12/20/2011 12:31, Laurentiu Gosu wrote: Hi, I have just moved

[Ocfs2-users] ocfs2_unlink:953 ERROR: status = -39

2011-12-20 Thread Laurentiu Gosu
Hi, I have just moved(about 2 weeks ago) an OCFS2 cluster into production(latest ocfs2, kernel 2.6.39). I'm getting in nodes dmesg this kind of messages: (rmdir,19142,7):ocfs2_unlink:953 ERROR: status = -39 (rmdir,19305,10):ocfs2_unlink:953 ERROR: status = -39 (rmdir,19556,7):ocfs2_unlink:953

Re: [Ocfs2-users] Unable to stop cluster as heartbeat region still active

2011-12-11 Thread Laurentiu Gosu
ocfs2_hb_ctl in gdb. The problem is in the code path that begins in the function lookup_dev(). On 10/23/2011 01:30 PM, Laurentiu Gosu wrote: #rpm -qa |grep ocfs2 ocfs2console-1.6.3-2.el5 ocfs2-tools-1.6.3-2.el5 Just let me know if I can give more details to find the problem. I will move ocfs2

Re: [Ocfs2-users] Unable to stop cluster as heartbeat region still active

2011-10-23 Thread Laurentiu Gosu
to start both nodes from scratch. Do not start/stop heartbeat manually. Also, do not force-format. On 10/18/2011 03:54 PM, Laurentiu Gosu wrote: OK, i rebooted one of the nodes(both had similar issues); . But something is still fishy. - i mounted the device: mount -t ocfs2 /dev/volgr1/lvol0 /mnt/tmp

Re: [Ocfs2-users] Unable to stop cluster as heartbeat region still active

2011-10-23 Thread Laurentiu Gosu
/2011 17:57, Sunil Mushran wrote: I think it stops by uuid. So try doing this the next time. You are encountering some issue that we have not seen before. ocfs2_hb_ctl -K -u 0C4AB55FE9314FA5A9F81652FDB9B22D ocfs2 On 10/23/2011 05:32 AM, Laurentiu Gosu wrote: Hi Sunil, Sorry for my late reply, i just

[Ocfs2-users] Unable to stop cluster as heartbeat region still active

2011-10-18 Thread Laurentiu Gosu
Hi, I have a 2 nodes ocfs2 cluster running UEK 2.6.32-100.0.19.el5, ocfs2console-1.6.3-2.el5, ocfs2-tools-1.6.3-2.el5. My problem is that all the time when i try to run /etc/init.d/o2cb stop it fails with this error: Stopping O2CB cluster CLUSTER: Failed Unable to stop cluster as

Re: [Ocfs2-users] Unable to stop cluster as heartbeat region still active

2011-10-18 Thread Laurentiu Gosu
/sys/kernel/debug/o2dlm On 10/18/2011 02:14 PM, Laurentiu Gosu wrote: Here is the output: ls -lR /sys/kernel/config/cluster /sys/kernel/config/cluster: total 0 drwxr-xr-x 4 root root 0 Oct 19 00:12 CLUSTER /sys/kernel/config/cluster/CLUSTER: total 0 -rw-r--r-- 1 root root 4096 Oct 19 00

Re: [Ocfs2-users] Unable to stop cluster as heartbeat region still active

2011-10-18 Thread Laurentiu Gosu
wrote: mount -t debugfs debugfs /sys/kernel/debug Then list that dir. Also, do: ocfs2_hb_ctl -l -d /dev/dm-2 Be careful before killing. We want to be sure that dev is not mounted. On 10/18/2011 02:23 PM, Laurentiu Gosu wrote: Again the outputs: cat /sys/kernel/config/cluster/CLUSTER

Re: [Ocfs2-users] Unable to stop cluster as heartbeat region still active

2011-10-18 Thread Laurentiu Gosu
to fail. Do: mounted.ocfs2 -d On 10/18/2011 02:32 PM, Laurentiu Gosu wrote: ls -lR /sys/kernel/debug/ocfs2 /sys/kernel/debug/ocfs2: total 0 ls -lR /sys/kernel/debug/o2dlm /sys/kernel/debug/o2dlm: total 0 ocfs2_hb_ctl -I -d /dev/dm-2 ocfs2_hb_ctl: Device name specified was not found

Re: [Ocfs2-users] Unable to stop cluster as heartbeat region still active

2011-10-18 Thread Laurentiu Gosu
ocfs2_hb_ctl -I -u 0C4AB55FE9314FA5A9F81652FDB9B22D 0C4AB55FE9314FA5A9F81652FDB9B22D: 0 refs On 10/19/2011 00:43, Sunil Mushran wrote: ocfs2_hb_ctl -l -u 0C4AB55FE9314FA5A9F81652FDB9B22D On 10/18/2011 02:40 PM, Laurentiu Gosu wrote: mounted.ocfs2 -d DeviceFS Stack UUID

Re: [Ocfs2-users] Unable to stop cluster as heartbeat region still active

2011-10-18 Thread Laurentiu Gosu
, Laurentiu Gosu wrote: ocfs2_hb_ctl -I -u 0C4AB55FE9314FA5A9F81652FDB9B22D 0C4AB55FE9314FA5A9F81652FDB9B22D: 0 refs On 10/19/2011 00:43, Sunil Mushran wrote: ocfs2_hb_ctl -l -u 0C4AB55FE9314FA5A9F81652FDB9B22D On 10/18/2011 02:40 PM, Laurentiu Gosu wrote: mounted.ocfs2 -d DeviceFS

Re: [Ocfs2-users] Unable to stop cluster as heartbeat region still active

2011-10-18 Thread Laurentiu Gosu
:04, Sunil Mushran wrote: Let's do it by hand. rm -rf /sys/kernel/config/cluster/.../heartbeat/*0C4AB55FE9314FA5A9F81652FDB9B22D * On 10/18/2011 02:52 PM, Laurentiu Gosu wrote: ocfs2_hb_ctl -K -u 0C4AB55FE9314FA5A9F81652FDB9B22D ocfs2_hb_ctl: File not found by ocfs2_lookup while stopping

Re: [Ocfs2-users] Unable to stop cluster as heartbeat region still active

2011-10-18 Thread Laurentiu Gosu
, Laurentiu Gosu wrote: well..this is weird ls /sys/kernel/config/cluster/CLUSTER/heartbeat/ *918673F06F8F4ED188DDCE14F39945F6* dead_threshold looks like we have different UUIDs. Where is this coming from?? ocfs2_hb_ctl -I -u 918673F06F8F4ED188DDCE14F39945F6 918673F06F8F4ED188DDCE14F39945F6: 1

Re: [Ocfs2-users] Unable to stop cluster as heartbeat region still active

2011-10-18 Thread Laurentiu Gosu
formats on that volume. The format will generate a new uuid. Once that happens, the hb tool cannot map the region to the device and thus fail to stop it. Right now the easiest option on this box is resetting it. On 10/18/2011 03:24 PM, Laurentiu Gosu wrote: Yes, i did reformat it(even more than once i