I would suggest to upgrade kernel to more recent version or to compile default clean kernel by yourself. 2.6.18.53 is pretty old already, as we have 2.6.24.xx. I use 2.6.23.13 and have no problems. If your computer just hangs - it's kernel or hardware problem, in any other case you would see kernel Oops page. By the way, you have different versions of ocfs2 tools and kernel module.
----- Original Message ----- From: "Sunil Mushran" <[EMAIL PROTECTED]> To: "Miguel Santos N." <[EMAIL PROTECTED]> Cc: <[email protected]>; "Joel Becker" <[EMAIL PROTECTED]> Sent: Wednesday, April 02, 2008 4:05 AM Subject: Re: [Ocfs2-users] SuSe Hangs when /etc/init.d/o2cb online Do a mount and you should see configfs mounted: configfs on /sys/kernel/config type configfs (rw) If so, the hang is puzzling. Email opensuse help list if anyone else has encountered the same. Joel, Any ideas? Miguel Santos N. wrote: > I tried to do what you told me but it hangs and it doesn't even log to > the output file the result of strace. > > However I ran it without -o switch and here is it: > > ========== > execve("/sbin/o2cb_ctl", ["o2cb_ctl", "-H", "-n", "ocfs2", "-t", > "cluster", "-a", "online=yes"], [/* 61 vars */]) = 0 > uname({sys="Linux", node="drwagner", ...}) = 0 > brk(0) = 0x6cf000 > brk(0x6cff70) = 0x6cff70 > arch_prctl(ARCH_SET_FS, 0x6cf8b0) = 0 > brk(0x6f0f70) = 0x6f0f70 > brk(0x6f1000) = 0x6f1000 > open("/sys/o2cb/interface_revision", O_RDONLY) = 3 > read(3, "5\n", 15) = 2 > read(3, "", 13) = 0 > close(3) = 0 > stat("/sys/kernel/config", {st_mode=S_IFDIR|0755, st_size=0, ...}) = 0 > statfs("/sys/kernel/config", {f_type=0x62656570, f_bsize=4096, > f_blocks=0, f_bfree=0, f_bavail=0, f_files=0, f_ffree=0, f_fsid={0, > 0}, f_namelen=255, f_frsize=4096}) = 0 > stat("/etc/ocfs2/cluster.conf", {st_mode=S_IFREG|0644, st_size=331, > ...}) = 0 > open("/etc/ocfs2/cluster.conf", O_RDONLY) = 3 > read(3, "node:\n\tip_port = 7777\n\tip_addres"..., 4000) = 331 > read(3, "", 4000) = 0 > close(3) = 0 > mkdir("/sys/kernel/config/cluster/ocfs2", 0755 > ========= > > As you told me, the last command is not executed at all and the > computer hangs. I tried to execute: > > mkdir -p -m 755 /sys/kernel/config/cluster/ocfs2 > > And guess what? it hangs too =P > > So I guess OCFS2 is not the problem but something with the kernel space. > > What do you think? > > Sunil Mushran escribi?: >> So that looks ok. >> >> Next step would be to do some strac-ing. >> >> Do: >> /etc/init.d/o2cb load >> >> then do: >> strace -ff -o /tmp/out o2cb_ctl -H -n ocfs2 -t cluster -a online=yes >> >> The last command is what "onlines" the cluster. >> >> /tmp/out should have some information. >> >> Miguel Santos N. wrote: >>> Thanks.. >>> >>> I am attaching the file /etc/ocfs/cluster.conf, it is the same file >>> on all the servers. >>> >>> >>> >>> Sunil Mushran escribi?: >>>> Check /etc/ocfs2/cluster.conf on the opensuse cluster. Starting >>>> cluster >>>> includes reading cluster.conf and populating /sys/kernel/config with >>>> the info. Make that file available as-is... as in, not a cut-paste. >>>> >>>> Miguel Santos wrote: >>>>> Hello, >>>>> >>>>> I have a DELL MD3000i, and a couple of servers that I want to >>>>> connect to >>>>> the array. >>>>> >>>>> I have setup the main server with CentOS 5. >>>>> - 2.6.18-53.el5 x86_64 >>>>> - ocfs2-tools-1.2.7-1.el5 >>>>> - ocfs2console-1.2.7-1.el5 >>>>> - ocfs2-2.6.18-53.el5-1.2.8-2.el5 # Kernel Module >>>>> >>>>> And two OpenSuSe 10.3 Servers >>>>> - 2.6.22.5-31-default x86_64 >>>>> - ocfs2-tools-1.2.6-18 >>>>> - ocfs2console-1.2.6-18 >>>>> The module has been already shipped with the kernel (thanks to >>>>> novell) >>>>> >>>>> I had no problems mounting the FS on the CentOS 5 server. >>>>> I can read/write in the FS without problems. >>>>> >>>>> ============ >>>>> [EMAIL PROTECTED] ~]# mounted.ocfs2 -d >>>>> Device FS UUID >>>>> Label >>>>> /dev/sdb1 ocfs2 cffd81fa-7395-433e-912a-d9c9d7235df3 >>>>> Coliseo >>>>> [EMAIL PROTECTED] ~]# mounted.ocfs2 -f >>>>> Device FS Nodes >>>>> /dev/sdb1 ocfs2 arenamexico >>>>> [EMAIL PROTECTED] ~]# mount|grep ocfs2 >>>>> ocfs2_dlmfs on /dlm type ocfs2_dlmfs (rw) >>>>> /dev/sdb1 on /mnt/projects type ocfs2 >>>>> (rw,_netdev,noatime,heartbeat=local) >>>>> ============ >>>>> >>>>> But when I try to start the o2cb service on the Opensuse servers, the >>>>> servers just hang. >>>>> >>>>> ============ >>>>> drwagner:~ # /etc/init.d/o2cb status >>>>> Module "configfs": Loaded >>>>> Filesystem "configfs": Mounted >>>>> Module "ocfs2_nodemanager": Loaded >>>>> Module "ocfs2_dlm": Loaded >>>>> Module "ocfs2_dlmfs": Loaded >>>>> Filesystem "ocfs2_dlmfs": Mounted >>>>> >>>>> drwagner:~ # /etc/init.d/o2cb online ocfs2 >>>>> Loading module "configfs": OK >>>>> Mounting configfs filesystem at /sys/kernel/config: OK >>>>> Loading module "ocfs2_nodemanager": OK >>>>> Loading module "ocfs2_dlm": OK >>>>> Loading module "ocfs2_dlmfs": OK >>>>> Mounting ocfs2_dlmfs filesystem at /dlm: OK >>>>> Starting O2CB cluster ocfs2: >>>>> >>>>> [And it hangs] >>>>> ============= >>>>> >>>>> Does anyone has a clue of what is happening? >>>>> >>>>> Thanks. >>>>> >>>>> >>>>> _______________________________________________ >>>>> Ocfs2-users mailing list >>>>> [email protected] >>>>> http://oss.oracle.com/mailman/listinfo/ocfs2-users >>>>> >>>> >>> >>> ------------------------------------------------------------------------ >>> >>> >>> node: >>> ip_port = 7777 >>> ip_address = 192.168.130.1 >>> number = 0 >>> name = arenamexico >>> cluster = ocfs2 >>> >>> node: >>> ip_port = 7777 >>> ip_address = 192.168.130.2 >>> number = 1 >>> name = suse1 >>> cluster = ocfs2 >>> >>> node: >>> ip_port = 7777 >>> ip_address = 192.168.130.3 >>> number = 2 >>> name = drwagner >>> cluster = ocfs2 >>> >>> cluster: >>> node_count = 3 >>> name = ocfs2 >>> >>> >> > _______________________________________________ Ocfs2-users mailing list [email protected] http://oss.oracle.com/mailman/listinfo/ocfs2-users _______________________________________________ Ocfs2-users mailing list [email protected] http://oss.oracle.com/mailman/listinfo/ocfs2-users
