Re: [Ocfs2-users] Odd error on FC12 with ocfs2

2010-03-29 Thread Sunil Mushran
No On Mar 29, 2010, at 8:10 PM, Angelo McComis ang...@mccomis.com wrote: Does it matter that the nodes are numbered 1-6 instead of 0-5? On Mon, Mar 29, 2010 at 4:25 PM, Sunil Mushran sunil.mush...@oracle.com wrote: Enable some debugging. #debugfs.ocfs2 -l TCP allow ...do mount

Re: [Ocfs2-users] ENOSPC

2010-03-26 Thread Sunil Mushran
Please file a bugzilla. This may involve more people. And it is easier to track that way. David Johle wrote: See attachment for requested output. At 08:54 PM 3/24/2010, Sunil Mushran wrote: Quite a bit of work is ongoing on this front. I'll list all that work in another email. Meanwhile

Re: [Ocfs2-users] download for 2.6.18-164.15.1.el5

2010-03-26 Thread Sunil Mushran
http://oss.oracle.com/projects/ocfs2/files/RedHat/RHEL5/ Devender Narula wrote: Hi Guys i need ocfs2 software for 2.6.18-164.15.1.el5 RHEL 64 bit.. Can anybody please tell me from where i can download it. thanks Devender ___

Re: [Ocfs2-users] Odd error on FC12 with ocfs2

2010-03-25 Thread Sunil Mushran
hmm.. o2cb_ctl makes no connections. It just reads the cluster.conf and populates configfs. AFAIK. David Murphy wrote: We had 6 nodes running CentOS 5.4 using 1.4.3 ocfs2-tools. I decided to rebuild one node with FC12. Which is working fine, however Nmap 192.168.200.112

Re: [Ocfs2-users] ENOSPC

2010-03-24 Thread Sunil Mushran
Quite a bit of work is ongoing on this front. I'll list all that work in another email. Meanwhile make a bz with the stat_sysdir output. We'll need that to determine the best way forward. David Johle wrote: So in light of prior issues with lock contention and such due to writing apache logs

Re: [Ocfs2-users] Can't delete LV snapshot after mounting

2010-03-19 Thread Sunil Mushran
? Thank's Armin -Original Message- From: Sunil Mushran [mailto:sunil.mush...@oracle.com] Sent: Thursday, March 18, 2010 6:28 PM To: Armin Wied Cc: ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] Can't delete LV snapshot after mounting I am queasy recommending

Re: [Ocfs2-users] Can't delete LV snapshot after mounting

2010-03-18 Thread Sunil Mushran
I am queasy recommending such a setup to anyone. It is one thing to handle a workload. The problem is about handling user/admin errors. You are essentially running a local volume manager that is unaware of the other node. Any reconfig that is not coordinated will lead to corruption. Below that

Re: [Ocfs2-users] OCFS2 Multipath Configuration

2010-03-18 Thread Sunil Mushran
Yeah.. mounted is a bit dumb. In the next release, it will recognize /dev/mapper devices. We still need to teach it to handle multipathing fully. David Johle wrote: I'm not sure about why mounted.ocfs2 is showing both the dm and the sd devices for the same volume. But this could all be very

Re: [Ocfs2-users] NFS in D State

2010-03-18 Thread Sunil Mushran
I am assuming you are mounting the nfs mounts with the nordirplus mount option. If not, that is known to deadlock a nfsd thread leading to what you are seeing. There are two possible reasons for this error. One is a dlm issue. Other is a local deadlock like above. To see if the dlm is the cause

Re: [Ocfs2-users] fsck.ocfs2 can't fix an orphaned inode

2010-03-18 Thread Sunil Mushran
One option is to provide me with the o2image of the volume. # o2image -r /dev/sda1 - | bzip2 sda1.out.bz2 File a bugzilla and add the link to that image. (The bz cannot handle large files.) The other option is to file a bz and attach the stat_sysdir output.

Re: [Ocfs2-users] NFS in D State

2010-03-18 Thread Sunil Mushran
- From: Sunil Mushran [mailto:sunil.mush...@oracle.com] Sent: Thursday, March 18, 2010 1:25 PM To: Jaquays, Michael A. Cc: ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] NFS in D State I am assuming you are mounting the nfs mounts with the nordirplus mount option. If not, that is known

Re: [Ocfs2-users] Disk access hang

2010-03-11 Thread Sunil Mushran
. Instead, another node contains the following: Mar 4 04:10:29 node05 kernel: (1861,1):ocfs2_replay_journal:1224 Recovering node 9 from slot 7 on device (152,0) But the ocfs2 disk was unavailable anyway. Any other hint? Regards, G. On Wed, Mar 10, 2010 at 8:56 PM, Sunil Mushran sunil.mush

Re: [Ocfs2-users] Disk access hang

2010-03-10 Thread Sunil Mushran
Were the first set of messages on all nodes? On that node atleast the o2hb node down event fired. It should have fired on all nodes. This is the dlm eviction message. If they all fired, then look for a node to have a message that reads Node x is the Recovery Master for the Dead Node y. That

Re: [Ocfs2-users] POSIX locks supported?

2010-03-10 Thread Sunil Mushran
To get this feature, one needs both a kernel = 2.6.28 and a functioning userspace cluster stack. ocfs2 1.4 on (rh)el5/sles10 does not satisfy either of the two. As of now, there are two such stacks. Pacemaker from Novell and the new CMAN (new is my term... unsure how RH will be marketing it)

Re: [Ocfs2-users] Filesystem ready check...

2010-03-04 Thread Sunil Mushran
It is waiting for the heartbeat timeout to trigger denoting a node death. Then it initiates recovery. With the default settings, this takes about a minute plus. This is with the o2cb cluster stack. Note not all fs ops will hang during the detection phase. Only those ops will hang that directly or

Re: [Ocfs2-users] support xattr, quota on RHEL5 of OCFS2 1.4

2010-03-03 Thread Sunil Mushran
ocfs2 release 1.4 does not have these features. These features (expect quotas) will be in ocfs2 release 1.6. The 1.4 user's guide lists the features included in that release. Elia Pinto wrote: Hi I'd like to know if OCFS2 1.4 on RHEL5 with the REDHAT latest kernel supports disk quotas, the

Re: [Ocfs2-users] I have questions regarding Fencing

2010-03-02 Thread Sunil Mushran
On Mar 2, 2010, at 2:52 AM, Brad Plant bpl...@iinet.net.au wrote: On Tue, 2 Mar 2010 21:51:45 +1100 Brad Plant bpl...@iinet.net.au wrote: On Tue, 2 Mar 2010 05:09:21 -0500 Enrique Sanchez esanchezvela.redhatclus...@gmail.com wrote: During my test (take Node0 down cold turkey) Node1 hung

Re: [Ocfs2-users] prefetch?

2010-03-01 Thread Sunil Mushran
Dmitry Rybin wrote: I have storage 2TB with 4K cluster size. Collected statistic says, that 4K max reading block from disc. But extX - variable read size up to 50K with cluster size 4K. ocfs2 have no prefetch? Centos 5.4, ocfs2-1.4.4-1 Is the workload the same on the two volumes? This

Re: [Ocfs2-users] ocfs2 filesystem R/O in one node but not in the 2nd.

2010-02-19 Thread Sunil Mushran
The volume goes ro when it detects an on-disk corruption. I would imagine the node detected the problem with the group allocator fixed by fsck. dmesg will tell you more. If it mentions block#1709568, then fsck took care of it. Sunil Enrique Sanchez wrote: Hello folks, I have a Filesystem that

Re: [Ocfs2-users] Kernel Panic ocfs2_inode_lock_full

2010-02-18 Thread Sunil Mushran
Yes, this is a known issue. Only occurs when nfs is in the equation. This issue has been fixed in mainline quite some time ago. We are in the process of backporting that to 1.4. michael.a.jaqu...@verizon.com wrote: All, I have a 3 node cluster that is experiencing kernel panics once every few

Re: [Ocfs2-users] OCFS2 + ISCSI SAN slowdown

2010-02-18 Thread Sunil Mushran
That's a poor workload for a clustered file system because it has to master/take clustered locks for each inode. Actually multiple locks per inode. Those locks are used once and then freed. A local fs only takes a read hit. loebber...@eplan.de wrote: Hi Guy, if you have serveral OCFS2

Re: [Ocfs2-users] Kernel Panic ocfs2_inode_lock_full

2010-02-18 Thread Sunil Mushran
- From: Sunil Mushran [mailto:sunil.mush...@oracle.com] Sent: Thursday, February 18, 2010 11:47 AM To: Jaquays, Michael A. Cc: ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] Kernel Panic ocfs2_inode_lock_full Yes, this is a known issue. Only occurs when nfs is in the equation

Re: [Ocfs2-users] 2.6.33-rc8 bug fixes - 2.6.32

2010-02-12 Thread Sunil Mushran
Brad Plant wrote: I noticed that there were a lot of bug fixes in 2.6.33-rc8. Just wondering if any of these are also applicable to 2.6.32 and if they'll be merged into the long term stable branch? If there are no 2.6.32 merge plans, are any of the 2.6.33 commits beneficial to 2.6.32 and

Re: [Ocfs2-users] system crash

2010-02-11 Thread Sunil Mushran
It looks similar to bz#1202. http://oss.oracle.com/bugzilla/show_bug.cgi?id=1202 File a bugzilla with novell. They should be able to look at the objdump and confirm. If so, the fix is already in mainline.

Re: [Ocfs2-users] system crash

2010-02-11 Thread Sunil Mushran
No. Very hard to hit it. Brad Plant wrote: Hi Sunil, Did this also go into the 2.6.32.y stable branch? Cheers, Brad On Thu, 11 Feb 2010 10:50:24 -0800 Sunil Mushran sunil.mush...@oracle.com wrote: It looks similar to bz#1202. http://oss.oracle.com/bugzilla/show_bug.cgi?id=1202

Re: [Ocfs2-users] another fencing question

2010-02-05 Thread Sunil Mushran
is the Recovery Master for the Dead Node 1 for Domain 3AE0B7F3BAB749D09D37DAE16FA38042 Jan 27 15:31:10 nvr2-rc kernel: (6983,1):ocfs2_replay_journal:1183 Recovering node 1 from slot 0 on device (120,1) thanks Nicola In data giovedì 14 gennaio 2010 21:13:15, Sunil Mushran ha scritto

Re: [Ocfs2-users] Strange mounting problem

2010-02-04 Thread Sunil Mushran
Check /etc/fstab. The device associated with /u01 is not the correct one. Norman P. B. Joseph wrote: What would cause OCFS2 filesystems to be mountable via 'ocfs2console', but not via the command line (or /etc/init.d/ocfs2)? The symptoms are: # mount /u01 mount.ocfs2: Bad magic number in

Re: [Ocfs2-users] invalid opcode bug in dlmglue?

2010-02-04 Thread Sunil Mushran
in a couple of months and am wondering if we can use the opportunity to turn on that feature for quicker lookup times. Thanks, Brian Sunil Mushran sunil.mush...@oracle.com 2010-02-04 09:16: Fixed. http://oss.oracle.com/bugzilla/show_bug.cgi?id=1137 You probably already have this patch

Re: [Ocfs2-users] Slow on open()

2010-02-02 Thread Sunil Mushran
While it is recommended that the kernels be the same across the cluster, there is no hard requirement that it should be. The recommendation is purely from ease-of-management point-of-view. If both nodes are seeing periodic i/o slowdowns at the same time, then you should investigate from the

Re: [Ocfs2-users] I/O error on channel?

2010-01-21 Thread Sunil Mushran
It was unable to open the device. The path is provided. It is a device issue. unit0x03 wrote: Hi all, I'm having some problems with a setup of OCFS2 on CLVM on DRBD. CLVM crapped out the other day, and now that I've fixed it, all ocfs2 operations report: I/O error on channel while

Re: [Ocfs2-users] Weird crash

2010-01-19 Thread Sunil Mushran
Bottom line: We are contacting Novell by phone support to check why OCFS2 version isn't the latest in SLES10SP3. Regards, Sérgio Em Tue, 01 Sep 2009 11:14:37 -0700 Sunil Mushran sunil.mush...@oracle.com escreveu: For issues on sles, please file a bug/sr with novell. The issue here

Re: [Ocfs2-users] Getting Closer (was: Fencing options)

2010-01-18 Thread Sunil Mushran
PM, Sunil Mushran sunil.mush...@oracle.com wrote: Do you have netconsole output? We have to determine the reason for the fencing before we can recommend any changes. Angelo McComis wrote: Some more about my setup, which started the discussion... Version info, mount options, etc. are herein

Re: [Ocfs2-users] esx elevator=noop

2010-01-15 Thread Sunil Mushran
The deadline recommendation was for early el4 kernels that had a bug in cfq. That bug was fixed years ago. I am unsure how using noop in guest will trigger starvation. Not that I am recommending it. I have not thought about this much. On Jan 15, 2010, at 9:55 AM, Brian Kroth

Re: [Ocfs2-users] another fencing question

2010-01-14 Thread Sunil Mushran
Mailing List SVR wrote: Hi, periodically one of on my two nodes cluster is fenced here are the logs: Jan 14 07:01:44 nvr1-rc kernel: o2net: no longer connected to node nvr2- rc.minint.it (num 0) at 1.1.1.6: Jan 14 07:01:44 nvr1-rc kernel: (21534,1):dlm_do_master_request:1334 ERROR:

Re: [Ocfs2-users] Fencing options

2010-01-13 Thread Sunil Mushran
The problem was likely storage related and not network related. Do you have netconsole setup? If so, look at the logs. It will tell you as to why that node was fenced. Angelo McComis wrote: After learning more about what fencing means when you see it in action. (the default of

Re: [Ocfs2-users] Fencing options

2010-01-13 Thread Sunil Mushran
Do you have netconsole output? We have to determine the reason for the fencing before we can recommend any changes. Angelo McComis wrote: Some more about my setup, which started the discussion... Version info, mount options, etc. are herein. If there are recommended changes to this, I'm open

Re: [Ocfs2-users] hung process -- sles10 sp2

2010-01-13 Thread Sunil Mushran
- 26609 Ss sshd- 26611 Ss bashwait 26698 S+ gdb wait 26894 Ss sshd- 26896 Ss+ bash- 29881 Ss sshd- 29883 Ss bashwait -Original Message- From: Sunil Mushran [mailto:sunil.mush

Re: [Ocfs2-users] mounted.ocfs2 -f shows Unknown

2010-01-12 Thread Sunil Mushran
4b4cd7d8 e869dccb731433ed 03cb88bc 3: 3 4b4cd79e 2713837530adc63c ffdd6076 At 06:46 PM 1/11/2010, Sunil Mushran wrote: Email me the outputs of the following: # debugfs.ocfs2 -R slotmap /dev/dm-6 # debugfs.ocfs2 -R hb /dev/dm-6 wait 10 seconds. # debugfs.ocfs2 -R hb /dev/dm-6

Re: [Ocfs2-users] mounted.ocfs2 -f shows Unknown

2010-01-11 Thread Sunil Mushran
Email me the outputs of the following: # debugfs.ocfs2 -R slotmap /dev/dm-6 # debugfs.ocfs2 -R hb /dev/dm-6 wait 10 seconds. # debugfs.ocfs2 -R hb /dev/dm-6 David Johle wrote: I was setting up a new 3 node cluster of systems and just so happened to have only 2 of the nodes online after a

Re: [Ocfs2-users] OCFS2 1.4.4 Compiled on Centos5 with OpenVZ Kernel won't work - Unknown symbol in module

2010-01-05 Thread Sunil Mushran
What is this unknown symbol? Also, is openvz compatible to rhel? Which one? If not, then getting it to work will not be trivial. You will be better off using the version of the fs in the mainline linux version that openvz is derived from. Jeffery Soo wrote: Does anyone know how to make ocfs2

Re: [Ocfs2-users] OCFS2 1.4.4 Compiled on Centos5 with OpenVZ Kernel won't work - Unknown symbol in module

2010-01-05 Thread Sunil Mushran
Are you building 1.4.4 on that kernel or force loading the provided modules? On Jan 5, 2010, at 3:33 PM, Jeffery Soo j...@realtechtalk.com wrote: Thanks Sunil, here is the log: kernel: ocfs2_dlmfs: disagrees about version of symbol struct_module Sunil Mushran wrote: What is this unknown

Re: [Ocfs2-users] No space left on device

2009-12-17 Thread Sunil Mushran
Brad Plant wrote: Ok. Do you intend on submitting this patch for inclusion in previous mainline kernel versions? Or do you think we will be able to patch older kernel versions (2.6.28+) ourself? We always push patches to the current mainline version. Older mainline kernels are handled on a

Re: [Ocfs2-users] No space left on device

2009-12-16 Thread Sunil Mushran
Brad Plant wrote: I have this happen on different file systems running different mainline kernel versions created with different versions of ocfs2-tools. I've seen 1 file system fill up with only 40% odd in use according to df. What's interesting is that while 1 node can't write anymore,

Re: [Ocfs2-users] No space left on device

2009-12-16 Thread Sunil Mushran
Brad Plant wrote: Thanks for the reply Sunil. I read through the 2 links you provided, but I have a question. The info mentioned modifications to mkfs to pre-allocate some area for extent stuff. Does this mean that we would have to create a new FS to fix the problem or will there also be a

Re: [Ocfs2-users] Combining OCFS2 with Linux software RAID-0?

2009-12-14 Thread Sunil Mushran
That's old. sles11 has added the pacemaker cluster stack that works with clvm. Luis Freitas wrote: Brian, Hmm, I was not aware of this. Seems Novel uses other volume manager, called EVMS, not CLVM (?). From: http://wiki.novell.com/index.php/Linux_Data_Management Some

Re: [Ocfs2-users] Combining OCFS2 with Linux software RAID-0?

2009-12-10 Thread Sunil Mushran
It should work. All ocfs2 needs is a stable shared storage that is consistent across all nodes all the time. Patrick J. LoPresti wrote: Is it possible to run an OCFS2 file system on top of Linux software RAID? Here is my situation. I have four identical disk chassis that perform hardware

Re: [Ocfs2-users] mixed mode mount options?

2009-12-07 Thread Sunil Mushran
mount (we are using Redhat)? If a ro node crashes, will it be fenced off the cluster just like a rw node causing a brief jitter? Any doc that you can refer me to for details? Thank you for your help! Billy --- On Sun, 12/6/09, Sunil Mushran sunil.mush...@oracle.com wrote

Re: [Ocfs2-users] OCFS2 New features list and RHEL5 packages

2009-12-07 Thread Sunil Mushran
Wessel wrote: Looking at the New features list (http://oss.oracle.com/osswiki/OCFS2/NewFeaturesList) I've noticed that a lot of new features are marked as requiring a 2.6.2x or 2.6.3x kernel. As we're keen on using vendor-provided kernels I was wondering what features (namely noatime /

Re: [Ocfs2-users] mixed mode mount options?

2009-12-06 Thread Sunil Mushran
nodes? In other words, will a ro node be a member of the cluster and sending/ receiving heartbeat? I would image a ro node should not affect the cluster in any way should the node fail/crash, but I am not sure if it is correct. Billy --- On Fri, 12/4/09, Sunil Mushran sunil.mush

Re: [Ocfs2-users] mixed mode mount options?

2009-12-04 Thread Sunil Mushran
Billy wrote: Does anyone know if OCFS2 supports mixed mounts? We have 4-node OCFS2 cluster, is it possible for one node to mount read only and the other 3 nodes mount -rw on a same volume? Sure. ___ Ocfs2-users mailing list

Re: [Ocfs2-users] v1.4 and kernel 2.6.24

2009-11-18 Thread Sunil Mushran
No. And you don't need to. The mainline kernel includes ocfs2. The one in 2.6.24 is quite similar to 1.4. On Nov 18, 2009, at 3:05 AM, Dietmar Maurer diet...@proxmox.com wrote: Hi all, is it possible to compile v1.4 with kernel 2.6.24? I get: CC [M]

Re: [Ocfs2-users] v1.4 and kernel 2.6.24

2009-11-18 Thread Sunil Mushran
wrote: We are trying to address this issue in the Proxmox 2.6.24 kernel. http://www.mail-archive.com/ocfs2-users@oss.oracle.com/msg03657.html Is there a patch that might apply to 2.6.24 that would resolve this? Thanks. Bret. On Wednesday 18 November 2009 08:45:40 am Sunil Mushran wrote

Re: [Ocfs2-users] Cannot set heartbeat dead threshold

2009-11-13 Thread Sunil Mushran
Ping Novell Support. The base code works fine. This could be a bug in sles10 sp2. voino...@gmail.com wrote: Hi I have: SLES 10 SP2 (2.6.16.60-0.21-smp) ocfs2-tools-1.4.0-0.3 ocfs2console-1.4.0-0.3 and I can't change heartbeat dead threshold value. Content of /etc/sysconfig/o2cb: #

Re: [Ocfs2-users] how to build rpm packages?

2009-11-05 Thread Sunil Mushran
you need to install the devel packages for all kernel flavors for that version. kernel-devel, kernel-xen-devel and whatever else is valid for that distro/arch. Kristiansen Morten wrote: Hi, I found an url which describes how to build rpm packages from ocfs2 source code:

Re: [Ocfs2-users] Not able to start ORACLE on failover node

2009-11-03 Thread Sunil Mushran
The error states that two control files are not consistent. It is not that a file is inconsistent on two nodes. Ping Oracle support. Wilfredo Ocasio wrote: OCFS2 FS seems to have inconsistency. Oracle will only start on NODE 2. NODE 1 fails with ORA-00214. Any help would be appreciated. +

Re: [Ocfs2-users] A free space trouble in a ocfs2 partition

2009-10-27 Thread Sunil Mushran
ocfs2 and kernel version? My guess is that this is because the orphaned inodes are not getting deleted. This problem has been fixed in the mainline kernel (2.6.31). For enterprise users, the fix was provided with OCFS2 1.4.4 for (RH)EL. SLES10/11 users should upgrade to the latest kernel errata

Re: [Ocfs2-users] A free space trouble in a ocfs2 partition

2009-10-27 Thread Sunil Mushran
aditional steps to fix the trouble? Or with the ocfs2 upgrade is just enough ? Thank you! Regards, Pablo Sunil Mushran escribió: ocfs2 and kernel version? My guess is that this is because the orphaned inodes are not getting deleted. This problem has been fixed in the mainline kernel

Re: [Ocfs2-users] /etc/rc.local and /etc/fstab

2009-10-27 Thread Sunil Mushran
Ensure you have the ocfs2 service enabled so that shutdown will explicitly umount the fs before shutting down the network. McKinley, Reid wrote: Upon system boot, is it ok to mount OCFS2 mounts from /etc/rc.local rather than /etc/fstab ? Are there any downsides to using rc.local that

Re: [Ocfs2-users] Unable to mount cluster on CentOS and Ubunut at the same time

2009-10-21 Thread Sunil Mushran
The production release of ocfs2 (1.2, 1.4, and the upcoming 1.6) is only available for (rh)el and sles. No other distributions. David Murphy wrote: I think I found the core issue… The DLM on Centos is running 1.4.1, but on Ubuntu its 1.3.3, I can’t seem to find any packages for debian or

Re: [Ocfs2-users] Unable to mount cluster on CentOS and Ubunut at the same time

2009-10-21 Thread Sunil Mushran
Message- From: ocfs2-users-boun...@oss.oracle.com [mailto:ocfs2-users-boun...@oss.oracle.com] On Behalf Of Sunil Mushran Sent: Wednesday, October 21, 2009 12:23 PM To: David Murphy Cc: ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] Unable to mount cluster on CentOS and Ubunut

Re: [Ocfs2-users] FW: Oracle 9204 installation on linux x86-64 on ocfs

2009-10-21 Thread Sunil Mushran
Appears the volumes are not mounted with the datavolume mount option. Refer to the FAQ for more in this. On May 4, 2009, at 7:37 AM, keyur patel keyur.pa...@oracle.com wrote: Hello All, I have installed Oracle Cluster Manager on linux x86-64 nit. I am using ocfs file system for quorum

Re: [Ocfs2-users] ssh user equivalence

2009-10-20 Thread Sunil Mushran
McKinley, Reid wrote: We are getting some pushback from our SA group regarding root having ssh user equivalence on the cluster where we run OCFS2. Is it required to have the ssh user equivalence setup for the root user when running OCFS2 ? The only place this comes in handy is

Re: [Ocfs2-users] ssh user equivalence

2009-10-20 Thread Sunil Mushran
. -Original Message- From: Sunil Mushran [mailto:sunil.mush...@oracle.com] Sent: Tuesday, October 20, 2009 2:56 PM To: McKinley, Reid Cc: ocfs2-users@oss.oracle.com; Ramayani, Atul Subject: Re: [Ocfs2-users] ssh user equivalence McKinley, Reid wrote: We

Re: [Ocfs2-users] Multiple LUNs and ocfs2

2009-10-20 Thread Sunil Mushran
Currently ocfs2 does not support any lvm. It has to do with having a common cluster stack. That is still a work-in-progress. Bill Ranck wrote: Hello folks, I am a newbie with ocfs2 and Oracle VM in general, but I'm slowly figuring things out. I have installed ovm server on 2 blades in an

Re: [Ocfs2-users] ocfs2 control

2009-10-07 Thread Sunil Mushran
ocfs2 comes with its own clusterstack, o2cb. It does not need linux ha. Wilfredo Ocasio wrote: Hi, new to ocfs2, I have a 2 node Linux HA going and I need to know how to mount ocfs2 outside of HA control. It seems that only when you start ‘heartbeat’ will ocsf2 filesystem mounts. Any help

Re: [Ocfs2-users] Panic

2009-10-07 Thread Sunil Mushran
is not the NFS server. How should I proceed? What do you suggest? Could this happen again? On Wed, Oct 7, 2009 at 8:16 PM, Sunil Mushran sunil.mush...@oracle.com mailto:sunil.mush...@oracle.com wrote: And does the node exporting the volume encounter the oops? If so, the likeliest

Re: [Ocfs2-users] Two node performance question

2009-10-05 Thread Sunil Mushran
that will be good to OCFS2 and IO. Thanks, Bruce A Leggett Director, IT Systems Amscot Financial 813-637-6283 -Original Message- From: Sunil Mushran [mailto:sunil.mush...@oracle.com] Sent: Friday, September 18, 2009 4:30 PM To: Bruce Leggett Cc: 'ocfs2-users@oss.oracle.com

Re: [Ocfs2-users] OCFS2 1.4 Problem on SuSE

2009-09-29 Thread Sunil Mushran
version of it is actually in the running kernel? Thanks, Angelo On Mon, Sep 28, 2009 at 2:35 PM, Sunil Mushran sunil.mush...@oracle.com wrote: Ping Novell for issues on SLES10. The error suggests that you are encountering novell bz#524683. This has been addressed in ocfs2 1.4.4. Ping

Re: [Ocfs2-users] OCFS2 Upgrade

2009-09-25 Thread Sunil Mushran
The ocfs2 and asm kernel drivers need to match the kernel version. If you have booted the box with the new kernel, do the following to learn package name you have to download and install. # echo ocfs2-`uname -r`-1.2.X-X.el4 # echo oracleasm-`uname -r`-2.0.X-1.el4 Then goto the sites and

[Ocfs2-users] OCFS2 1.4.4-1 and OCFS2 Tools 1.4.3-1 released

2009-09-25 Thread Sunil Mushran
All, We are pleased to announce the release of OCFS2 1.4.4-1 and OCFS2 Tools 1.4.3-1 for Oracle's and Red Hat's Enterprise Linux 5 Update 2 and higher. Oracle's Unbreakable Linux Network users who are subscribing to the OCFS2 1.4 packages for Enterprise Linux 5 channel can upgrade to this

Re: [Ocfs2-users] OCFS2 Trace

2009-09-24 Thread Sunil Mushran
So a read on some file on a xfs volume, triggered a mem alloc which inturn triggered the kernel to free up some memory. The oops happens when it is trying to free up an ocfs2 inode. Do: # cat /proc/sys/kernel/panic_on_oops If this returns 0, do: # echo 1 /proc/sys/kernel/panic_on_oops This is

Re: [Ocfs2-users] OCFS2 Trace

2009-09-24 Thread Sunil Mushran
notice that the Cache Buffers on all the nodes are exhausted . This oops only happens currently on one of the nodes. I am relucatnt to force a reboot on oops. Is this a must? Thanks Laurence On Thu, Sep 24, 2009 at 8:06 PM, Sunil Mushran sunil.mush...@oracle.com

Re: [Ocfs2-users] OCFS2 Trace

2009-09-24 Thread Sunil Mushran
kernel. All bugs are fixed in enterprise and mainline. Critical bugs are fixed in stable-kernel(s). Laurence Mayer wrote: ok thank you. You mentioned the kernel being old, which kernel would you recommend at this point? On Thu, Sep 24, 2009 at 8:42 PM, Sunil Mushran sunil.mush

Re: [Ocfs2-users] strange fencing behavior

2009-09-24 Thread Sunil Mushran
You have to look at the logs of the fenced nodes. Setup netconsole to trap the kernel logs. On a side note, the logs show you are hitting the following bugzilla. http://oss.oracle.com/bugzilla/show_bug.cgi?id=1053 Upgrade the tools to 1.4.2. Sean Thon wrote: I have 10 servers in a cluster

Re: [Ocfs2-users] Problem while setting up new system

2009-09-23 Thread Sunil Mushran
The fs driver is probably does not match the kernel version. Do: # echo ocfs2-`uname -r`-1.4.1-1.el5 That's the package that should be installed. Devender Narula wrote: Hi guys I got brand new system with RHEL 5. 3 I install below mention rpm packages rwxrwxrwx 1 root root 1427749

Re: [Ocfs2-users] another node is heartbeating in our slot

2009-09-23 Thread Sunil Mushran
You cannot share a device between two different clusters. Florin Andrei wrote: OCFS2 cluster, two nodes, nothing fancy: # [r...@serv1 ~]# cat /etc/ocfs2/cluster.conf node: ip_port = ip_address = 10.10.20.64 number = 0

Re: [Ocfs2-users] another node is heartbeating in our slot

2009-09-23 Thread Sunil Mushran
umount the fs on all nodes and run fsck.ocfs2 -f. Florin Andrei wrote: The underlying SAN LUN was temporarily attached to serv3, and was mounted. The question is - what are the recommended steps to repair the filesystem? Sunil Mushran wrote: You cannot share a device between two

Re: [Ocfs2-users] degrading performance

2009-09-20 Thread Sunil Mushran
The inode locality patches were added to resolve that performance issue. That was in 2.6.30, iirc. On Sep 19, 2009, at 9:41 PM, will whans...@sbcglobal.net wrote: I've been playing with ocfs2 in local mode, and was really suprised how fast it was. I was playing with extracting a linux

Re: [Ocfs2-users] Two node performance question

2009-09-18 Thread Sunil Mushran
Bruce Leggett wrote: Hi, I’m new to OCFS2 and have a question about my setup. I have two SLES 10 nodes (no patches yet) and a single OCFS2 volume. The volume receives live data via replication to keep it up to date. The goal is to have node one receive replicated date and all nodes run

Re: [Ocfs2-users] Two node performance question

2009-09-18 Thread Sunil Mushran
Bruce Leggett wrote: The FS performance is only slow from one node. Typically, writes and reads look good one node at a time, but not two. If concurrent io to the shared device is slow, then this would be expected. So your recommendation is I need to get throughout/IO boosted at the

Re: [Ocfs2-users] FREEZE x mount.ocfs2

2009-09-17 Thread Sunil Mushran
The freeze functionality has not pushed upstream as yet. We will do so in the coming months. Jeronimo Bezerra wrote: Hello All, I´m trying to test the new feature FREEZE from kernel 2.6.29 to snapshot (using Flash Copy in IBM Storage) a consistent filesystem state. Using a EXT3 the mount

Re: [Ocfs2-users] 10 Node OCFS2 Cluster - Performance

2009-09-15 Thread Sunil Mushran
200Megs) concurrently on 10 or more nodes to compare results? Thanks again Laurence Sunil Mushran wrote: Always cc ocfs2-users. Strange. The ocfs2 numbers look incomplete. It shows only 200MB written. You said it was taking 16 secs. Yet the iostat numbers are for 8 secs only

Re: [Ocfs2-users] 10 Node OCFS2 Cluster - Performance

2009-09-15 Thread Sunil Mushran
:1048576000 bytes (1.0 GB) copied, 54.2202 s, 19.3 MB/s real 1m6.979s user 0m0.010s sys 0m0.010s Do you think the hardware cannot handle the load? On Tue, Sep 15, 2009 at 7:53 PM, Sunil Mushran sunil.mush...@oracle.com mailto:sunil.mush...@oracle.com wrote: All clusters are running release

Re: [Ocfs2-users] 10 Node OCFS2 Cluster - Performance

2009-09-15 Thread Sunil Mushran
On Tue, Sep 15, 2009 at 8:58 PM, Sunil Mushran sunil.mush...@oracle.com mailto:sunil.mush...@oracle.com wrote: Is the o2cb interconnect and iscsi sharing the same network channel? Laurence Mayer wrote: *1 x Node*: r...@n1 mailto:r...@n1 mailto:r...@n1:~# dd if=/dev

Re: [Ocfs2-users] Mounting multiple ocfs2-volumes and disk heartbeat

2009-09-14 Thread Sunil Mushran
Sure. The default o2cb heartbeat is tied to the mounts. So in your case it will hb on both paths. Pickel, Gerd - Betax-Systems GmbH (IT) wrote: Hi, at the moment we run a two node cluster on SLES10 SP2 and OCFS2 1.4 with a Sun StorageTek 2540FC as shared disk. For performance reasons we

Re: [Ocfs2-users] kernel panic - BUG at dlmrecovery.c:2197

2009-09-14 Thread Sunil Mushran
Please can you file a bugzilla. http://oss.oracle.com/bugzilla Attach the netconsole logs from all nodes. Also mention some info about your cluster... num nodes, arch, mem, etc. The oops is due to an over zealous BUG_ON. Not really required. But it does hint at a possible race in o2dlm. File

Re: [Ocfs2-users] 10 Node OCFS2 Cluster - Performance

2009-09-14 Thread Sunil Mushran
On Mon, Sep 14, 2009 at 10:15 PM, Sunil Mushran sunil.mush...@oracle.com mailto:sunil.mush...@oracle.com wrote: Add a sync. Both utils are showing very little io. And do the same for runs on both ocfs2 and xfs. # dd if... ; sync; Laurence Mayer wrote: Here

Re: [Ocfs2-users] Hidden Files

2009-09-10 Thread Sunil Mushran
hidden file? Are you encountering an issue whereby you are not recovering space after you have deleted a file? If so, then that is a known issue in 1.4.1. Part solution is to upgrade to 1.4.2. The full solution will be available with OCFS2 1.4.4 that is currently in testing. If you have

[Ocfs2-users] Quick update regarding (RH)EL5 U4

2009-09-09 Thread Sunil Mushran
All, Just to let you all know that packages for OCFS2 1.4.2-1 are available for EL5 U4 and RHEL5 U4. http://oss.oracle.com/pipermail/el-errata/2009-September/001140.html http://oss.oracle.com/projects/ocfs2/files/RedHat/RHEL5/ We will not be releasing packages for OCFS2 1.2 for (RH)EL5 U4. Users

Re: [Ocfs2-users] OCFS2 vs Oracle ASM CFS

2009-09-03 Thread Sunil Mushran
Jeremy Schneider wrote: Some quick questions for the ocfs2-users list, now that 11.2 is public. (Now that it's public knowledge that Oracle developed a whole new [propriatery] cluster filesystem from the ground up for RAC.) 1) Seems that the 11.2 docs encourage using ACFS rather than OCFS2

Re: [Ocfs2-users] Weird crash

2009-09-01 Thread Sunil Mushran
For issues on sles, please file a bug/sr with novell. The issue here is insufficient journal credits. It _could_ be that this version is missing mainline git commit e051fda4fd14fe878e6d2183b3a4640febe9e9a8. But I don't know. Novell Support will be better placed to track down the issue. Sérgio

Re: [Ocfs2-users] OCFS2 and Xen interoperability issue

2009-08-25 Thread Sunil Mushran
Late this year, early next year. Gonçalo Borges wrote: Hi Sunil Thanks for the reply. What's the time frame to release OCFS2 1.6? Cheers Goncalo On 08/25/2009 05:13 PM, Sunil Mushran wrote: So this is a known issue on OCFS2 1.4/(RH)EL5 combination. As in, this will _work_ on OCFS2 1.2

Re: [Ocfs2-users] Clear Node

2009-08-25 Thread Sunil Mushran
Can you describe the mount lock? You don't have to limit the mount to just one node. Have both nodes mount the volume but run mysql only on one node only. Sunil James Devine wrote: I am trying to make a mysql standby setup with 2 machines, one primary and one hot standby, which both share

Re: [Ocfs2-users] more ocfs2_delete_inode dmesg questions

2009-08-24 Thread Sunil Mushran
So a delete was called for some inodes that had not been orphaned. The pre-checks detected the same and correctly aborted the deletes. No harm done. No, the messages do not pinpoint the device. It's something we discussed adding, but have not done it as yet. Next time this happens and you can

Re: [Ocfs2-users] OCFS2 and Xen interoperability issue

2009-08-24 Thread Sunil Mushran
ocfs2 and kernel versions? Gonçalo Borges wrote: Hi All... I'm testing a Xen solution on an OCFS2 SAN to store VM images, and I'm observing an interoperability issue between two kinds of softwares. I've already tried to obtain some feedback from Xen experts without any success. Maybe I'm

Re: [Ocfs2-users] Ghost files in OCFS2 filesystem

2009-08-21 Thread Sunil Mushran
2.6.31-rc1. Brian Kroth wrote: I didn't see this in the bug list. Which mainline release is this fixed in? Thanks, Brian Sunil Mushran sunil.mush...@oracle.com 2009-08-20 17:46: Yes, this is a known issue in OCFS2 1.4.1 and 1.4.2. That is assuming no process in the cluster has

Re: [Ocfs2-users] Ghost files in OCFS2 filesystem

2009-08-20 Thread Sunil Mushran
Yes, this is a known issue in OCFS2 1.4.1 and 1.4.2. That is assuming no process in the cluster has that file open. We have the fix. It will be available with 1.4.3 which is in testing. This was discussed in the email announcing the 1.4.2 release.

Re: [Ocfs2-users] Kernel Version

2009-08-14 Thread Sunil Mushran
Sure. Raheel Akhtar wrote: Hi, I have existing OCFS2 cluster (1.4.2-1.el5) with kernel version RedHat 5.3 Enterprise kernel version 2.6.18-128.1.16.el5. I am going to add 2 new nodes with RedHat 5.3 enterprise kernel version 2.6.18-128.el5. My question Can I add 2 nodes with

Re: [Ocfs2-users] Shutdown to single user mode causes SysRq Reset

2009-08-13 Thread Sunil Mushran
This is a feature. ;) If you have mounted a volume on two or more nodes, the expectation is that the private interconnect will always remain up. If you shutdown the network on a node, the cluster stack will have to kill a node. It does so inorder to prevent hangs in cluster operations. In a 2

Re: [Ocfs2-users] ocfs2 configuration/performance questions...

2009-08-03 Thread Sunil Mushran
Peter W. Morreale wrote: Hi all, I'm trying to determine the performance implications of various configurations for ocfs2. (I'm new to ocfs2, but have read through all the docs for both 1.2 and 1.4, so please be gentle :) This would be a 1.4 installation. I searched through

<    1   2   3   4   5   6   7   8   9   10   >