Re: [Ocfs2-users] future of ocfs2

2009-02-06 Thread Sunil Mushran
One point I missed out earlier is that ocfs2 should not be viewed as an Oracle only product. When the fs was pushed into the mainline kernel, it was because we wanted it to become a community project. Like Linux itself. Until some time ago, we would credit non-oracle people who contributed patches

Re: [Ocfs2-users] Copying data from ocfs2 to an ocfs volume

2009-02-06 Thread Sunil Mushran
Unsure what the confusion is. The answer was correct. fscat can read certain unmounted fs. The list is mentioned on the home page. ext2, ext3, ocfs and ocfs2. It does not write to these file systems when they are unmounted. It can write to any mounted file system. Mehmet Can ÖNAL wrote: > Hi ever

Re: [Ocfs2-users] help with adding a node....

2009-02-06 Thread Sunil Mushran
Node 0 does not know about this node. Run the o2cb_ctl command on node 0 to add a node to a live cluster as listed in 1.4 user's guide. Andrew Deagman wrote: > I am receiving the following error when I attempt to add a node to the > cluster: > > o2net: connected to node bapp02 (num 5) at 10.10.16

Re: [Ocfs2-users] RAC OS Migration

2009-02-06 Thread Sunil Mushran
Ask Novell whether you should upgrade or reinstall the OS. It is afterall a large jump. From sles9 sp3 to sles10 sp2. But when you do the change, install the latest sles10 sp2 kernel. The file system can be used as is. There is no need to enable new features immediately. Most of them are not usef

Re: [Ocfs2-users] ASM over OCFS2 vs. Standard locally managed tablespaces

2009-02-07 Thread Sunil Mushran
OTN has a forum for ASM. Please post this qs on that forum. On Feb 7, 2009, at 6:51 AM, Karim Alkhayer wrote: > Hello All, > > > > Are there any benchmarks with respect to performance with respect to > ASM over OCFS2 vs. standard locally managed tablespaces? > > In our environment, data files

Re: [Ocfs2-users] Local mount in 1.4?

2009-02-09 Thread Sunil Mushran
Bruce MacDonald wrote: > I'm playing around with OCFS2 as a local filesystem, as an alternative > to using XFS for a multi-terabyte database. Using OEL 5.2 from the > demo DVD, I download the latest OCFS2 1.4 packages and format the > filesystem, but when I try to mount it, I get a "file not fo

Re: [Ocfs2-users] Local mount in 1.4?

2009-02-09 Thread Sunil Mushran
at 13:41, Sunil Mushran wrote: > >> Bruce MacDonald wrote: >>> I'm playing around with OCFS2 as a local filesystem, as an >>> alternative to using XFS for a multi-terabyte database. Using OEL >>> 5.2 from the demo DVD, I download the latest OCFS2 1.4 packa

Re: [Ocfs2-users] forcing ocfs2 NOT to reboot the server

2009-02-11 Thread Sunil Mushran
No, this is not configurable. We have to fence else the processes will hang. From your description it appears it is rebooting because the hb ios are not completing within the timeout. What is your current setting? O2CB_HEARTBEAT_THRESHOLD in /etc/sysconfig/o2cb. Mehmet Can ÖNAL wrote: > > *Hi ev

Re: [Ocfs2-users] No space left in device

2009-02-11 Thread Sunil Mushran
bug#6802646 The same fix is available in 1.2.9-1. http://oss.oracle.com/projects/ocfs2/news/article_18.html The bugzilla#1023 describes the problem. The above fix helps but does not fix the core issue entirely. We are working on further improving inode allocations. But the reality is, as long as w

Re: [Ocfs2-users] Local mount in 1.4?

2009-02-12 Thread Sunil Mushran
T deny SUPER off; > [r...@oeltest ~]# tail -3 /var/log/messages > Feb 11 23:37:13 OELTEST kernel: sdb: Write Protect is off > Feb 11 23:37:13 OELTEST kernel: SCSI device sdb: drive cache: write back > Feb 11 23:37:13 OELTEST kernel: sdb: sdb1 > > Any ideas? As I said before, th

Re: [Ocfs2-users] forcing ocfs2 NOT to reboot the server

2009-02-12 Thread Sunil Mushran
eason i asked this > question. Is there any tip or tricks that you would give? > > > > > > > > > > > > > > > -Original Message- > From: Sunil Mushran [mailto:sunil.mush...@oracle.com] > Sent: Wednesday, February 11, 2009 7:36 PM > To: Mehmet Can ÖNAL > Cc: ocfs2-users@oss.

Re: [Ocfs2-users] no free slots available

2009-02-12 Thread Sunil Mushran
$ tunefs.ocfs2 -N /dev/sdc1 Check the man pages for mkfs.ocfs2 and tunefs.ocfs2. You can also refer to the user's guide and the faq. Carl J. Benson wrote: > Hello. > > I'm trying to add a fifth openSUSE 11.1 node to my existing > cluster of four nodes. My software is: > > ocfs2-tools-1.4.1-6.9

Re: [Ocfs2-users] Local mount in 1.4?

2009-02-13 Thread Sunil Mushran
t; Thanks again for your help. Still nothing in dmesg. > > I did get 2 trace files, which I'm sending you separately. > > Cheers, >Bruce > > Sunil Mushran wrote: >> Do: >> >> $ dmesg -n8 >> $ debugfs.ocfs2 -l ENTRY EXIT SUPER all

Re: [Ocfs2-users] GFS2/OCFS2 scalability

2009-02-20 Thread Sunil Mushran
Andreas Dilger wrote: > On Feb 20, 2009 20:23 +0300, Kirill Kuvaldin wrote: >> I'm evaluating different cluster file systems that can work with large >> clustered environment, e.g. hundreds of nodes connected to a SAN over >> FC. >> >> So far I looked at OCFS2 and GFS2, they both worked nearly th

Re: [Ocfs2-users] GFS2/OCFS2 scalability

2009-02-23 Thread Sunil Mushran
Kirill Kuvaldin wrote: > What are the practical/theoretical limits for number of nodes for > shared disk file systems like ocfs2/gfs2? Theoretical limit is around 254 or so. Practical limit depends on the hardware. Meaning, you cannot just add nodes. You have to ensure the interconnect and the sto

Re: [Ocfs2-users] cluster.conf

2009-03-09 Thread Sunil Mushran
On Mon, Mar 09, 2009 at 11:11:13AM -0600, Bret Palsson wrote: > Can I have more than 255 nodes specified in the cluster.conf? No. 255 is the limit. > > I need to have 255 nodes on one volume which I have already setup, but > I have 3 more volumes that won't be connected by any of those 255 >

Re: [Ocfs2-users] Experiencing occasional system crashes with RHEL5 and ocfs2 1.2.9

2009-03-09 Thread Sunil Mushran
Known issue. We have a potential fix for it. It is in testing. How often do you hit this? On Mon, Mar 09, 2009 at 06:52:57PM -0400, Ward Fenton wrote: >We have been experiencing unplanned outages on a subset of the >clustered systems we have deployed to support SAP. The following >cap

Re: [Ocfs2-users] cluster rebooting

2009-03-13 Thread Sunil Mushran
Impossible to determine the cause with what you have provided. File a bugzilla and attach messages from all nodes. No exceptions. If you have netconsole setup (you should) attach those logs. That way we'll know if the nodes oopsed and if so what the stack was. Sunil On Fri, Mar 13, 2009 at 03:01:

Re: [Ocfs2-users] ocfs2_wait_for_mask

2009-03-24 Thread Sunil Mushran
Setup a netconsole server to catch the oops trace. Have you set /sys/kernel/panic_on_oops to 1? On Tue, Mar 24, 2009 at 06:09:22PM +0200, Cristian Gae wrote: > Hello > > We have a 9 nodes ocfs2 cluster used for http serving. > > Sometimes when we want to reboot one of the nodes, it happens to ke

Re: [Ocfs2-users] Upgrading debian etch to lenny causes host crash in a vmware.server 2.0 environment

2009-03-30 Thread Sunil Mushran
Setup netconsole to catch the oops log. The message you have provided shows the node death detection only. Not the cause of the the node death. The netconsole log of the oopsed node will tell us as to why it oopsed. Christoph Ackermann wrote: > Hello. > > We used a ten host cluster for a vmware-s

Re: [Ocfs2-users] problem stopping o2cb service on one of nodes

2009-04-03 Thread Sunil Mushran
umount is supposed to stop the heartbeat. In bz1053, ocfs2_hb_ctl was segfaulting. Are you seeing any segfaults or any other errors during umount? Also, run the following before and after umount: $ ocfs2_hb_ctl -I -d /dev/sdX o2cb Email me the output. Nikola Ciprich wrote: > Hello Tao, > and th

Re: [Ocfs2-users] problem stopping o2cb service on one of nodes

2009-04-03 Thread Sunil Mushran
_ctl -I -d /dev/vgshared/lvs > 2A5D351D0A934061BBC6B5392A30187E: 1 refs > [r...@vbox4 ~]# umount /home/LVS > [r...@vbox4 ~]# ocfs2_hb_ctl -I -d /dev/vgshared/lvs > 2A5D351D0A934061BBC6B5392A30187E: 1 refs > > nik > > On Fri, Apr 03, 2009 at 10:21:33AM -0700, Sunil Mushran wrote: > >> umount i

Re: [Ocfs2-users] problem stopping o2cb service on one of nodes

2009-04-05 Thread Sunil Mushran
t - the binary is there... > n. > > On Fri, Apr 03, 2009 at 02:27:34PM -0700, Sunil Mushran wrote: >> Do: >> $ cat /proc/sys/fs/ocfs2/nm/hb_ctl_path >> >> >> Nikola Ciprich wrote: >>> Hi Sunil, >>> thanks for reply.. >>> I don't o

Re: [Ocfs2-users] problem stopping o2cb service on one of nodes

2009-04-06 Thread Sunil Mushran
; Anyways Sunil thanks a lot for Your help! > > On Sun, Apr 05, 2009 at 07:31:52AM -0700, Sunil Mushran wrote: > >> Email me the ouput of: >> $ mounted.ocfs2 -d >> >> Also, does hb stop using uuid work? >> $ ocfs2_hb_ctl -K -u o2cb >> >> Lastly,

Re: [Ocfs2-users] Encountered disk I/O error 19502

2009-04-06 Thread Sunil Mushran
Yes. This issue has nothing to do with taf or asm. You are seeing transient EIOs during archiver writes. Follow Srini's suggestion of filing a SR and then pinging him. The other issue about instances altering the service names is unrelated. Atleast we should consider it unrelated as they have noth

Re: [Ocfs2-users] problem stopping o2cb service on one of nodes

2009-04-13 Thread Sunil Mushran
s version 95, which doesn't > create them for me. > Version 127 I used for tests already creates them correctly.. > BR > nik > > On Mon, Apr 06, 2009 at 12:08:34PM -0700, Sunil Mushran wrote: > >> AFAIK, this is not an issue on (rh)el4/sles9. It could be that

Re: [Ocfs2-users] BUG: soft lockup - CPU#1 stuck for 61s

2009-04-20 Thread Sunil Mushran
File a bugzilla (oss.oracle.com/bugzilla) for this issue. Attach the stack trace. Also attach the output of the following. $ find /lib/modules/`uname -r`/kernel/fs/ocfs2 -name \*.ko -exec objdump -DSl {} >/tmp/ocfs2.syms \; Sunil Konstantin Tikhonov wrote: > Нi, > I have a cluster with 5 nodes

Re: [Ocfs2-users] kernel panic - ocfs2-1.4.1 - redhat EL5 2.6.18-92.el5xen

2009-04-24 Thread Sunil Mushran
Please file a bugzilla. Add this stack trace to it. http://oss.oracle.com/bugzilla Also add any detail about your environment that you feel could be relevant. Size of cluster, number of mounts, etc. Thanks Sunil James Masson wrote: > Hi list, > > We've just had a kernel panic/reboot on one of ou

Re: [Ocfs2-users] Multiple clusters per node?

2009-04-27 Thread Sunil Mushran
While a node can only be in one cluster, it can be in many different dlm domains (or lock name spaces), each of which can have a different collection of nodes. But in the end, the sum total of all nodes in all domains will still part of one cluster. What are you trying to do? Sunil Søren Kröger

Re: [Ocfs2-users] Multiple clusters per node?

2009-04-28 Thread Sunil Mushran
Søren Kröger wrote: > I'm trying to split up our big OCFS2 filesystem into 3 separate LUN's, > since there are only a limited amount of nodes which need access to > the different parts of the OCFS2 filesystem. > One "Master" server with RW access should still be able to mount all 3 > OCFS2 LUN's

Re: [Ocfs2-users] Multiple clusters per node?

2009-04-28 Thread Sunil Mushran
ave mounted the particular filesystem? > > Søren > > On 28/04/2009, at 22.21, Sunil Mushran wrote: > >> Søren Kröger wrote: >>> I'm trying to split up our big OCFS2 filesystem into 3 separate >>> LUN's, since there are only a limited amount of nodes which

Re: [Ocfs2-users] O2CB heartbeat: Not active

2009-04-28 Thread Sunil Mushran
McKinley, Reid wrote: > > We have installed OCFS2 1.4.1 and for some reason we can only get the > mount point mounted on 1 of 2 nodes. The 2^nd node shows that the > heartbeat is not active. > > > > [r...@nyclx2 ~]# service o2cb status > > Driver for "configfs": Loaded > > Filesystem "configf

Re: [Ocfs2-users] O2CB heartbeat: Not active

2009-04-29 Thread Sunil Mushran
What does "mounted.ocfs2 -d" say on both nodes? Not /var/log/dmesg. It is /var/log/messages. You could instead run "dmesg". This is important as it will tell you why the mount failed. Sunil McKinley, Reid wrote: > Thank you! > > Everything appears to be fine then, except that we cannot mount an

Re: [Ocfs2-users] O2CB heartbeat: Not active

2009-04-29 Thread Sunil Mushran
pr 29 12:01:17 nyclx2 kernel: ocfs2: Unmounting device (8,0) on (node > 255) > > Thanks again, > Reid > > -Original Message- > From: Sunil Mushran [mailto:sunil.mush...@oracle.com] > Sent: Wednesday, April 29, 2009 4:32 PM > To: McKinley, Reid > Cc: ocfs2-users@oss.

Re: [Ocfs2-users] ocfs2 vs ext3?

2009-04-30 Thread Sunil Mushran
Andrew (Anything) wrote: > Ive been testing using bonnie++ -n 50:1024:0:10 -s 0. Is this a bad way to > test? > Some raw results follow later in case you want them. > > Obviously ocfs2 should be slower than ext3. > But I guess I expected a single node ocfs node to be only doing internal > stuff wit

Re: [Ocfs2-users] df & du - that old chestnut

2009-05-07 Thread Sunil Mushran
http://oss.oracle.com/~smushran/.debug/scripts/stat_sysdir.sh Please file a bugzilla and attach the output of the above script. http://oss.oracle.com/bugzilla I think I know the issue. No, it is not related to blocksize/clustersize. Sunil Nigel Bishop wrote: > > Afternoon, > > We have an ocfs2

Re: [Ocfs2-users] Max number of files?

2009-05-12 Thread Sunil Mushran
Gavin Hamill wrote: > ocfs2 1.4 has a maximum of 32000 files in any single directory - we got > bitten by this bug recently. If you're talking about 5 million files, > then is there's a possibility you've encountered this limit? Incorrect. The limit is 32000 sub-dirs in a directory. There is no sp

Re: [Ocfs2-users] Unable to fix corrupt directories with fsck.ocfs2

2009-05-13 Thread Sunil Mushran
Did you run fsck with the force flag? $ fsck.ocfs2 -f /dev/sdX By default, fsck only replays the journals. Paul Taylor wrote: > Hi > > errors like the one listed below have been coming through in our logs on > a daily basis. We tried to run fsck.ocfs2 over the file system bet it > thinks that i

Re: [Ocfs2-users] Debugging help / Guidance on architecture

2009-05-15 Thread Sunil Mushran
Damon Miller wrote: > We're running a 3-node OCFS2 1.2.9 cluster with a 5-TB iSCSI block device as > the backing store. All machines are running CentOS, with the iSCSI target > running CentOS 5.2 and the initiators running CentOS 4.7. The purpose of the > cluster is to evaluate alternatives to

Re: [Ocfs2-users] Debugging help / Guidance on architecture

2009-05-19 Thread Sunil Mushran
Damon Miller wrote: > The two servers are actually connected to the same switch. We are using > iptables for basic packet filtering on all of our hosts, but TCP/ is open > on all machines participating in the cluster. iSCSI is also enabled on > TCP/3260. Here are the relevant excerpts fro

Re: [Ocfs2-users] Filesystem corruption and OCFS2 errors

2009-05-20 Thread Sunil Mushran
Christian van Barneveld wrote: > Our OCFS2 cluster has been stable for approx 8 months, but since this week it > went wrong. First we had high load problems. We had this problem because a > couple of directories got filled with files, 1 directory over 1,5 milion > files (symlinks) and NFS (mount

Re: [Ocfs2-users] OCFS2 mount points will not automatically mount on boot

2009-05-20 Thread Sunil Mushran
McKinley, Reid wrote: > > We are trying to figure out why our OCFS2 mount points will not > automatically mount on reboot. > > We are on OEL version 2.6.18-92.el5 and we are using multipathing. > > Here are the OCFS2 entries in /etc/fstab. > > /dev/dm-2 /oracw oracle_clusterware datavolume,nointr,

Re: [Ocfs2-users] Filesystem corruption and OCFS2 errors

2009-05-20 Thread Sunil Mushran
Christian van Barneveld wrote: > No, I don't have the full output, but I still have the snapshots that I've > made before teh FSCK. I've mounted it at a different server and ran a > (readonly) FSCK. See attached output. The output shows i/o errors. It is unable to read the blocks beyond a certai

Re: [Ocfs2-users] [Fwd: Re: Unable to fix corrupt directories with fsck.ocfs2]

2009-05-20 Thread Sunil Mushran
Brian Kroth wrote: > Luis Freitas 2009-05-20 10:46: >>I am not aware of any filesystem that can withstand a online fsck. >>Sun ZFS can do online correction, but it doesnt have a fsck tool. > I hear btrfs will support this. It may be a feature that's easier to > accomplish with copy on wri

Re: [Ocfs2-users] Filesystem corruption and OCFS2 errors

2009-05-21 Thread Sunil Mushran
Brian Kroth wrote: > That's what's always held me back from doing this as well. Will the > common stack be the openais stack (ie: the so called user stack), the > o2cb stack, or something completely different? Currently the o2cb userspace clusterstack is directed towards supporting the native clu

Re: [Ocfs2-users] cluster manager does not start/run ERROR: OemInit2: Attempting to open the CMDiskFile for a multi-node RAC

2009-05-24 Thread Sunil Mushran
Use sles or (rh)el. It won't work on other distros. On May 24, 2009, at 3:43 PM, sundar mahadevan wrote: > To add some more details to this issue: > > /u01/oradata/orcl/orcl/cmquorumfile mentioned in the error message is > accessible from the shell prompt. I even checked the file > $ORACLE_HOME/

Re: [Ocfs2-users] o2cb Configure problem

2009-05-25 Thread Sunil Mushran
The ocfs2 kernel driver is missing. Read the user's guide or the FAQ to learn how to install the driver. On May 25, 2009, at 8:16 AM, Devender Narula wrote: > > HI Team > > When ever i try to configure o2cb . i get below mention proble.. its > a production installation and your quick help

Re: [Ocfs2-users] o2cb Configure problem

2009-05-25 Thread Sunil Mushran
> > ocfs2console-1.4.1-1.el5 > > ocfs2-tools-1.4.1-1.el5 > > [r...@eregtest2 software]# uname -a > > Linux eregtest2.admin.abdn.ac.uk 2.6.18-92.el5 #1 SMP Tue Apr 29 > 13:16:15 EDT 2008 x86_64 x86_64 x86_64 GNU/Linux > > > --- On Mon, 5/25/09, Sunil Mushran wr

Re: [Ocfs2-users] OCFS2 & Using the private interconnect with jumbo frames for heartbeat

2009-05-26 Thread Sunil Mushran
There are no known issues with using ocfs2 on a jumbo frame enabled private network. On May 26, 2009, at 8:38 AM, Sridhar Avantsa wrote: > When using OCFS2 in a Oracle RAC set, one would configure OCFS2 to > use the private interconnect address ( in cluster.conf). > Are there any known issue

Re: [Ocfs2-users] Problem with OCFS2 on RHEL5.0 while installing CRS 10.2.01

2009-05-27 Thread Sunil Mushran
Refer to the section on oracle rdbms in the 1.4 user's guide. Specifically mount options. On May 27, 2009, at 4:55 AM, Devender Narula wrote: > > Hi team > > I had installed OCFS2 on RHEL5.0 . every thing looks fine but when I > was installing CRS on the node I got error message OCFS2 is n

Re: [Ocfs2-users] Cluster lockup when one node fails

2009-05-27 Thread Sunil Mushran
kernel version, ocfs2 version? $ uname -a $ modinfo ocfs2 $ rpm -qa | grep ocfs2 Kees Hoekzema wrote: > Hello List, > > At the moment I'm running a 7-node ocfs2 cluster on a Dell MD3000i (iscsi) > NAS. This cluster has run fine for well over a year now, but recently one of > the older and more u

Re: [Ocfs2-users] Cluster lockup when one node fails

2009-05-28 Thread Sunil Mushran
Kernel: 2.6.26-1-amd64 x86_64 > > modinfo ocfs2: > version:1.5.0 > description:OCFS2 1.5.0 > srcversion: B19D847BA86E871E41B7A64 > vermagic: 2.6.26-1-amd64 SMP mod_unload modversions > > ocfs2-tools: > Version: 1.4.1-1 > > Tia, > Kees Hoekzema

Re: [Ocfs2-users] Fwd: fsck fails & volume mount fails, is my data lost?

2009-05-29 Thread Sunil Mushran
You are using ocfs2 atop lvm - a non-cluster-aware volume manager. A lot of things can go wrong in this combination. Quite a few have been reported on this forum. debugfs.ocfs2 has commands dump and rdump that allows users to read the files directly off the disk. Use it to recover your data. khai

Re: [Ocfs2-users] ocfs2 and file locking?

2009-06-01 Thread Sunil Mushran
jhonyl wrote: > I am trying to figure out about locking under ocfs2. > > I read in the 1.4 ocfs2 pdf doc file that ocfs2 1.4 support flock but > not fcntl locks, and in a message, I read that ocfs2 rely on vfs for > fcntl, and I read something about being able to get fcntl locks but not > with o

Re: [Ocfs2-users] ocfs2 and file locking?

2009-06-01 Thread Sunil Mushran
jhonyl wrote: > If I will add pacemaker or cman package to my current OS, since > cluster fcntl is probably a new feature, is there a minimum version > number for fcntl to be supported? The support for clustered fcntl in ocfs2 was added in 2.6.27. ___

Re: [Ocfs2-users] ocfs2 and file locking?

2009-06-01 Thread Sunil Mushran
jhonyl wrote: > Thanks for your reply. > > Good to know... time to upgrade the kernel. > > Is there a minimum required pacemaker or cman version too? > With pacemaker does it need heartbeat or openais for this? or either? > And if it maters what is the minimum version required for openais or > hea

Re: [Ocfs2-users] Fwd: fsck fails & volume mount fails, is my data lost?

2009-06-01 Thread Sunil Mushran
khaije rock wrote: > 90% of my way through the recovery and it turns out that each of the > more volume-wide rdump attemps were chocking at the same specific > point: a symlink pointing to a directory on a different filesystem > that then descended back into the ocfs volume. > > Looking like thi

Re: [Ocfs2-users] ocfs2console is slow

2009-06-03 Thread Sunil Mushran
McKinley, Reid wrote: > > Ever since we have installed OCFS2, we have had extremely slow > performance in the ocfs2console. It can take us over 30 minutes to do > the simplest tasks. > > > > We do not have this type of performance with other xwindows > applications on our server. > > > > D

Re: [Ocfs2-users] ocfs2console is slow

2009-06-03 Thread Sunil Mushran
McKinley, Reid wrote: > No, "mounted.ocfs2 -d" comes back in less than 5 seconds. > > You said it taking 30 mins to do any action. Can you expand on that? As in, possibly walk us through your steps. ___ Ocfs2-users mailing list Ocfs2-users@oss.oracle

Re: [Ocfs2-users] O2CB heartbeat not active on 2nd node

2009-06-03 Thread Sunil Mushran
The connect requests are not getting through. Do you have any firewalls setup? Is iptables running? If so, either shut it down or allow traffic on the o2cb port. McKinley, Reid wrote: > > We are having trouble getting the 2^nd node in our 2 node RAC > configuration to have an active O2CB heartbea

Re: [Ocfs2-users] ocfs2console is slow

2009-06-03 Thread Sunil Mushran
minues. Then, I can select "format". > > Let me know if you need further details. > Thanks, > Reid > > -Original Message- > From: Sunil Mushran [mailto:sunil.mush...@oracle.com] > Sent: Wednesday, June 03, 2009 12:48 PM > To: McKinley, Reid > Cc: o

Re: [Ocfs2-users] Problems mounting ocfs2 on 2 nodes

2009-06-03 Thread Sunil Mushran
Which cluster stack are you using? o2cb or pacemaker? Shaffin Bhanji wrote: > I have a 2 node cluster setup but am at a loss as to why I cannot > mount a shared ocfs2 filesystem on both nodes being shared by iSCSI? > Am I wrong in understanding that this can be achieved? > > I am using OpenAIS und

Re: [Ocfs2-users] O2CB heartbeat not active on 2nd node

2009-06-03 Thread Sunil Mushran
]# service iptables status > Firewall is stopped. > > -Original Message- > From: Sunil Mushran [mailto:sunil.mush...@oracle.com] > Sent: Wednesday, June 03, 2009 12:57 PM > To: McKinley, Reid > Cc: ocfs2-users@oss.oracle.com > Subject: Re: [Ocfs2-users] O2CB heart

Re: [Ocfs2-users] O2CB heartbeat not active on 2nd node

2009-06-03 Thread Sunil Mushran
ed > Driver for "ocfs2_dlmfs": Loaded > Filesystem "ocfs2_dlmfs": Mounted > Checking O2CB cluster tiaa: Online > Heartbeat dead threshold = 31 > Network idle timeout: 3 > Network keepalive delay: 2000 > Network reconnect delay: 2000 > Checking O2CB

Re: [Ocfs2-users] O2CB heartbeat not active on 2nd node

2009-06-03 Thread Sunil Mushran
> node: > ip_port = > ip_address = 192.168.0.217 > number = 1 > name = nyclx2 > cluster = tiaa > > cluster: > node_count = 2 > name = tiaa > > -Original Message- > From: Sunil Mushran [mailto:sunil.mush...@or

Re: [Ocfs2-users] Default Values of heartbeat dead threshold

2009-06-05 Thread Sunil Mushran
Actually, it is not complex. o2cb timeouts: If not using multipathing/netbonding, leave the timeouts as it. If using multipathing, double the disk hearbeat to 120 secs. If using netbonding, double the network idle to 60 secs. Ensure your private network has no loops to prevent spanning tree protoc

Re: [Ocfs2-users] ocfs2 in sles11 vs. sles10

2009-06-05 Thread Sunil Mushran
Are you sure you are mounting the same volume on both nodes? Do on both nodes: debugfs.ocfs2 -R "stats" /dev/sdX | grep UUID It should be the same. Ensure you don't have any local iscsi caching enabled. Bengtsson Anders wrote: > > I’m trying to mount a ocfs2 volume (created on sles11) on my sle

Re: [Ocfs2-users] ocfs2 fencing with multipath and dual channel HBA

2009-06-08 Thread Sunil Mushran
florian.engelm...@bt.com wrote: > We tried to use ocfs2 with Vserver clustered with Heartbeat. But > Vservers need barrier=1. That did not work on our shared storage with > ocfs2 but I guess this is no ocfs2 problem it is a device mapper problem > because we need to use multipath and LVM2, isn't it

Re: [Ocfs2-users] OCFS2 hosting and running binaries

2009-06-09 Thread Sunil Mushran
Sure. One can use ocfs2 to host almost anything. The one exception is the crs_home. crs_home needs to be on a local volume. OCFS 1.2/1.4 has two limits. Like ext3, the number of sub-directories in _a_ directory cannot exceed 32000. (There is no limit to the number of subdirs in a volume.) The othe

Re: [Ocfs2-users] mount.ocfs2: Transport endpoint is not connected while mounting

2009-06-10 Thread Sunil Mushran
ensure iptables is either off or has rules for the interconnect traffic. use tcpdump to see if packets are coming thru. The connect request is initiated between two nodes is initiated when both of them first mount a common volume. Also, the connect request is always from the higher node to the low

Re: [Ocfs2-users] How to mount OCFS2 file systems using the EMC Power Path multipath device

2009-06-10 Thread Sunil Mushran
There is another scheme (less elegant but probably quicker to deploy) that was described in this list by a user. As root run blkid. Then edit /etc/blkid.tab and remove all sd devices that correspond to the emcpp devices. Ensure pp is enabled. Rerun blkid. This time you should see the pp devices in

Re: [Ocfs2-users] mount.ocfs2: Transport endpoint is not connected while mounting

2009-06-11 Thread Sunil Mushran
iranha which invokes > iptables. What should I add to iptables to enable interconnect > traffic? > > TIA > > John > > On Wed, 2009-06-10 at 10:18 -0700, Sunil Mushran wrote: >> ensure iptables is either off or has rules for the interconnect >> traffic. >>

Re: [Ocfs2-users] mount.ocfs2: Transport endpoint is not connected while mounting

2009-06-11 Thread Sunil Mushran
ree working nodes in order to run /etc/init.d/o2cb offline on each one > followed by /etc/init.d/o2cb start? > > Best Regards > > John > > On Thu, 2009-06-11 at 07:33 -0700, Sunil Mushran wrote: > >> Add a rule to allow traffic on port (or whatever it is

Re: [Ocfs2-users] OCFS2 1.4.1 DLM unhandled error

2009-06-16 Thread Sunil Mushran
Please file a bugzilla in oss.oracle.com/bugzilla. Saul Gabay wrote: > > We have a 2 node OCFS2 cluster running Oracle 10g, both nodes crashed. > > > > Node 1 because it panic running IOSTAT, the second node crashed with > this error message you can see below. > > > > I was hoping to see a ne

[Ocfs2-users] OCFS2 1.4.2-1 and OCFS2 Tools 1.4.2-1 released

2009-06-16 Thread Sunil Mushran
All, We are pleased to announce the release of OCFS2 1.4.2-1 and OCFS2 Tools 1.4.2-1 for Oracle's and Red Hat's Enterprise Linux 5 Update 2 and higher. Oracle's Unbreakable Linux Network users who are subscribing to the "OCFS2 1.4 packages for Enterprise Linux 5" channel can upgrade to this relea

Re: [Ocfs2-users] OCFS2 Caused RAC server to crash

2009-06-17 Thread Sunil Mushran
Please file a bugzilla and _attach_ this oops trace. Also mention all the version numbers. On Jun 17, 2009, at 2:30 AM, "McDonald, Stuart" > wrote: Hi We have a two-node RAC cluster, which uses ASM for the database storage, but is using OCFS2 to mount a couple of file systems for a) th

Re: [Ocfs2-users] [Ocfs2-announce] OCFS2 1.4.2-1 and OCFS2 Tools 1.4.2-1 released

2009-06-18 Thread Sunil Mushran
Brian Kroth wrote: > Sunil Mushran 2009-06-16 16:38: > >> LOOKING AHEAD >> >> We are aiming to release OCFS2 1.6 later this year. This release will >> include the features that we have worked on over the past year. These are: >> >> 1. Extended Attrib

Re: [Ocfs2-users] ocfs2 quota support

2009-06-22 Thread Sunil Mushran
Senmiao Chen wrote: > Whenever I tried to mount an OCFS2 volume I got the following message, > "ocfs2_fill_super:1016 ERROR: User quotas were requested, but this > filesystem does not have the feature enabled." Just wonder how to enable > this feature. The ocfs2 wiki says "you'll need support in

Re: [Ocfs2-users] issues with shared disk

2009-06-24 Thread Sunil Mushran
ocfs2 is a shared disk cluster file system. All nodes need to have access to the disk. Typical technologies involved are fiber channel and iscsi. If you are in a virtual environment, then you could present a local device as shared to multiple guests. On Jun 24, 2009, at 2:51 AM, "sheri...@ho

Re: [Ocfs2-users] o2net_connect_expired:1637 ERROR

2009-06-24 Thread Sunil Mushran
Not sure why you think it is trying to connect to itself. o2net connects to other nodes only. Raheel Akhtar wrote: > > Hi, > > I have OCFS2 Cluster of 5 nodes running on RHEL 5.2 (kernel version > 2.6.18-128.1.10.el5). I am getting error like > > Jun 24 09:26:54 alf2 kernel: (2095,0):o2net_connec

Re: [Ocfs2-users] Unexplained reboots in DRBD82 + OCFS2 setup

2009-06-24 Thread Sunil Mushran
Do you have a separate network path for drbd traffic? If you do not, then you are probably overloading the network. In this case, I believe drbd is unable to replicate the ios fast enough and thus is blocking the o2cb disk heartbeat. One workaround is to increase the O2CB_HEARTBEAT_THRESHOLD to mor

Re: [Ocfs2-users] ocfs2 freeze

2009-06-24 Thread Sunil Mushran
The nodes are not frozen. The processes that are attempting to talk to the "disconnected" node are waiting for that node to reply, failing which, to die. The default timeout for the disk heartbeat is 60 secs. If that node simply died, the other nodes would have deemed the node dead after 60 secs,

Re: [Ocfs2-users] Unexplained reboots in DRBD82 + OCFS2 setup

2009-06-25 Thread Sunil Mushran
logs. Kris Buytaert wrote: > On Wed, 2009-06-24 at 12:02 -0700, Sunil Mushran wrote: > >> Do you have a separate network path for drbd traffic? If you do >> not, then you are probably overloading the network. In this case, >> I believe drbd is unable to replicate the

Re: [Ocfs2-users] ocfs2 freeze

2009-06-25 Thread Sunil Mushran
s: > Jun 24 11:43:01 node5 kernel: [855031.567140] > (20663,7):dlm_send_remote_unlock_request:359 ERROR: status = -107 > Jun 24 11:43:01 node5 kernel: [855031.567140] > (20663,7):dlm_send_remote_unlock_request:359 ERROR: status = -107 > > The problem is that I couldn't acce

Re: [Ocfs2-users] from 32bit to 64bit

2009-07-02 Thread Sunil Mushran
Explain "repository built by OCFS2 1.4.x"? What is this repository? OCFS2, the file system, is architecture neutral. Meaning, it works across 32-bit, 64-bit, little endian and big endian boxes. One can mount an ocfs2 volume concurrently on x86, x86_64, ia64 and ppc64 nodes. They all have to be Lin

Re: [Ocfs2-users] from 32bit to 64bit

2009-07-02 Thread Sunil Mushran
to their repository. Having said that, a 64-bit system does not necessarily imply 64-bit apps. The apps itself could be 32-bit. Sunil Mushran wrote: > Yes. > > But do read the notes section in the ocfs2 user's guide. It talks > about having nodes with different compute power in a cl

Re: [Ocfs2-users] from 32bit to 64bit

2009-07-02 Thread Sunil Mushran
ng documents on Repository which > is mounted with OCFS2. > > That mean I can add 64bit Red Hat node in current 32 bit OCFS2 cluster? > > Thanks > > > > -Original Message- > From: Sunil Mushran [mailto:sunil.mush...@oracle.com] > Sent: Thursday, J

Re: [Ocfs2-users] System reboots

2009-07-04 Thread Sunil Mushran
Setup netconsole to trap the logs. Once you have it then file a bugzilla and attach the logs of all 5 nodes. On Jul 4, 2009, at 7:46 PM, Raheel Akhtar wrote: > Hi, > > > > I am using OCFS2 1.4.2-1 for RedHat Linux 5.2 64 bit > (2.6.18-128.1.6.el5) for 5 nodes. I notices sometime system just

Re: [Ocfs2-users] System reboots

2009-07-04 Thread Sunil Mushran
Google it. On Jul 4, 2009, at 7:58 PM, Raheel Akhtar wrote: > Hi Sunil, > > Do you mind to send me link how to setup netconsole on RedHat I > never did > before. > Highly appreciated. > > Raheel > > > -----Original Message- > From: Sunil Mushran [mai

Re: [Ocfs2-users] umount hang + high CPU

2009-07-06 Thread Sunil Mushran
Fixed. Details in http://oss.oracle.com/bugzilla/show_bug.cgi?id=914 syla...@aim.com wrote: > > Hi, > > On kernel 2.6.30 (and I have upgraded drbd there too to 8.3.2) I have > nothing in the logs, and the umount hangs, and after a few minutes the > whole computer hangs, and I have to hard r

Re: [Ocfs2-users] umount hang + high CPU

2009-07-07 Thread Sunil Mushran
The fix was for the oops you saw. The hang is a different issue. We have no info on that. For that, if you would like to diagnose the problem, read up the dlm notes in the 1.4 user's guide. It explains a debugging process vis-a-vis hangs. If the issue is dlm related, then we would like to have t

Re: [Ocfs2-users] umount hang + high CPU

2009-07-07 Thread Sunil Mushran
a noob, so I don't > even know if it is a bug, or a misconfiguration, or a misunderstanding. > > PS. Is nodiratime option supported for mounts? I used it, but I don't > see it in the user-guide. > > -Original Message- > From: Sunil Mushran > To: sylarrr

Re: [Ocfs2-users] ocfs2 acl issue

2009-07-15 Thread Sunil Mushran
Please do remember to file a bugzilla. Once it is fixed, add the git commit details to it. On Jul 15, 2009, at 4:15 AM, Tiger Yang wrote: > Hi, Marco, > > Thanks a lot, it is really a bug in ocfs2 acl. I can reproduce it now > and find the cause. I will send a patch to fix it after done some

Re: [Ocfs2-users] CentOS-5.3 + DRBD-8.2 + OCFS2-1.4

2009-07-15 Thread Sunil Mushran
This patch should be in ocfs2-1.4. It is not in ocfs2-1.2. Ignore the kernel version. Both 1.2 and 1.4 work on el5. On Jul 15, 2009, at 3:41 AM, Kevin Clark wrote: > I've run into a problem mounting an OCFS2 filesystem on a DRBD > device. I think it's the same one discussed at > http://l

Re: [Ocfs2-users] ocfs2 acl issue

2009-07-15 Thread Sunil Mushran
Please file a bugzilla @ oss.oracle.com/bugzilla Attach this to it. On Jul 15, 2009, at 6:37 PM, Marco Huang wrote: > -BEGIN PGP SIGNED MESSAGE- > Hash: SHA1 > > Hi Tiger, > > I am also exporting the ocfs2 file system via nfs (with acl) to other > servers. I am getting the following k

Re: [Ocfs2-users] OCFS2 Node restart

2009-07-22 Thread Sunil Mushran
Please file a bugzilla and attach the netconsole logs of all six nodes. The messages provided indicate that that node saw the two nodes become unresponsive. As to why they became unresponsive will be known only after we see the netconsole logs of the two nodes. Raheel Akhtar wrote: > > Hi, > >

Re: [Ocfs2-users] Error message whil booting system

2009-07-29 Thread Sunil Mushran
ocfs2_stackglue not found error message is harmless. We use the same init script for all versions of the fs stackglue is present in the current mainline and will be in ocfs2 1.6. Raheel Akhtar wrote: > > Hi, > > When system booting getting error message “modprobe: FATAL: Module > ocfs2_stackg

Re: [Ocfs2-users] Error message whil booting system

2009-07-29 Thread Sunil Mushran
11:09:45 alf1 kernel: ocfs2_dlm: Nodes in domain > ("7BE7E9E2026A40F8801B56257D805C88"): 0 1 2 3 4 5 > -- > > > > > -Original Message- > From: Sunil Mushran [mailto:sunil.mush...@oracle.com] > Sent: Wednesday, July 29, 2009 1:25 PM

Re: [Ocfs2-users] Problems with umounting ocfs2 volume

2009-07-30 Thread Sunil Mushran
"device busy" could be because you have a shell having that as the cwd. Check: ls -l /proc/[0-9]*/cwd Georg Höllrigl wrote: > Hello, > > I've several LUNs mounted in a 7 node cluster - one LUN which is only used on > 4 of the nodes. > > It's impossible to umount this LUN - I'm always getting devi

Re: [Ocfs2-users] ocfs2 configuration/performance questions...

2009-08-03 Thread Sunil Mushran
Peter W. Morreale wrote: > Hi all, > > I'm trying to determine the performance implications of various > configurations for ocfs2. (I'm new to ocfs2, but have read through all > the docs for both 1.2 and 1.4, so please be gentle :) This would be a > 1.4 installation. > > I searched through www.

<    4   5   6   7   8   9   10   11   12   13   >