Re: [Ocfs2-users] kernel panic - bug in dlmglue.c ?

2009-09-11 Thread Sunil Mushran
Yes. The fix will be in 1.4.4. On Sep 11, 2009, at 7:20 AM, Tao Ma wrote: > Hi John, > John McNulty wrote: >> Hi, >> >> I had a system crash last night. Netconsole caught the following >> trace dump. Has this one been seen before? >> > This bug is fixed in mainline and should show up in next o

Re: [Ocfs2-users] Hidden Files

2009-09-10 Thread Sunil Mushran
hidden file? Are you encountering an issue whereby you are not recovering space after you have deleted a file? If so, then that is a known issue in 1.4.1. Part solution is to upgrade to 1.4.2. The full solution will be available with OCFS2 1.4.4 that is currently in testing. If you have support,

[Ocfs2-users] Quick update regarding (RH)EL5 U4

2009-09-09 Thread Sunil Mushran
All, Just to let you all know that packages for OCFS2 1.4.2-1 are available for EL5 U4 and RHEL5 U4. http://oss.oracle.com/pipermail/el-errata/2009-September/001140.html http://oss.oracle.com/projects/ocfs2/files/RedHat/RHEL5/ We will not be releasing packages for OCFS2 1.2 for (RH)EL5 U4. Users

Re: [Ocfs2-users] Ocfs2 for RHEL 5.4

2009-09-08 Thread Sunil Mushran
Hopefully later today (Sep 8th). Robin Garner wrote: > Any indications as to when ocfs2 will be available for RHEL 5.4 ? > > Thanks, > Robin > > ___ > Ocfs2-users mailing list > Ocfs2-users@oss.oracle.com > http://oss.oracle.com/mailman/listinfo/ocfs2-us

Re: [Ocfs2-users] OCFS2 vs Oracle ASM CFS

2009-09-03 Thread Sunil Mushran
Jeremy Schneider wrote: > Some quick questions for the ocfs2-users list, now that 11.2 is public. > (Now that it's public knowledge that Oracle developed a whole new > [propriatery] cluster filesystem from the ground up for RAC.) > > 1) Seems that the 11.2 docs encourage using ACFS rather than OCF

Re: [Ocfs2-users] Weird crash

2009-09-01 Thread Sunil Mushran
For issues on sles, please file a bug/sr with novell. The issue here is insufficient journal credits. It _could_ be that this version is missing mainline git commit e051fda4fd14fe878e6d2183b3a4640febe9e9a8. But I don't know. Novell Support will be better placed to track down the issue. Sérgio S

Re: [Ocfs2-users] Clear Node

2009-08-25 Thread Sunil Mushran
Can you describe the mount lock? You don't have to limit the mount to just one node. Have both nodes mount the volume but run mysql on one node only. Sunil James Devine wrote: > I am trying to make a mysql standby setup with 2 machines, one primary > and one hot standby, which both share di

Re: [Ocfs2-users] OCFS2 and Xen interoperability issue

2009-08-25 Thread Sunil Mushran
Late this year, early next year. Gonçalo Borges wrote: > Hi Sunil > > Thanks for the reply. What's the time frame to release OCFS2 1.6? > > Cheers > Goncalo > > > On 08/25/2009 05:13 PM, Sunil Mushran wrote: >> So this is a known issue on OCFS2 1.4/(RH)EL5 c

Re: [Ocfs2-users] OCFS2 and Xen interoperability issue

2009-08-25 Thread Sunil Mushran
So this is a known issue on OCFS2 1.4/(RH)EL5 combination. As in, this will _work_ on OCFS2 1.2 on the same kernel and _should_ work with OCFS2 bundled with the mainline kernels. But not on the specific combination you are using. In short, the xm save/dump-core implementation in 2.6.18 is hacky. A

Re: [Ocfs2-users] OCFS2 and Xen interoperability issue

2009-08-24 Thread Sunil Mushran
ocfs2 and kernel versions? Gonçalo Borges wrote: > Hi All... > > I'm testing a Xen solution on an OCFS2 SAN to store VM images, > and I'm observing an interoperability issue between two kinds of > softwares. > > I've already tried to obtain some feedback from Xen experts without > any success. May

Re: [Ocfs2-users] more ocfs2_delete_inode dmesg questions

2009-08-24 Thread Sunil Mushran
So a delete was called for some inodes that had not been orphaned. The pre-checks detected the same and correctly aborted the deletes. No harm done. No, the messages do not pinpoint the device. It's something we discussed adding, but have not done it as yet. Next time this happens and you can ide

Re: [Ocfs2-users] Ghost files in OCFS2 filesystem

2009-08-21 Thread Sunil Mushran
2.6.31-rc1. Brian Kroth wrote: > I didn't see this in the bug list. Which mainline release is this fixed > in? > > Thanks, > Brian > > Sunil Mushran 2009-08-20 17:46: > >> Yes, this is a known issue in OCFS2 1.4.1 and 1.4.2. That is assuming >> no

Re: [Ocfs2-users] Ghost files in OCFS2 filesystem

2009-08-20 Thread Sunil Mushran
Yes, this is a known issue in OCFS2 1.4.1 and 1.4.2. That is assuming no process in the cluster has that file open. We have the fix. It will be available with 1.4.3 which is in testing. This was discussed in the email announcing the 1.4.2 release. ==

Re: [Ocfs2-users] Kernel Version

2009-08-14 Thread Sunil Mushran
Sure. Raheel Akhtar wrote: > > Hi, > > > > I have existing OCFS2 cluster (1.4.2-1.el5) with kernel version > RedHat 5.3 Enterprise kernel version 2.6.18-128.1.16.el5. I am going > to add 2 new nodes with RedHat 5.3 enterprise kernel version > 2.6.18-128.el5. > > > > My question Can I add 2

Re: [Ocfs2-users] Shutdown to single user mode causes SysRq Reset

2009-08-13 Thread Sunil Mushran
This is a feature. ;) If you have mounted a volume on two or more nodes, the expectation is that the private interconnect will always remain up. If you shut down the network on a node, the cluster stack will have to kill a node. It does so in order to prevent hangs in cluster operations. In a 2 nod

Re: [Ocfs2-users] error mounting ocfs2 mountpoints

2009-08-10 Thread Sunil Mushran
The nodes are failing to connect to each other on that interface. Ensure the ips are correct. The firewalls are disabled or have the rules to enable the traffic on port (or whatever you have set). You can use "ping -I ethX ipaddress" to check whether the ip is valid. Saranya Sivakumar wrote:
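
Beyond ping, it can help to confirm that a TCP connection to the peer's o2cb port actually succeeds. The following is only a minimal sketch of such a check; the function name is hypothetical, and the host and port must be supplied by the reader (the port is whatever ip_port is set to in cluster.conf, which this message does not state).

```python
import socket


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    Hypothetical helper: 'host' is a peer's interconnect address and 'port'
    is the ip_port value from cluster.conf (both supplied by the caller).
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Refused, filtered by a firewall, unreachable, or timed out.
        return False
```

A False result from a node that answers ping usually points at a firewall rule or a wrong port rather than at the interface itself.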

Re: [Ocfs2-users] Quantifying OCFS2 network traffic...

2009-08-06 Thread Sunil Mushran
With the o2cb stack, the network traffic will mostly be dlm related. Lock co-ordination. No ios. With the pacemaker stack, you will have some traffic required for ordered messaging too. I don't have the details on it. Again, no ios. For o2cb, the packets should be small. Tiny, in fact. But there ca

Re: [Ocfs2-users] OCFS2 mount points not automatically mounting onserver reboot

2009-08-06 Thread Sunil Mushran
When are the /dev/mapper paths created? McKinley, Reid wrote: > > I should mention that the OCFS2 mount points mount fine when the > manual mount is done after the reboot. > > Cmds for manual mounting: > > > > mount -o datavolume,nointr,_netdev,noatime -t ocfs2 /dev/mapper/mpath0 > /u02 > > mo

Re: [Ocfs2-users] ocfs2 question

2009-08-05 Thread Sunil Mushran
t; clusters mount the same device as long as the ocfs2 versions match? > > Regards, > Saranya Sivakumar > > > -------- > *From:* Sunil Mushran > *To:* Saranya Sivakumar > *Sent:* Wednesday, August 5, 2009 12:

Re: [Ocfs2-users] OCFS2 2.6.25

2009-08-05 Thread Sunil Mushran
Raheel Akhtar wrote: > That mean for kernel 2.6.30 (current stable kernel ) version, I don't need > to download ocfs2, just download tools and console?, because ocfs2 is only > available for 2.6.18.x kernel. The mainline kernel includes ocfs2, the file system. You have to of course build it. The

Re: [Ocfs2-users] OCFS2 2.6.25

2009-08-05 Thread Sunil Mushran
Raheel Akhtar wrote: > > Hi, > > According to this site, http://oss.oracle.com/osswiki/OCFS2/Roadmap > cluster ware flock feature is available in (2.6.25) for Red hat 5.2 U2. > > But on download page of OCFS2 website I didn’t find 2.6.25 version for > 64 bit Red Hat Linux 5.2. > > Current downloa

Re: [Ocfs2-users] barrier mount option?

2009-08-05 Thread Sunil Mushran
Peter W. Morreale wrote: > Can someone explain the implications of the "barrier=[0,1]" mount option > in OCFS2? A common misconception with disks is that when a disk acknowledges a write, that write has hit the platter. This is not the case with disks that have their own caches. Earlier it was n

Re: [Ocfs2-users] ocfs2 question

2009-08-04 Thread Sunil Mushran
ocfs2 1.2.3 is 3 years old. Suggest you upgrade that to 1.2.9. 1.2.3 and 1.2.9 are not network compatible. The mount will fail. Saranya Sivakumar wrote: > Hi, > We have a shared backup storage that resides on EMC storage and > mounted using ocfs2 1.2.3 on a physical standby database in productio

Re: [Ocfs2-users] ocfs2 configuration/performance questions...

2009-08-03 Thread Sunil Mushran
Peter W. Morreale wrote: > So, regardless of the blocksize the file system was configured with, HB > only looks at a sector per node, is that right? > Yes. The hb thread figures out the sector size using ioctl(BLKSSZGET). The idea is to keep the hb io to a minimum.
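
For readers who want to see what the sector-size query looks like from user space, here is a rough sketch using the BLKSSZGET ioctl from linux/fs.h. It is not how the hb thread itself is written (that is kernel code); the function name is hypothetical, the ioctl number is hard-coded as an assumption because Python's fcntl module does not export it, and reading a block device normally needs root.

```python
import fcntl
import os
import struct

# Assumption: BLKSSZGET is _IO(0x12, 104) in <linux/fs.h>, i.e. 0x1268.
BLKSSZGET = 0x1268


def logical_sector_size(device_path):
    """Return the logical sector size in bytes of a Linux block device."""
    fd = os.open(device_path, os.O_RDONLY)
    try:
        # The kernel writes a C int into the buffer we pass in.
        buf = fcntl.ioctl(fd, BLKSSZGET, struct.pack("i", 0))
        return struct.unpack("i", buf)[0]
    finally:
        os.close(fd)
```

Called on a device node such as /dev/sdX, this typically reports 512 or 4096.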

Re: [Ocfs2-users] ocfs2 configuration/performance questions...

2009-08-03 Thread Sunil Mushran
Peter W. Morreale wrote: > Hi all, > > I'm trying to determine the performance implications of various > configurations for ocfs2. (I'm new to ocfs2, but have read through all > the docs for both 1.2 and 1.4, so please be gentle :) This would be a > 1.4 installation. > > I searched through www.

Re: [Ocfs2-users] Problems with umounting ocfs2 volume

2009-07-30 Thread Sunil Mushran
"device busy" could be because you have a shell having that as the cwd. Check: ls -l /proc/[0-9]*/cwd Georg Höllrigl wrote: > Hello, > > I've several LUNs mounted in a 7 node cluster - one LUN which is only used on > 4 of the nodes. > > It's impossible to umount this LUN - I'm always getting devi

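The /proc check suggested above can also be scripted. This is a minimal sketch, assuming a Linux /proc layout; the function name is hypothetical, and without root it only sees processes whose cwd link the caller is allowed to read.

```python
import os


def pids_with_cwd_under(mount_point):
    """Return sorted PIDs whose current working directory is at or below mount_point."""
    mount_point = os.path.realpath(mount_point)
    prefix = mount_point.rstrip("/") + "/"
    hits = []
    for entry in os.listdir("/proc"):
        if not entry.isdigit():
            continue  # not a process directory
        try:
            cwd = os.readlink("/proc/%s/cwd" % entry)
        except OSError:
            continue  # process exited, or we lack permission
        if cwd == mount_point or cwd.startswith(prefix):
            hits.append(int(entry))
    return sorted(hits)
```

Any PID it returns is a candidate for keeping the volume busy at umount time.
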
Re: [Ocfs2-users] Error message whil booting system

2009-07-29 Thread Sunil Mushran
11:09:45 alf1 kernel: ocfs2_dlm: Nodes in domain > ("7BE7E9E2026A40F8801B56257D805C88"): 0 1 2 3 4 5 > -- > > > > > -Original Message- > From: Sunil Mushran [mailto:sunil.mush...@oracle.com] > Sent: Wednesday, July 29, 2009 1:25 PM

Re: [Ocfs2-users] Error message whil booting system

2009-07-29 Thread Sunil Mushran
The "ocfs2_stackglue not found" error message is harmless. We use the same init script for all versions of the fs. stackglue is present in the current mainline and will be in ocfs2 1.6. Raheel Akhtar wrote: > > Hi, > > When system booting getting error message “modprobe: FATAL: Module > ocfs2_stackg

Re: [Ocfs2-users] OCFS2 Node restart

2009-07-22 Thread Sunil Mushran
Please file a bugzilla and attach the netconsole logs of all six nodes. The messages provided indicate that that node saw the two nodes become unresponsive. As to why they became unresponsive will be known only after we see the netconsole logs of the two nodes. Raheel Akhtar wrote: > > Hi, > >

Re: [Ocfs2-users] ocfs2 acl issue

2009-07-15 Thread Sunil Mushran
Please file a bugzilla @ oss.oracle.com/bugzilla Attach this to it. On Jul 15, 2009, at 6:37 PM, Marco Huang wrote: > -BEGIN PGP SIGNED MESSAGE- > Hash: SHA1 > > Hi Tiger, > > I am also exporting the ocfs2 file system via nfs (with acl) to other > servers. I am getting the following k

Re: [Ocfs2-users] CentOS-5.3 + DRBD-8.2 + OCFS2-1.4

2009-07-15 Thread Sunil Mushran
This patch should be in ocfs2-1.4. It is not in ocfs2-1.2. Ignore the kernel version. Both 1.2 and 1.4 work on el5. On Jul 15, 2009, at 3:41 AM, Kevin Clark wrote: > I've run into a problem mounting an OCFS2 filesystem on a DRBD > device. I think it's the same one discussed at > http://l

Re: [Ocfs2-users] ocfs2 acl issue

2009-07-15 Thread Sunil Mushran
Please do remember to file a bugzilla. Once it is fixed, add the git commit details to it. On Jul 15, 2009, at 4:15 AM, Tiger Yang wrote: > Hi, Marco, > > Thanks a lot, it is really a bug in ocfs2 acl. I can reproduce it now > and find the cause. I will send a patch to fix it after done some

Re: [Ocfs2-users] umount hang + high CPU

2009-07-07 Thread Sunil Mushran
a noob, so I don't > even know if it is a bug, or a misconfiguration, or a misunderstanding. > > PS. Is nodiratime option supported for mounts? I used it, but I don't > see it in the user-guide. > > -Original Message- > From: Sunil Mushran > To: sylarrr

Re: [Ocfs2-users] umount hang + high CPU

2009-07-07 Thread Sunil Mushran
The fix was for the oops you saw. The hang is a different issue. We have no info on that. For that, if you would like to diagnose the problem, read up the dlm notes in the 1.4 user's guide. It explains a debugging process vis-a-vis hangs. If the issue is dlm related, then we would like to have t

Re: [Ocfs2-users] umount hang + high CPU

2009-07-06 Thread Sunil Mushran
Fixed. Details in http://oss.oracle.com/bugzilla/show_bug.cgi?id=914 syla...@aim.com wrote: > > Hi, > > On kernel 2.6.30 (and I have upgraded drbd there too to 8.3.2) I have > nothing in the logs, and the umount hangs, and after a few minutes the > whole computer hangs, and I have to hard r

Re: [Ocfs2-users] System reboots

2009-07-04 Thread Sunil Mushran
Google it. On Jul 4, 2009, at 7:58 PM, Raheel Akhtar wrote: > Hi Sunil, > > Do you mind to send me link how to setup netconsole on RedHat I > never did > before. > Highly appreciated. > > Raheel > > > -----Original Message- > From: Sunil Mushran [mai

Re: [Ocfs2-users] System reboots

2009-07-04 Thread Sunil Mushran
Setup netconsole to trap the logs. Once you have it then file a bugzilla and attach the logs of all 5 nodes. On Jul 4, 2009, at 7:46 PM, Raheel Akhtar wrote: > Hi, > > > > I am using OCFS2 1.4.2-1 for RedHat Linux 5.2 64 bit > (2.6.18-128.1.6.el5) for 5 nodes. I notices sometime system just

Re: [Ocfs2-users] from 32bit to 64bit

2009-07-02 Thread Sunil Mushran
ng documents on Repository which > is mounted with OCFS2. > > That mean I can add 64bit Red Hat node in current 32 bit OCFS2 cluster? > > Thanks > > > > -Original Message- > From: Sunil Mushran [mailto:sunil.mush...@oracle.com] > Sent: Thursday, J

Re: [Ocfs2-users] from 32bit to 64bit

2009-07-02 Thread Sunil Mushran
to their repository. Having said that, a 64-bit system does not necessarily imply 64-bit apps. The apps themselves could be 32-bit. Sunil Mushran wrote: > Yes. > > But do read the notes section in the ocfs2 user's guide. It talks > about having nodes with different compute power in a cl

Re: [Ocfs2-users] from 32bit to 64bit

2009-07-02 Thread Sunil Mushran
Explain "repository built by OCFS2 1.4.x"? What is this repository? OCFS2, the file system, is architecture neutral. Meaning, it works across 32-bit, 64-bit, little endian and big endian boxes. One can mount an ocfs2 volume concurrently on x86, x86_64, ia64 and ppc64 nodes. They all have to be Lin

Re: [Ocfs2-users] ocfs2 freeze

2009-06-25 Thread Sunil Mushran
s: > Jun 24 11:43:01 node5 kernel: [855031.567140] > (20663,7):dlm_send_remote_unlock_request:359 ERROR: status = -107 > Jun 24 11:43:01 node5 kernel: [855031.567140] > (20663,7):dlm_send_remote_unlock_request:359 ERROR: status = -107 > > The problem is that I couldn't acce

Re: [Ocfs2-users] Unexplained reboots in DRBD82 + OCFS2 setup

2009-06-25 Thread Sunil Mushran
logs. Kris Buytaert wrote: > On Wed, 2009-06-24 at 12:02 -0700, Sunil Mushran wrote: > >> Do you have a separate network path for drbd traffic? If you do >> not, then you are probably overloading the network. In this case, >> I believe drbd is unable to replicate the

Re: [Ocfs2-users] ocfs2 freeze

2009-06-24 Thread Sunil Mushran
The nodes are not frozen. The processes that are attempting to talk to the "disconnected" node are waiting for that node to reply, failing which, to die. The default timeout for the disk heartbeat is 60 secs. If that node simply died, the other nodes would have deemed the node dead after 60 secs,

Re: [Ocfs2-users] Unexplained reboots in DRBD82 + OCFS2 setup

2009-06-24 Thread Sunil Mushran
Do you have a separate network path for drbd traffic? If you do not, then you are probably overloading the network. In this case, I believe drbd is unable to replicate the ios fast enough and thus is blocking the o2cb disk heartbeat. One workaround is to increase the O2CB_HEARTBEAT_THRESHOLD to mor

Re: [Ocfs2-users] o2net_connect_expired:1637 ERROR

2009-06-24 Thread Sunil Mushran
Not sure why you think it is trying to connect to itself. o2net connects to other nodes only. Raheel Akhtar wrote: > > Hi, > > I have OCFS2 Cluster of 5 nodes running on RHEL 5.2 (kernel version > 2.6.18-128.1.10.el5). I am getting error like > > Jun 24 09:26:54 alf2 kernel: (2095,0):o2net_connec

Re: [Ocfs2-users] issues with shared disk

2009-06-24 Thread Sunil Mushran
ocfs2 is a shared disk cluster file system. All nodes need to have access to the disk. Typical technologies involved are fiber channel and iscsi. If you are in a virtual environment, then you could present a local device as shared to multiple guests. On Jun 24, 2009, at 2:51 AM, "sheri...@ho

Re: [Ocfs2-users] ocfs2 quota support

2009-06-22 Thread Sunil Mushran
Senmiao Chen wrote: > Whenever I tried to mount an OCFS2 volume I got the following message, > "ocfs2_fill_super:1016 ERROR: User quotas were requested, but this > filesystem does not have the feature enabled." Just wonder how to enable > this feature. The ocfs2 wiki says "you'll need support in

Re: [Ocfs2-users] [Ocfs2-announce] OCFS2 1.4.2-1 and OCFS2 Tools 1.4.2-1 released

2009-06-18 Thread Sunil Mushran
Brian Kroth wrote: > Sunil Mushran 2009-06-16 16:38: > >> LOOKING AHEAD >> >> We are aiming to release OCFS2 1.6 later this year. This release will >> include the features that we have worked on over the past year. These are: >> >> 1. Extended Attrib

Re: [Ocfs2-users] OCFS2 Caused RAC server to crash

2009-06-17 Thread Sunil Mushran
Please file a bugzilla and _attach_ this oops trace. Also mention all the version numbers. On Jun 17, 2009, at 2:30 AM, "McDonald, Stuart" > wrote: Hi We have a two-node RAC cluster, which uses ASM for the database storage, but is using OCFS2 to mount a couple of file systems for a) th

[Ocfs2-users] OCFS2 1.4.2-1 and OCFS2 Tools 1.4.2-1 released

2009-06-16 Thread Sunil Mushran
All, We are pleased to announce the release of OCFS2 1.4.2-1 and OCFS2 Tools 1.4.2-1 for Oracle's and Red Hat's Enterprise Linux 5 Update 2 and higher. Oracle's Unbreakable Linux Network users who are subscribing to the "OCFS2 1.4 packages for Enterprise Linux 5" channel can upgrade to this relea

Re: [Ocfs2-users] OCFS2 1.4.1 DLM unhandled error

2009-06-16 Thread Sunil Mushran
Please file a bugzilla in oss.oracle.com/bugzilla. Saul Gabay wrote: > > We have a 2 node OCFS2 cluster running Oracle 10g, both nodes crashed. > > > > Node 1 because it panic running IOSTAT, the second node crashed with > this error message you can see below. > > > > I was hoping to see a ne

Re: [Ocfs2-users] mount.ocfs2: Transport endpoint is not connected while mounting

2009-06-11 Thread Sunil Mushran
ree working nodes in order to run /etc/init.d/o2cb offline on each one > followed by /etc/init.d/o2cb start? > > Best Regards > > John > > On Thu, 2009-06-11 at 07:33 -0700, Sunil Mushran wrote: > >> Add a rule to allow traffic on port (or whatever it is

Re: [Ocfs2-users] mount.ocfs2: Transport endpoint is not connected while mounting

2009-06-11 Thread Sunil Mushran
iranha which invokes > iptables. What should I add to iptables to enable interconnect > traffic? > > TIA > > John > > On Wed, 2009-06-10 at 10:18 -0700, Sunil Mushran wrote: >> ensure iptables is either off or has rules for the interconnect >> traffic. >>

Re: [Ocfs2-users] How to mount OCFS2 file systems using the EMC Power Path multipath device

2009-06-10 Thread Sunil Mushran
There is another scheme (less elegant but probably quicker to deploy) that was described in this list by a user. As root run blkid. Then edit /etc/blkid.tab and remove all sd devices that correspond to the emcpp devices. Ensure pp is enabled. Rerun blkid. This time you should see the pp devices in

Re: [Ocfs2-users] mount.ocfs2: Transport endpoint is not connected while mounting

2009-06-10 Thread Sunil Mushran
Ensure iptables is either off or has rules for the interconnect traffic. Use tcpdump to see if packets are coming thru. The connect request between two nodes is initiated when both of them first mount a common volume. Also, the connect request is always from the higher node to the low

Re: [Ocfs2-users] OCFS2 hosting and running binaries

2009-06-09 Thread Sunil Mushran
Sure. One can use ocfs2 to host almost anything. The one exception is the crs_home. crs_home needs to be on a local volume. OCFS2 1.2/1.4 has two limits. Like ext3, the number of sub-directories in _a_ directory cannot exceed 32000. (There is no limit to the number of subdirs in a volume.) The othe

Re: [Ocfs2-users] ocfs2 fencing with multipath and dual channel HBA

2009-06-08 Thread Sunil Mushran
florian.engelm...@bt.com wrote: > We tried to use ocfs2 with Vserver clustered with Heartbeat. But > Vservers need barrier=1. That did not work on our shared storage with > ocfs2 but I guess this is no ocfs2 problem it is a device mapper problem > because we need to use multipath and LVM2, isn't it

Re: [Ocfs2-users] ocfs2 in sles11 vs. sles10

2009-06-05 Thread Sunil Mushran
Are you sure you are mounting the same volume on both nodes? Do on both nodes: debugfs.ocfs2 -R "stats" /dev/sdX | grep UUID It should be the same. Ensure you don't have any local iscsi caching enabled. Bengtsson Anders wrote: > > I’m trying to mount a ocfs2 volume (created on sles11) on my sle

Re: [Ocfs2-users] Default Values of heartbeat dead threshold

2009-06-05 Thread Sunil Mushran
Actually, it is not complex. o2cb timeouts: If not using multipathing/netbonding, leave the timeouts as is. If using multipathing, double the disk heartbeat to 120 secs. If using netbonding, double the network idle to 60 secs. Ensure your private network has no loops to prevent spanning tree protoc
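
The disk heartbeat timeout is configured as a threshold count rather than in seconds. The sketch below shows the conversion, assuming the rule given in the OCFS2 FAQ (threshold = timeout in seconds / 2 + 1, with a heartbeat every 2 seconds); the function names are hypothetical and the 2-second interval is an assumption not stated in this message.

```python
# Assumption: o2cb writes a disk heartbeat every 2 seconds (per the OCFS2 FAQ).
HEARTBEAT_INTERVAL_SECS = 2


def threshold_for_timeout(timeout_secs):
    """Convert a disk heartbeat timeout in seconds to a heartbeat dead threshold."""
    return timeout_secs // HEARTBEAT_INTERVAL_SECS + 1


def timeout_for_threshold(threshold):
    """Convert a heartbeat dead threshold back to a timeout in seconds."""
    return (threshold - 1) * HEARTBEAT_INTERVAL_SECS


if __name__ == "__main__":
    for secs in (60, 120):
        print(secs, "secs -> threshold", threshold_for_timeout(secs))
```

Under that rule the 60 sec default corresponds to a threshold of 31, and doubling it to 120 secs for multipathing gives 61.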

Re: [Ocfs2-users] O2CB heartbeat not active on 2nd node

2009-06-03 Thread Sunil Mushran
> node: > ip_port = > ip_address = 192.168.0.217 > number = 1 > name = nyclx2 > cluster = tiaa > > cluster: > node_count = 2 > name = tiaa > > -Original Message- > From: Sunil Mushran [mailto:sunil.mush...@or

Re: [Ocfs2-users] O2CB heartbeat not active on 2nd node

2009-06-03 Thread Sunil Mushran
ed > Driver for "ocfs2_dlmfs": Loaded > Filesystem "ocfs2_dlmfs": Mounted > Checking O2CB cluster tiaa: Online > Heartbeat dead threshold = 31 > Network idle timeout: 3 > Network keepalive delay: 2000 > Network reconnect delay: 2000 > Checking O2CB

Re: [Ocfs2-users] O2CB heartbeat not active on 2nd node

2009-06-03 Thread Sunil Mushran
]# service iptables status > Firewall is stopped. > > -Original Message- > From: Sunil Mushran [mailto:sunil.mush...@oracle.com] > Sent: Wednesday, June 03, 2009 12:57 PM > To: McKinley, Reid > Cc: ocfs2-users@oss.oracle.com > Subject: Re: [Ocfs2-users] O2CB heart

Re: [Ocfs2-users] Problems mounting ocfs2 on 2 nodes

2009-06-03 Thread Sunil Mushran
Which cluster stack are you using? o2cb or pacemaker? Shaffin Bhanji wrote: > I have a 2 node cluster setup but am at a loss as to why I cannot > mount a shared ocfs2 filesystem on both nodes being shared by iSCSI? > Am I wrong in understanding that this can be achieved? > > I am using OpenAIS und

Re: [Ocfs2-users] ocfs2console is slow

2009-06-03 Thread Sunil Mushran
minues. Then, I can select "format". > > Let me know if you need further details. > Thanks, > Reid > > -Original Message- > From: Sunil Mushran [mailto:sunil.mush...@oracle.com] > Sent: Wednesday, June 03, 2009 12:48 PM > To: McKinley, Reid > Cc: o

Re: [Ocfs2-users] O2CB heartbeat not active on 2nd node

2009-06-03 Thread Sunil Mushran
The connect requests are not getting through. Do you have any firewalls setup? Is iptables running? If so, either shut it down or allow traffic on the o2cb port. McKinley, Reid wrote: > > We are having trouble getting the 2^nd node in our 2 node RAC > configuration to have an active O2CB heartbea

Re: [Ocfs2-users] ocfs2console is slow

2009-06-03 Thread Sunil Mushran
McKinley, Reid wrote: > No, "mounted.ocfs2 -d" comes back in less than 5 seconds. > > You said it is taking 30 mins to do any action. Can you expand on that? As in, possibly walk us through your steps.

Re: [Ocfs2-users] ocfs2console is slow

2009-06-03 Thread Sunil Mushran
McKinley, Reid wrote: > > Ever since we have installed OCFS2, we have had extremely slow > performance in the ocfs2console. It can take us over 30 minutes to do > the simplest tasks. > > > > We do not have this type of performance with other xwindows > applications on our server. > > > > D

Re: [Ocfs2-users] Fwd: fsck fails & volume mount fails, is my data lost?

2009-06-01 Thread Sunil Mushran
khaije rock wrote: > 90% of my way through the recovery and it turns out that each of the > more volume-wide rdump attemps were chocking at the same specific > point: a symlink pointing to a directory on a different filesystem > that then descended back into the ocfs volume. > > Looking like thi

Re: [Ocfs2-users] ocfs2 and file locking?

2009-06-01 Thread Sunil Mushran
jhonyl wrote: > Thanks for your reply. > > Good to know... time to upgrade the kernel. > > Is there a minimum required pacemaker or cman version too? > With pacemaker does it need heartbeat or openais for this? or either? > And if it maters what is the minimum version required for openais or > hea

Re: [Ocfs2-users] ocfs2 and file locking?

2009-06-01 Thread Sunil Mushran
jhonyl wrote: > If I will add pacemaker or cman package to my current OS, since > cluster fcntl is probably a new feature, is there a minimum version > number for fcntl to be supported? The support for clustered fcntl in ocfs2 was added in 2.6.27.

Re: [Ocfs2-users] ocfs2 and file locking?

2009-06-01 Thread Sunil Mushran
jhonyl wrote: > I am trying to figure out about locking under ocfs2. > > I read in the 1.4 ocfs2 pdf doc file that ocfs2 1.4 support flock but > not fcntl locks, and in a message, I read that ocfs2 rely on vfs for > fcntl, and I read something about being able to get fcntl locks but not > with o

Re: [Ocfs2-users] Fwd: fsck fails & volume mount fails, is my data lost?

2009-05-29 Thread Sunil Mushran
You are using ocfs2 atop lvm - a non-cluster-aware volume manager. A lot of things can go wrong in this combination. Quite a few have been reported on this forum. debugfs.ocfs2 has commands dump and rdump that allow users to read the files directly off the disk. Use them to recover your data. khai

Re: [Ocfs2-users] Cluster lockup when one node fails

2009-05-28 Thread Sunil Mushran
Kernel: 2.6.26-1-amd64 x86_64 > > modinfo ocfs2: > version:1.5.0 > description:OCFS2 1.5.0 > srcversion: B19D847BA86E871E41B7A64 > vermagic: 2.6.26-1-amd64 SMP mod_unload modversions > > ocfs2-tools: > Version: 1.4.1-1 > > Tia, > Kees Hoekzema

Re: [Ocfs2-users] Cluster lockup when one node fails

2009-05-27 Thread Sunil Mushran
kernel version, ocfs2 version? $ uname -a $ modinfo ocfs2 $ rpm -qa | grep ocfs2 Kees Hoekzema wrote: > Hello List, > > At the moment I'm running a 7-node ocfs2 cluster on a Dell MD3000i (iscsi) > NAS. This cluster has run fine for well over a year now, but recently one of > the older and more u

Re: [Ocfs2-users] Problem with OCFS2 on RHEL5.0 while installing CRS 10.2.01

2009-05-27 Thread Sunil Mushran
Refer to the section on oracle rdbms in the 1.4 user's guide. Specifically mount options. On May 27, 2009, at 4:55 AM, Devender Narula wrote: > > Hi team > > I had installed OCFS2 on RHEL5.0 . every thing looks fine but when I > was installing CRS on the node I got error message OCFS2 is n

Re: [Ocfs2-users] OCFS2 & Using the private interconnect with jumbo frames for heartbeat

2009-05-26 Thread Sunil Mushran
There are no known issues with using ocfs2 on a jumbo frame enabled private network. On May 26, 2009, at 8:38 AM, Sridhar Avantsa wrote: > When using OCFS2 in a Oracle RAC set, one would configure OCFS2 to > use the private interconnect address ( in cluster.conf). > Are there any known issue

Re: [Ocfs2-users] o2cb Configure problem

2009-05-25 Thread Sunil Mushran
> > ocfs2console-1.4.1-1.el5 > > ocfs2-tools-1.4.1-1.el5 > > [r...@eregtest2 software]# uname -a > > Linux eregtest2.admin.abdn.ac.uk 2.6.18-92.el5 #1 SMP Tue Apr 29 > 13:16:15 EDT 2008 x86_64 x86_64 x86_64 GNU/Linux > > > --- On Mon, 5/25/09, Sunil Mushran wr

Re: [Ocfs2-users] o2cb Configure problem

2009-05-25 Thread Sunil Mushran
The ocfs2 kernel driver is missing. Read the user's guide or the FAQ to learn how to install the driver. On May 25, 2009, at 8:16 AM, Devender Narula wrote: > > HI Team > > When ever i try to configure o2cb . i get below mention proble.. its > a production installation and your quick help

Re: [Ocfs2-users] cluster manager does not start/run ERROR: OemInit2: Attempting to open the CMDiskFile for a multi-node RAC

2009-05-24 Thread Sunil Mushran
Use sles or (rh)el. It won't work on other distros. On May 24, 2009, at 3:43 PM, sundar mahadevan wrote: > To add some more details to this issue: > > /u01/oradata/orcl/orcl/cmquorumfile mentioned in the error message is > accessible from the shell prompt. I even checked the file > $ORACLE_HOME/

Re: [Ocfs2-users] Filesystem corruption and OCFS2 errors

2009-05-21 Thread Sunil Mushran
Brian Kroth wrote: > That's what's always held me back from doing this as well. Will the > common stack be the openais stack (ie: the so called user stack), the > o2cb stack, or something completely different? Currently the o2cb userspace clusterstack is directed towards supporting the native clu

Re: [Ocfs2-users] [Fwd: Re: Unable to fix corrupt directories with fsck.ocfs2]

2009-05-20 Thread Sunil Mushran
Brian Kroth wrote: > Luis Freitas 2009-05-20 10:46: >>I am not aware of any filesystem that can withstand a online fsck. >>Sun ZFS can do online correction, but it doesnt have a fsck tool. > I hear btrfs will support this. It may be a feature that's easier to > accomplish with copy on wri

Re: [Ocfs2-users] Filesystem corruption and OCFS2 errors

2009-05-20 Thread Sunil Mushran
Christian van Barneveld wrote: > No, I don't have the full output, but I still have the snapshots that I've > made before teh FSCK. I've mounted it at a different server and ran a > (readonly) FSCK. See attached output. The output shows i/o errors. It is unable to read the blocks beyond a certai

Re: [Ocfs2-users] Filesystem corruption and OCFS2 errors

2009-05-20 Thread Sunil Mushran
Christian van Barneveld wrote: > Our OCFS2 cluster has been stable for approx 8 months, but since this week it > went wrong. First we had high load problems. We had this problem because a > couple of directories got filled with files, 1 directory over 1,5 milion > files (symlinks) and NFS (mount

Re: [Ocfs2-users] OCFS2 mount points will not automatically mount on boot

2009-05-20 Thread Sunil Mushran
McKinley, Reid wrote: > > We are trying to figure out why our OCFS2 mount points will not > automatically mount on reboot. > > We are on OEL version 2.6.18-92.el5 and we are using multipathing. > > Here are the OCFS2 entries in /etc/fstab. > > /dev/dm-2 /oracw oracle_clusterware datavolume,nointr,

Re: [Ocfs2-users] Debugging help / Guidance on architecture

2009-05-19 Thread Sunil Mushran
Damon Miller wrote: > The two servers are actually connected to the same switch. We are using > iptables for basic packet filtering on all of our hosts, but TCP/ is open > on all machines participating in the cluster. iSCSI is also enabled on > TCP/3260. Here are the relevant excerpts fro

Re: [Ocfs2-users] Debugging help / Guidance on architecture

2009-05-15 Thread Sunil Mushran
Damon Miller wrote: > We're running a 3-node OCFS2 1.2.9 cluster with a 5-TB iSCSI block device as > the backing store. All machines are running CentOS, with the iSCSI target > running CentOS 5.2 and the initiators running CentOS 4.7. The purpose of the > cluster is to evaluate alternatives to

Re: [Ocfs2-users] Unable to fix corrupt directories with fsck.ocfs2

2009-05-13 Thread Sunil Mushran
Did you run fsck with the force flag? $ fsck.ocfs2 -f /dev/sdX By default, fsck only replays the journals. Paul Taylor wrote: > Hi > > errors like the one listed below have been coming through in our logs on > a daily basis. We tried to run fsck.ocfs2 over the file system bet it > thinks that i

Re: [Ocfs2-users] Max number of files?

2009-05-12 Thread Sunil Mushran
Gavin Hamill wrote: > ocfs2 1.4 has a maximum of 32000 files in any single directory - we got > bitten by this bug recently. If you're talking about 5 million files, > then is there a possibility you've encountered this limit? Incorrect. The limit is 32000 sub-dirs in a directory. There is no sp
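Since the limit described above applies to sub-directories rather than files, one way to see how close a directory is would be to count only its immediate sub-directories. This is a minimal sketch; the function name is hypothetical and it assumes a find that supports -mindepth, -maxdepth and -print0 (GNU findutils does).

```shell
# Hypothetical helper: count the immediate sub-directories of a directory.
# Each match is emitted NUL-terminated and the NULs are counted, so names
# containing newlines do not skew the result.
count_subdirs() {
    find "$1" -mindepth 1 -maxdepth 1 -type d -print0 | tr -dc '\0' | wc -c | tr -d ' '
}

# Example (placeholder path):
#   count_subdirs /path/to/ocfs2/dir
```

Regular files and symlinks are not counted, which matches the distinction drawn in the reply.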

Re: [Ocfs2-users] df & du - that old chestnut

2009-05-07 Thread Sunil Mushran
http://oss.oracle.com/~smushran/.debug/scripts/stat_sysdir.sh Please file a bugzilla and attach the output of the above script. http://oss.oracle.com/bugzilla I think I know the issue. No, it is not related to blocksize/clustersize. Sunil Nigel Bishop wrote: > > Afternoon, > > We have an ocfs2

Re: [Ocfs2-users] ocfs2 vs ext3?

2009-04-30 Thread Sunil Mushran
Andrew (Anything) wrote: > I've been testing using bonnie++ -n 50:1024:0:10 -s 0. Is this a bad way to > test? > Some raw results follow later in case you want them. > > Obviously ocfs2 should be slower than ext3. > But I guess I expected a single node ocfs node to be only doing internal > stuff wit

Re: [Ocfs2-users] O2CB heartbeat: Not active

2009-04-29 Thread Sunil Mushran
pr 29 12:01:17 nyclx2 kernel: ocfs2: Unmounting device (8,0) on (node > 255) > > Thanks again, > Reid > > -Original Message- > From: Sunil Mushran [mailto:sunil.mush...@oracle.com] > Sent: Wednesday, April 29, 2009 4:32 PM > To: McKinley, Reid > Cc: ocfs2-users@oss.

Re: [Ocfs2-users] O2CB heartbeat: Not active

2009-04-29 Thread Sunil Mushran
What does "mounted.ocfs2 -d" say on both nodes? Not /var/log/dmesg. It is /var/log/messages. You could instead run "dmesg". This is important as it will tell you why the mount failed. Sunil McKinley, Reid wrote: > Thank you! > > Everything appears to be fine then, except that we cannot mount an
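To narrow the kernel log down to the lines that explain a failed mount, a small filter along these lines may help. It is only a sketch: the function name is made up, and the pattern list (ocfs2, o2net, o2hb, o2cb) is an assumption about which prefixes are relevant, not an official list.

```shell
# Hypothetical helper: keep only lines that mention OCFS2 or its cluster stack.
# The pattern list is an assumption and may need extending.
filter_ocfs2_lines() {
    grep -iE 'ocfs2|o2net|o2hb|o2cb'
}

# Typical use on the node where the mount failed
# (reading /var/log/messages usually needs root):
#   dmesg | filter_ocfs2_lines
#   filter_ocfs2_lines < /var/log/messages
```

Running it on both nodes right after the failed mount attempt keeps the relevant messages near the end of the output.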

Re: [Ocfs2-users] O2CB heartbeat: Not active

2009-04-28 Thread Sunil Mushran
McKinley, Reid wrote: > > We have installed OCFS2 1.4.1 and for some reason we can only get the > mount point mounted on 1 of 2 nodes. The 2nd node shows that the > heartbeat is not active. > > > > [r...@nyclx2 ~]# service o2cb status > > Driver for "configfs": Loaded > > Filesystem "configf

Re: [Ocfs2-users] Multiple clusters per node?

2009-04-28 Thread Sunil Mushran
ave mounted the particular filesystem? > > Søren > > On 28/04/2009, at 22.21, Sunil Mushran wrote: > >> Søren Kröger wrote: >>> I'm trying to split up our big OCFS2 filesystem into 3 separate >>> LUN's, since there are only a limited amount of nodes which

Re: [Ocfs2-users] Multiple clusters per node?

2009-04-28 Thread Sunil Mushran
Søren Kröger wrote: > I'm trying to split up our big OCFS2 filesystem into 3 separate LUN's, > since there are only a limited amount of nodes which need access to > the different parts of the OCFS2 filesystem. > One "Master" server with RW access should still be able to mount all 3 > OCFS2 LUN's

Re: [Ocfs2-users] Multiple clusters per node?

2009-04-27 Thread Sunil Mushran
While a node can only be in one cluster, it can be in many different dlm domains (or lock name spaces), each of which can have a different collection of nodes. But in the end, the sum total of all nodes in all domains will still be part of one cluster. What are you trying to do? Sunil Søren Kröger

Re: [Ocfs2-users] kernel panic - ocfs2-1.4.1 - redhat EL5 2.6.18-92.el5xen

2009-04-24 Thread Sunil Mushran
Please file a bugzilla. Add this stack trace to it. http://oss.oracle.com/bugzilla Also add any detail about your environment that you feel could be relevant. Size of cluster, number of mounts, etc. Thanks Sunil James Masson wrote: > Hi list, > > We've just had a kernel panic/reboot on one of ou

Re: [Ocfs2-users] BUG: soft lockup - CPU#1 stuck for 61s

2009-04-20 Thread Sunil Mushran
File a bugzilla (oss.oracle.com/bugzilla) for this issue. Attach the stack trace. Also attach the output of the following. $ find /lib/modules/`uname -r`/kernel/fs/ocfs2 -name \*.ko -exec objdump -DSl {} >/tmp/ocfs2.syms \; Sunil Konstantin Tikhonov wrote: > Hi, > I have a cluster with 5 nodes

Re: [Ocfs2-users] problem stopping o2cb service on one of nodes

2009-04-13 Thread Sunil Mushran
s version 95, which doesn't > create them for me. > Version 127 I used for tests already creates them correctly.. > BR > nik > > On Mon, Apr 06, 2009 at 12:08:34PM -0700, Sunil Mushran wrote: > >> AFAIK, this is not an issue on (rh)el4/sles9. It could be that

Re: [Ocfs2-users] Encountered disk I/O error 19502

2009-04-06 Thread Sunil Mushran
Yes. This issue has nothing to do with taf or asm. You are seeing transient EIOs during archiver writes. Follow Srini's suggestion of filing a SR and then pinging him. The other issue about instances altering the service names is unrelated. At least we should consider it unrelated as they have noth
