Re: [Ocfs2-users] [ocfs2-users] RedHat 4 Update 2

2006-02-16 Thread Sunil Mushran
We will be releasing one by tomorrow. Christophe JOBARD (GHH) wrote: Hi, Where can I get the RPMs of the OCFS2 software for the new Red Hat Enterprise 2.6.9-22.0.2 kernel (RH4 Update 2)? Many Thanks, Christophe JOBARD

Re: [Ocfs2-users] Problem with configuration file.

2006-02-20 Thread Sunil Mushran
oops... it'll be fixed today. Mathieu Avila wrote: Norbert Tretkowski wrote: * Mathieu Avila wrote: I must have missed something obvious, but I can't see what. Any ideas? You forgot indentation in the configuration file. Norbert Thank

Re: [Ocfs2-users] Add a new node to ocfs cluster

2006-03-13 Thread Sunil Mushran
the ocfs perspective? Best regards, Llorenç Vanaclocha -Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Saturday, March 11, 2006 0:29 To: Vanaclocha Llorens, Jose Lorenzo CC: ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] Add a new node

Re: [Ocfs2-users] nodes dont see eachother pls help!

2006-03-21 Thread Sunil Mushran
Is this a shared disk? Do: # echo stats | debugfs.ocfs2 -n /dev/sdX | grep UUID on all nodes Is the UUID the same? Oneill wrote: Hi! I am working on an Oracle cluster but I cannot get further because the ocfs2 nodes don't synchronize. I can create an ocfs2 filesystem on both machines if I want but they

Re: [Ocfs2-users] Getting eI am using RHLError when mounting shar ed OCFS2 device.

2006-03-30 Thread Sunil Mushran
Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Thursday, March 30, 2006 5:34 PM To: Vaidya, Sachin Cc: ''ocfs2-users@oss.oracle.com' ' Subject:Re: [Ocfs2-users] Getting eI am using RHLError when mountingshar ed OCFS2 device. Remove vip and mount

Re: [Ocfs2-users] understanding self fencing with ocfs2

2006-05-02 Thread Sunil Mushran
In a 2 node setup, if node 0 or 1 crashes, the other node should survive. The one issue encountered by many users was that while shutting down node 0, node 1 would fence itself. The latter was because of the sequencing of service shutdowns. We added the ocfs2 init script to handle shutdown sequencing.

Re: [Ocfs2-users] Node panic

2006-05-10 Thread Sunil Mushran
You may want to upgrade to 1.2.1. We have done fixes in this area. Jim Erb wrote: Can anyone tell me what might be happening here. I have a 3 node cluster running under RH AS 4 (2.6.9-22.0.1.ELsmp) with ocfs2 v. 1.2.0-1. I've recently implemented elevator=deadline in grub.conf to fix some

Re: [Ocfs2-users] OCFS2 hangs system on reboot

2006-05-16 Thread Sunil Mushran
ocfs2-tools includes two init scripts, o2cb and ocfs2. Ensure the scripts are active and running in the correct sequence. As in, the startup seq should be network, o2cb and then ocfs2. The shutdown is the reverse of that. [EMAIL PROTECTED] wrote: Anyone experience OCFS2 hanging the system on

Re: [Ocfs2-users] Disk-based DLM

2006-06-06 Thread Sunil Mushran
OCFS2 does not have a disk-based dlm. Net connectivity is a must. Leonardo de Assis wrote: Hi, I have two machines that do not have a network connection. If my disk can be shared between them, is there a way to use a disk-based dlm or any other method that does not rely on network access?

Re: [Ocfs2-users] RHEL 4 U2 / OCFS 1.2.1 weekly crash?

2006-06-09 Thread Sunil Mushran
The hb failure is just the effect of the ios not completing within 12 secs. The full oops trace gives the last 24 ops and their timings. One solution is to double up the hb timeout. Set, O2CB_HEARTBEAT_THRESHOLD = 14 Brian Long wrote: Hello, I have two nodes running the 2.6.9-22.0.2.ELsmp

Re: [Ocfs2-users] bug in /etc/init.d/o2cb?

2006-06-14 Thread Sunil Mushran
Yes, we are missing that bit. File a bug on http://oss.oracle.com/bugzilla component ocfs2-tools. [EMAIL PROTECTED] wrote: hi, maybe this is not the place to file a bug, but I think there is one in /etc/init.d/o2cb. the script should be used to create the config file

Re: [Ocfs2-users] Different versions of ocfs2 and Kernel

2006-06-14 Thread Sunil Mushran
Straße 29 01189 Dresden Phone: +49 (0) 351/4021 655 Fax: +49 (0) 351/4021 696 Mailto: [EMAIL PROTECTED] Web: www.robotron.de -Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Tuesday, June 13, 2006 18:14 To: Marco Friebe Cc: ocfs2-users

Re: [Ocfs2-users] change heartbeat threshold online?

2006-06-14 Thread Sunil Mushran
It's not a sysctl entry. It won't work that way. Set the required value in /etc/sysconfig/o2cb and restart the cluster. Do it on all nodes. [EMAIL PROTECTED] wrote: hi, I'm just thinking about changing the heartbeat threshold of our cluster online by issuing # echo 31

Re: [Ocfs2-users] kernel BUG at /rpmbuild/smushran/BUILD/ocfs2-1.2.1/fs/ocfs2/file.c:787!

2006-06-21 Thread Sunil Mushran
Check out http://oss.oracle.com/bugzilla/show_bug.cgi?id=723 Peter McMahon wrote: All still working on the use of OCFS2 Yesterday, when we were running autoconfig for an Apps DB node in a RAC cluster the other node crashed extract from /var/log/messages...is below... If anyone

Re: [Ocfs2-users] Error while Mounting

2006-06-26 Thread Sunil Mushran
Is it always the mount using node slot 1 that fails? If so, the jbd superblock may be corrupted for that slot. Grow the journal by, say, 1MB. It will reinitialize the JBD superblock for all the slots. Either that or just reformat the device. To see the size of the existing journal, do: # echo

Re: [Ocfs2-users] out of memory?

2006-06-29 Thread Sunil Mushran
I would like the entire /proc/meminfo and /proc/slabinfo. Dump it to a file every 1 min or so. What version of the kernel/ocfs2? Paul Jimenez wrote: On Jun 29, 2006, at 8:22 AM, Brian Long wrote: On Wed, 2006-06-28 at 17:03 -0500, Paul Jimenez wrote: I'm getting out of memory

Re: [Ocfs2-users] out of memory?

2006-06-29 Thread Sunil Mushran
://www.rgmadvisors.com/~pj/memslabinfo. Kernel is 2.6.16.7 vanilla, and the version of ocfs2 it came with. --pj On Jun 29, 2006, at 2:10 PM, Sunil Mushran wrote: I would like the entire /proc/meminfo and /proc/slabinfo. Dump it to a file every 1 min or so. What version of the kernel

Re: [Ocfs2-users] out of memory?

2006-07-05 Thread Sunil Mushran
of lowmem? will turning on HIGHPTE be enough to fix this? --pj On Jun 29, 2006, at 5:02 PM, Sunil Mushran wrote: HighFree: 11877028 kB LowFree:391020 kB HighFree: 11761892 kB LowFree:342380 kB HighFree: 11654316 kB LowFree:315860 kB HighFree

Re: [Ocfs2-users] What is wrong

2006-07-06 Thread Sunil Mushran
Before you can mount, you have to ensure all the nodes in the cluster access the same device. # echo stats | debugfs.ocfs2 -n /dev/sdX | grep UUID should return the same uuid from all nodes. Once all nodes can see the same device, then you can mount it on all nodes. There are no passive node(s).
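
The UUID check described above can be sketched as a small shell helper. This is only an illustration: all_uuids_match is a hypothetical name, not part of ocfs2-tools, and how the per-node output gets collected (ssh, copy and paste) is left open.

```shell
# Sketch only: decide whether every node reported the same volume UUID.
# Input on stdin: one line per node, as printed by
#   echo stats | debugfs.ocfs2 -n /dev/sdX | grep UUID
# all_uuids_match is a hypothetical helper name, not an ocfs2-tools command.
all_uuids_match() {
    # Strip whitespace, drop empty lines, then count the distinct values.
    distinct=$(tr -d ' \t' | awk 'NF' | sort -u | wc -l | tr -d ' ')
    # Succeed only if exactly one distinct UUID line was seen.
    [ "$distinct" -eq 1 ]
}
```

Feeding it the collected lines, for example `cat uuids.txt | all_uuids_match && echo same`, returns success only when every node sees the same volume; uuids.txt is a made-up file name.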

Re: [Ocfs2-users] Resizing OCFS2 Filesystems

2006-07-06 Thread Sunil Mushran
ocfs2-tools 1.2.2 will have the offline-extend feature. Still in testing. Karen Penman wrote: Hi All, Can anyone tell me if OCFS2 filesystems can be dynamically extended? If not, is this something that is likely to be available in the future? Thanks, Karen

Re: [Ocfs2-users] Unable to mount node2 mount.ocfs2: Transport endpoint is not connected while mounting /dev/sdb1 on /u02/oradata/orcl

2006-07-06 Thread Sunil Mushran
Check dmesg on both nodes. The error indicates that the connect failed. Ensure the ip addresses of all nodes in /etc/ocfs2/cluster.conf are correct. Also, that the conf file is the same on all nodes. Try pinging the other node on the configured interface: # ping -I ethX node1 Akin Seigmund

Re: [Ocfs2-users] [Ocfs2-announce] OCFS2 1.2.2 released

2006-07-18 Thread Sunil Mushran
ocfs2-tools 1.2.2 :) Brian Long wrote: On Fri, 2006-06-30 at 16:10 -0700, Sunil Mushran wrote: All, We are pleased to announce the release of OCFS2 1.2.2. This release includes some recent fixes, including bugzilla#723 http://oss.oracle.com/bugzilla/show_bug.cgi?id=723. (Users

Re: [Ocfs2-users] OCFS2 and Snapshots

2006-07-20 Thread Sunil Mushran
OCFS2 relies on the uniqueness of the uuid for it to distinguish between different volumes. One cannot mount two volumes having the same uuid on the same node. In fact, one should not do that across the cluster either, i.e., mount two different physical volumes having the same uuid. If you

Re: [Ocfs2-users] OCFS2 and Snapshots

2006-07-21 Thread Sunil Mushran
|| upd_vsize || upd_uuid) { block_signals(SIG_BLOCK); ret = ocfs2_write_super(fs); if (ret) { Sunil Mushran wrote: Please could you send it to me again in the diff -u -p format. Andre Brinkmann wrote: Sorry, here the patch as text: For the Makefile: 39c39 $(LINK

Re: [Ocfs2-users] OCFS2: Could not start cluster stack

2006-07-27 Thread Sunil Mushran
Check the support guide on cluster start/stop in the doc section on http://oss.oracle.com/projects/ocfs2. Vicki Luo wrote: I installed OCFS2 on RHEL4 with ocfs2-2.6.9-22.ELsmp-1.2.2-1.i686.rpm. When I start ocfs2console and click on Cluster, and then Configure Nodes, it returns a dialog

Re: [Ocfs2-users] Private Interconnect and self fencing

2006-07-28 Thread Sunil Mushran
-JPH Sunil Mushran wrote: The 12 sec default is low. Bump it up to 30 secs or even higher. FAQ has the details. The higher you set it to, the longer the brown-out time. Jeffery P. Humes wrote: I have an OCFS2 filesystem on a coraid AOE device. It mounts fine, but with heavy I/O the server

Re: [Ocfs2-users] ocfs2_search_chain: Group Descriptor has bad signature

2006-07-31 Thread Sunil Mushran
What version of ocfs2 is on the nodes? Do modinfo ocfs2 on all nodes. The version of OCFS2 shipped with SLES9 SP3 varies with kernel. Are you using the modules shipped by suse or building them yourself? Vladan Gunjic wrote: I've got a strange issue with the following configuration: Using

Re: [Ocfs2-users] Question

2006-08-01 Thread Sunil Mushran
Just create a one node cluster. However, if you were to mount two mirrored volumes on the same node, you will have problems as detailed in this thread: http://oss.oracle.com/pipermail/ocfs2-users/2006-July/000630.html Thanks to Andre, the next drop of ocfs2-tools will have a fix for this

Re: [Ocfs2-users] re: question on adding a node to RAC cluster and o2cb

2006-08-01 Thread Sunil Mushran
When you added the new node using ocfs2console, did it show up in: # ls /config/cluster/clustername/node/ I am assuming that it was added in /etc/ocfs2/cluster.conf. Yes, the docs do not cover this as of now. I will update the FAQ/user's guide with the info. Peter Santos wrote: -BEGIN

Re: [Ocfs2-users] re: question on adding a node to RAC cluster and o2cb

2006-08-01 Thread Sunil Mushran
restarted the cluster on node1. (transport endpoint errors..) We will definitely try again on a 3rd node, I'm just not clear on what the sequence of events should be. thanks peter Sunil Mushran wrote: When you added the new node using ocfs2console, did it show up in: # ls /config/cluster

Re: AW: [Ocfs2-users] ocfs2_search_chain: Group Descriptor has bad signature

2006-08-01 Thread Sunil Mushran
on ocfs2 version 1.2.1 ? Although they were not directly involved in corruption, maybe indirect ? Thanks, Vladan -Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Tuesday, August 1, 2006 04:29 To: Vladan Gunjic Cc: ocfs2-users@oss.oracle.com Subject: Re

Re: [Ocfs2-users] o2net: connect to node has been idle for 10 secs

2006-08-03 Thread Sunil Mushran
1. o2net talks tcp. It should be able to handle this. 2. If the cluster is active and the nodes are communicating, the keepalive packet is rarely sent. It only sends the packet if it does not hear from the other node for 5 secs. 3. Try the same with 1.2.3. (We made 2 important 1 line fixes.) 4.

Re: [Ocfs2-users] Re: Problems with OCFS2 and Oracle 10g

2006-08-04 Thread Sunil Mushran
ocfs2 requires a shared disk. As in, all nodes must be able to concurrently read/write to the device. sorapak Last wrote: Yes. my disk is an IDE. Would it cause the problems? Thanks Sorapak

Re: [Ocfs2-users] o2net: connect to node has been idle for 10 secs

2006-08-07 Thread Sunil Mushran
Alexei_Roudnev wrote: In my case, after spending a few days, I found that my HugeTLB setting (in Oracle) caused a long kernel loop and it forced OCFSv2 to reboot because of losing connection. I am keen to hear more about this. Please could you elaborate.

Re: [Ocfs2-users] Installing ocfs2-tools from source?

2006-08-14 Thread Sunil Mushran
Do, make rpm instead. Change Copyright to License in the spec file and do make rpm. I built the following for fc5/x86. http://oss.oracle.com/~smushran/.fc5-rpms/ Eric Adair wrote: building on fedora core 5, kernel 2.6.16.-1.2133.FC5smp Everything builds fine, but I can't find a means to make

Re: [Ocfs2-users] OCFS- EMC Issue

2006-08-14 Thread Sunil Mushran
# cd /tmp # wget http://oss.oracle.com/~smushran/.debug/stat_sysdir.sh # ./stat_sysdir -d sdX > sys.out Email me the output. amit pansare wrote: I’ve an issue related to Oracle 10g RAC. I’ve 2 node cluster each being Dell 2850 Server with RHEL 4.0 I’ve EMC CX300 SAN storage with following

Re: [Ocfs2-users] cfq scheduler?

2006-08-14 Thread Sunil Mushran
U4 has the fix. We've tested U2 (and U3) + fix internally already. So we don't feel the need to rerun the test for the same again. Brian Long wrote: Has anyone at Oracle tested the RHEL 4.4 beta or GA kernel to verify the cfq scheduler is fixed wrt. OCFS2? Or will that testing only begin now

Re: [Ocfs2-users] re: Process to change cluster.conf IPS ?

2006-08-16 Thread Sunil Mushran
where that IP may exist. - -peter Sunil Mushran wrote: http://oss.oracle.com/projects/ocfs2/dist/documentation/ocfs2_faq.html#CONFIGURE Peter Santos wrote: Folks, I have a simple 2 node 10gR2 RAC cluster. Each node has a public/private and virtual IP. We moved the network

Re: [Ocfs2-users] OCFS2 over DRBDv8

2006-08-17 Thread Sunil Mushran
As far as ocfs2 is concerned, bio_add_page() is failing. The one thing that springs to mind is that o2hb sets bio->bi_sector to 512 bytes and not the block size. Kilian CAVALOTTI wrote: Hi all, I'm new to OCFS2, but not so new to DRBD. I'd like to use the new primary/primary feature of DRBDv8

Re: [Ocfs2-users] Wrong dm device used

2006-08-29 Thread Sunil Mushran
Well, mounted.ocfs2 is dumb... as in, it just scans /proc/partitions. We have to teach it new tricks. :) Fabio Corazza wrote: Hi there, I've just setup an EVMS cluster with Heartbeat 2.0.7 and OCFS2. Everything seems to be working fine except this: [EMAIL PROTECTED] photos]# mounted.ocfs2 -d

Re: [Ocfs2-users] Wrong dm device used

2006-08-30 Thread Sunil Mushran
appreciated. Fabio Sunil Mushran wrote: Well, mounted.ocfs2 is dumb... as in, it just scans /proc/partitions. We have to teach it new tricks. :) Fabio Corazza wrote: Hi there, I've just setup an EVMS cluster with Heartbeat 2.0.7 and OCFS2. Everything seems to be working fine except

Re: [Ocfs2-users] self fencing and system panic problem after forced reboot

2006-09-15 Thread Sunil Mushran
). - Original Message - From: Holger Brueckner [EMAIL PROTECTED] To: Sunil Mushran [EMAIL PROTECTED] Cc: ocfs2-users@oss.oracle.com Sent: Friday, September 15, 2006 1:20 AM Subject: Re: [Ocfs2-users] self fencing and system panic problem after forced reboot i guess i found the solution. while dumping

Re: [Ocfs2-users] ocfs2 - disk usage inconsistencies

2006-09-20 Thread Sunil Mushran
-- 0 503 500 0 12-Aug-2006 10:40 0041a286 -Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Wednesday, September 20, 2006 12:32 PM To: Matthew Flusche Cc: Ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] ocfs2 - disk usage

Re: [Ocfs2-users] Use of OCFS2 file systems.

2006-09-29 Thread Sunil Mushran
Yes. Bill Wells wrote: All, Can someone comment on whether it is recommended to use the OCFS2 file system for the admin directories of a RAC database. Specifically, for bdump, udump, cdump, etc. This is being considered on RHEL4-U4 with 10gR2 on a 3 node cluster. Thanks much, Bill Wells

Re: [Ocfs2-users] Re: FW: Use of OCFS2 file systems.

2006-10-04 Thread Sunil Mushran
File a bug on bugzilla (oss.oracle.com/bugzilla) with the full oops trace and any other information that seems relevant. Galan Merchan, Martin wrote: Hello, I’m working with OCFS2 on Red Hat Advanced Server 4 Patch 3 and I had kernel panics too. I use OCFS2 only for RAC archive logs and RMAN

Re: [Ocfs2-users] Resizing mountpoint in ocfs2

2006-10-05 Thread Sunil Mushran
Yes, the last patch to add this feature is in review. We will release this as part of ocfs2-tools 1.2.2. Kerr-Sheppard, Stephen wrote: Has anyone had to resize a mountpoint in ocfs2. In ocfs version 1 it was a case of unmounting and using the resizeocfs command. Is this still the same for

Re: [Ocfs2-users] 2 Node cluster, and nodes OS hang

2006-10-06 Thread Sunil Mushran
tcpdump -i eth1 -C 10 -W 15 -s 1 -Sw /tmp/`hostname -s`_tcpdump.log -ttt 'port ' Do this on both nodes before mounting on the second node. Ping me with the path to the logs. [EMAIL PROTECTED] wrote: Hello All, I have a NAS that I would like to use ocfs2 on. Currently there are

Re: [Ocfs2-users] Getting Started with ocfs2

2006-10-11 Thread Sunil Mushran
Martin J. Evans wrote: fine but on selecting cluster/configure nodes I still get dialogue saying Could not query the state of the cluster stack. This must be resolved before any OCFS2 filesystem can be mounted. Could be because the script is installed as o2cb and not o2cb.init. Fedora

Re: [Ocfs2-users] out of memory... doing heavy IO on ocfs2 is wasting (low) memory?!

2006-10-11 Thread Sunil Mushran
Still in testing. It is a larger patch than normal and thus requires more time/effort. Once we are comfortable with it, we will look into releasing the patch for others to test before releasing 1.2.4. Jonah H. Harris wrote: What's the status on this? I've researched Bugzilla, SVN, and the

[Ocfs2-users] disk heartbeat timeout poll

2006-10-11 Thread Sunil Mushran
Thanks for all the replies in the previous usage poll. One of the chief concerns expressed was the (very) low default disk heartbeat timeout setting. Well, we want to bump it up but to what? Here are some qs the answers to which will help us determine that value. 1. What is your disk

Re: [Ocfs2-users] SUSE Patches

2006-10-20 Thread Sunil Mushran
Ping Novell. They issue interim PTF SLES kernels with the required fix(es) to help users tide over until the formal release. Needless to add, you need to have Novell Support. Andy Kipp wrote: Hello all, I am running SLES9 with the latest kernel patches (2.6.5-7.282-bigsmp) and ocfs2 version

Re: [Ocfs2-users] RHEL 4 hotfix RPMs?

2006-10-23 Thread Sunil Mushran
# ./configure --with-kernel=/usr/src/kernels/2.6.9-42.X.EL-smp-i686/ # make rhel4_2.6.9-42.X.EL_rpm The rpms will be in the rpmdir as specified in ~/.rpmmacros. ~$ cat .rpmmacros %_topdir /rpmbuild/user %_tmppath /rpmbuild/user/tmp %_sourcedir /rpmbuild/user/SOURCES %_specdir

Re: [Ocfs2-users] 1.2.2 dump issue

2006-10-25 Thread Sunil Mushran
As the ocfs2 home page suggests, when building 1.2.x against mainline 2.6.14 and above, specify GENERIC_DELETE_INODE_NOT_TRUNCATES=1. Peter Larsen wrote: I'm running 1.2.2 here - compiled from source, and while I can read files, trying to delete a file on my OCFS2 volume produces the following:

Re: [Ocfs2-users] lvm2 not cluster aware - okay, so how should I stripe my LUNs?

2006-10-25 Thread Sunil Mushran
Fabio Corazza wrote: Last but not least.. a question for Sunil if he's gonna read this.. when OCFS2 will support data-on-inode would we need to reformat the file systems or will the new module be compatible with the 1.4 on-disk data? I am envisioning a compat flag to be added on existing

Re: [Ocfs2-users] OCFS2 Fencing and Locking MSA500 Array: Help

2006-10-25 Thread Sunil Mushran
Oct 11 05:15:28 vhaispora01 kernel: cciss0: unsolicited abort f7000250 Oct 11 05:15:28 vhaispora01 kernel: cciss0: retrying f7000250 That's where the problem begins. The cciss driver is unable to complete the ios due to a bus reset maybe. Ping HP or whoever your contact is for the MSA500.

Re: [Ocfs2-users] BUG: unable to handle kernel NULL pointer dereference

2006-10-27 Thread Sunil Mushran
Please file a bugzilla with the details provided. It is easier to manage bugs that way. Thanks Christian Schlittchen wrote: Thanks to synchronous writes on the log-files I finally managed to get a log of the regular panics we experience. The setup is as follows: Three blades (IBM HS20)

Re: [Ocfs2-users] Unexpected reboot / crash

2006-10-27 Thread Sunil Mushran
The first issue could be because you don't have ocfs2-tools 1.2.2. The earlier version was missing a line in the ocfs2 init script. Rafal Maliszewski wrote: Hi guys I installed ocfs2 on 4 node (redhat 4u3) on shared FC devices ( EMC storage ). So I've noticed several problems: 1. When I

Re: [Ocfs2-users] Interesting Error

2006-10-30 Thread Sunil Mushran
Which version of OCFS2? Did you run fsck.ocfs2 -f on that device? Do: # echo stat 6518860 | debugfs.ocfs2 -n /dev/sdX > /tmp/ext.out Email ext.out. Andy Kipp wrote: Anybody have any idea what this error involves? Or how to resolve it? Oct 30 05:11:24 groupwise-1-mht kernel:

Re: [Ocfs2-users] ocfs2 error messages

2006-10-31 Thread Sunil Mushran
Are you using NFS by any chance? I am looking into bug#790 that also encounters the same error (ESTALE). Matthew Flusche wrote: I received the following error messages in the system logs. Is this anything to be concerned with? kernel: (4074,0):ocfs2_populate_inode:234 ERROR: Invalid

Re: [Ocfs2-users] Interesting Error

2006-10-31 Thread Sunil Mushran
Replace sdX with the device on which the ocfs2 fs exists. You can use mount | grep ocfs2 to find that volume. If the inode on disk is good, one explanation for the issue could be the lvb bug which was fixed in 1.2.2. Ping Novell to get a PTF kernel with ocfs2 1.2.3. Andy Kipp wrote: Which

Re: [Ocfs2-users] ocfs2 error messages

2006-10-31 Thread Sunil Mushran
So it is bug#790. It just may be a case of unnecessary error messages for you. I am still investigating it. Matthew Flusche wrote: Yes, one of the clustered file systems is shared with nfs. -Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 31

Re: [Ocfs2-users] Ocfs2 and low memory

2006-10-31 Thread Sunil Mushran
To monitor ocfs2 memory usage, do: # cat /proc/slabinfo | egrep 'ocfs|dlm|size-256 |size-32 ' ocfs2_lock16226 16 2261 : tunables 120 60 0 : slabdata 1 1 0 ocfs2_inode_cache 22 24 115231 : tunables 24 12 0 : slabdata 8
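
As a rough companion to the egrep above, the slab lines can be turned into a single byte count. This is a sketch under assumptions: it expects the usual /proc/slabinfo column order (name, active_objs, num_objs, objsize, ...), estimates memory as num_objs * objsize, and only counts caches whose name contains ocfs or dlm, not the generic size-256 and size-32 caches that the egrep also lists. slab_bytes is a hypothetical helper name.

```shell
# Sketch only: estimate the memory held by ocfs2/dlm slab caches.
# Reads /proc/slabinfo-style text on stdin and assumes the column order
#   name active_objs num_objs objsize ...
# Bytes are approximated as num_objs * objsize. slab_bytes is a made-up name.
slab_bytes() {
    awk '$1 ~ /ocfs|dlm/ { total += $3 * $4 }
         END { printf "%.0f\n", total }'
}
```

Something like `slab_bytes < /proc/slabinfo` (as root, where the file is readable) prints the estimate; logging it periodically shows whether these caches keep growing.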

Re: [Ocfs2-users] Newbie questions -- is OCFS2 what I even want?

2006-11-03 Thread Sunil Mushran
You are probably looking for a distributed file system. Check out afs and/or v9fs. Thad Beier wrote: Dear Sirs and Madams, I run a small visual effects production company, Hammerhead Productions. We'd like to have an easily extensible inexpensive relatively high-performance storage network

Re: [Ocfs2-users] about 2 nodes enviroment and metalink note 394827.1

2006-11-09 Thread Sunil Mushran
I would imagine you are using RHEL4. If so, upgrade the ocfs2-tools to 1.2.2. The previous version of the ocfs2 init script did not always umount ocfs2 volumes on clean shutdowns leading to this problem. [EMAIL PROTECTED] wrote: Hi to all: In 2 nodes environment I've 'suffered' the 'reboot 1st

Re: [Ocfs2-users] OCFS2 Block / Clustersize with Oracle 10gR2

2006-11-09 Thread Sunil Mushran
of 3,400 IO/sec while the same benchmark with the same data will max out at 7K+ IO/sec on RAW. I'll grab the iostat data which we've kept over time and try to make some sense of it before posting anything additional. Thanks. /Brian/ On Thu, 2006-11-09 at 10:20 -0800, Sunil Mushran wrote: Why

Re: [Ocfs2-users] frozen ocfs2 filesystem under heavy webserver load

2006-11-13 Thread Sunil Mushran
None of these locks are busy. So they should not be the cause of the problem. Start with the version of ocfs2. Also, which kernel? What does top say? Is some process spinning? Also, what does this stresstest entail? Stephan Hendl wrote: Hi, I use a cluster of 4 nodes with ocfs2 as a

Re: [Ocfs2-users] Ocfs2 errors on 3 node cluster

2006-11-14 Thread Sunil Mushran
It will be easier if you file a bug on oss.oracle.com/bugzilla with all the details. Like messages files from all nodes, etc. Why are you using 1.2.1? 1.2.3 has been out for a few months now. Randy Ramsdell wrote: Hi, Maybe someone could elaborate on these recurring ocfs2 errors that always

Re: [Ocfs2-users] Bad magic number in inode

2006-11-15 Thread Sunil Mushran
The quick detect just looks for the superblock which is in the third block of the device. The full detect looks up the superblock and then the system directory. In your case it fails to locate the latter. This is one of the quirks when using an unpartitioned disk and later partitioning it. The

Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 problem

2006-11-15 Thread Sunil Mushran
input is greatly appreciated. Thanks, Colin Farley Network Administrator E-Care Contact Center Services Phone:(204) 940-6244 Fax:(204) 940-7394 Sunil Mushran

Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 problem

2006-11-15 Thread Sunil Mushran
system). Well known problem with OCFSv2. One solution is to add a third node and use interface bonding (be sure that the interface convergence time is less than the o2cb timeout). - Original Message - From: [EMAIL PROTECTED] To: Sunil Mushran [EMAIL PROTECTED] Cc: ocfs2-users@oss.oracle.com Sent

Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 problem

2006-11-15 Thread Sunil Mushran
and everything will change. - Original Message - From: Sunil Mushran [EMAIL PROTECTED] To: Alexei_Roudnev [EMAIL PROTECTED] Cc: [EMAIL PROTECTED]; ocfs2-users@oss.oracle.com Sent: Wednesday, November 15, 2006 11:03 AM Subject: Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 problem You

Re: [Ocfs2-users] re: o2hb_write_timeout:270 ERROR: Heartbeat write timeout

2006-11-22 Thread Sunil Mushran
is this message getting 10 seconds from? Also this message is displayed because dbo2 was not able to check into the heartbeat filesystem right ? - -peter Sunil Mushran wrote: On nodes db01 and db03 hb timed-out at 17:12:49. However, the nodes did not fully panic. As in, the network

Re: [Ocfs2-users] Oracle 9i RAC on OCFS2

2006-11-27 Thread Sunil Mushran
Refer to CDSL (Context Dependent Symbolic Links) in the OCFS2 user's guide. Marcel Savelkoul wrote: Hi, I'm setting up a 2-node Oracle 9i RAC on OCFS2. But I have some problems with understanding how the shared Oracle_Home is being used. For instance there is the

Re: [Ocfs2-users] OCFS2 and berkeley database files

2006-12-05 Thread Sunil Mushran
You are on a very old release of OCFS2. The OCFS2 homepage and FAQ both list a SLES9 kernel version newer than the one you are using. But that may not be the reason for the error. My bet is that bdb is attempting to create a shared writeable mmap that ocfs2 1.2 does not support. [EMAIL

Re: [Ocfs2-users] Oracle Application Server 10.1.2.0.2 Install on OCFS2

2006-12-06 Thread Sunil Mushran
strace apache. That may provide us with some clues. [EMAIL PROTECTED] wrote: Hello all, Has anyone installed Oracle Application Server 10.1.2.0.2 Infrastructure tier including the preseeded 10.1.0.4 database (High Availability option otherwise known as a cold failover cluster) on OCFS2

Re: [Ocfs2-users] OCFS2 and berkeley database files

2006-12-06 Thread Sunil Mushran
ocfs2 supports private mmap r/w and shared mmap readonly. Shared mmap writeable is the only piece missing. We should have that by 1.4. Alexei_Roudnev wrote: There was a clear answer as to why it did not work on OCFSv2: - BerkeleyDB and LDAP use mmap on the files; - OCFSv2 doesn't implement it

Re: [Ocfs2-users] re: is it possible for the o2cb stack to monitor multiple clusternames on the same box

2006-12-20 Thread Sunil Mushran
, they will never be part of that domain. Sunil Mushran wrote: Currently it supports only one cluster. Peter Santos wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Folks, When I installed ocfs2 the first time and setup oracle to work with it, the clustername defaulted to ocfs2. We

Re: [Ocfs2-users] Problem installing OCFS 1.2.3

2007-01-04 Thread Sunil Mushran
depmod -a ? Lin Shen (lshen) wrote: Switched the kernel to 2.6.9-42.Elsmp, still got the same error. [EMAIL PROTECTED] Desktop]# uname -a Linux cfs2 2.6.9-42.ELsmp #1 SMP Wed Jul 12 23:27:17 EDT 2006 i686 i686 i386 GNU/Linux -Original Message- From: Sunil Mushran [mailto:[EMAIL

Re: [Ocfs2-users] Problem installing OCFS 1.2.3

2007-01-04 Thread Sunil Mushran
code is pretty well contained and isolated. while we have discussed tipc, not sure if we ever gave it a serious look. lin -Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Thursday, January 04, 2007 1:21 PM To: Lin Shen (lshen) Cc: ocfs2-users

Re: [Ocfs2-users] Problem installing OCFS 1.2.3

2007-01-04 Thread Sunil Mushran
theoretically yes... but for practical usage go with at least iscsi Lin Shen (lshen) wrote: So w/o shared disk, is it possible to make OCFS2 to work by utilizing GNBD or etc? lin -Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Thursday, January 04, 2007 2

Re: [Ocfs2-users] update on o2net_idle_timer

2007-01-04 Thread Sunil Mushran
That and also we've seen similar issues with Broadcom TG3 drivers. We use Intel E1000 mostly and thus did not experience the same issue. As far as the configurable net timeouts goes, the patch was added into mainline on Dec 4th. So it will be available with ocfs2 1.4. We are still seeing if we

Re: [Ocfs2-users] Kernel panic - not syncing: ocfs2 is very sorry

2007-01-05 Thread Sunil Mushran
Lot of ink has been spilled on this subject. ;) Check out the heartbeat section in the FAQ. One easy solution is to increase the hb timeout to 60 secs... O2CB_HEARTBEAT_THRESHOLD = 31 We are leaning towards making that number the default in the 1.4 release. George Liu wrote: Both systems
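
The pairing of a 60 sec timeout with a threshold of 31 fits the relation timeout = (threshold - 1) * 2 secs. Taking that relation as an assumption (the FAQ's heartbeat section is the authority), a small helper can do the arithmetic; hb_threshold is a hypothetical name, not part of the o2cb scripts.

```shell
# Sketch only: convert a desired disk heartbeat timeout in seconds into an
# O2CB_HEARTBEAT_THRESHOLD value, assuming timeout = (threshold - 1) * 2.
# hb_threshold is a made-up helper, not something shipped with ocfs2-tools.
hb_threshold() {
    echo $(( $1 / 2 + 1 ))
}
```

Calling `hb_threshold 60` gives the value quoted in the message above. As noted elsewhere in these threads, the value goes in /etc/sysconfig/o2cb on every node and takes effect after the cluster is restarted.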

Re: [Ocfs2-users] mount error

2007-01-09 Thread Sunil Mushran
You are using two different versions of ocfs2 on the two nodes. Different enough that they are not network compatible. It is working as designed. Consulente3 wrote: Hi, I'm new to ocfs2, and in my test environment, I have 2 nodes, becks and vaix. becks can mount the ocfs2 fs, but vaix can't.

Re: [Ocfs2-users] OCFS2 crash

2007-01-16 Thread Sunil Mushran
Looks to be running out of lowmem. # date # cat /proc/meminfo # cat /proc/slabinfo Run a script that dumps the above every 1 to 5 mins. That should help explain the cause. Brian Sieler wrote: Using 2-node clustered file system on DELL/EMC SAN/RHEL 2.6.9-34.0.2.ELsmp x86_64. Config:
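
A minimal sketch of the kind of script suggested above, assuming a plain append-to-a-log approach. snapshot_to and the /tmp/memdump.log path are made up for illustration, and reading /proc/slabinfo may require root.

```shell
# Sketch only: append a timestamp plus the contents of the given files to a log.
# snapshot_to is a hypothetical helper name.
# usage: snapshot_to LOGFILE FILE...
snapshot_to() {
    log=$1
    shift
    {
        date
        for f in "$@"; do
            echo "== $f =="
            cat "$f"
        done
    } >> "$log"
}

# Example loop, commented out so that sourcing this block has no side effects.
# The log path is an assumption; pick an interval between 1 and 5 minutes.
# while true; do
#     snapshot_to /tmp/memdump.log /proc/meminfo /proc/slabinfo
#     sleep 60
# done
```

Keeping the timestamp next to each dump makes it possible to line the memory figures up with the time of the crash.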

[Ocfs2-users] ocfs2-1.2.4 RC2 released

2007-01-17 Thread Sunil Mushran
All, http://oss.oracle.com/~smushran/.ocfs2-1.2.4-0.2/ The final 1.2.4 should look very close to this drop. We still have one slippery issue open that we are working on. But, other than that, this drop is looking good. The list of patches added post 1.2.4-0.1 is as follows: r2948: fs - Allow

Re: [Ocfs2-users] ocfs2 keeps fencing all my nodes

2007-01-18 Thread Sunil Mushran
1. In SLES10, the /config has been moved to /sys/kernel/config. That's how it is on mainline. 2. To monitor heartbeat do: # watch -d -n2 debugfs.ocfs2 -R hb /dev/sdX This command will work if you have ocfs2-tools 1.2.2. (Not sure whether sles10 ships with 1.2.2 or 1.2.1.) If 1.2.1, do: # watch

Re: [Ocfs2-users] ocfs2_cdsl_follow_link errors

2007-01-22 Thread Sunil Mushran
#define EACCES 13 /* Permission denied */ The messages are harmless. Patch to silence them has already been checked into the 1.2 repo and mainline git. Matthew Flusche wrote: I’m seeing the following errors in my two node cluster. Is this anything to be concerned with? Host information:

Re: [Ocfs2-users] kernel panic - not syncing

2007-01-22 Thread Sunil Mushran
o2net timeout cannot cause the o2hb panic. The two are totally different. From the outputs, I would guess o2hb is timing out but I cannot say for sure until I see the full logs. Andy Phillips wrote: It's worth pointing out that the o2net idle timer is triggering on the network heartbeat,

Re: [Ocfs2-users] kernel panic - not syncing

2007-01-22 Thread Sunil Mushran
:38 -0800, Sunil Mushran wrote: o2net timeout cannot cause the o2hb panic. The two are totally different. From the outputs, I would guess o2hb is timing out but I cannot say for sure until I see the full logs. Andy Phillips wrote: It's worth pointing out that the o2net idle timer

Re: [Ocfs2-users] ocfs2 kernel bug in Fedora Core 4 update kernel

2007-01-23 Thread Sunil Mushran
This was the lvb issue that was fixed long ago. In the 1.2 tree, it was fixed in 1.2.2. 2.6.18 should definitely have the fix for this. davide rossetti wrote: OS: Fedora Core release 4 (Stentz) KERNEL: Linux rack1.ape 2.6.17-1.2142_FC4smp #1 SMP Tue Jul 11 22:57:02 EDT 2006 i686 i686 i386

Re: [Ocfs2-users] ocfs2 kernel bug in Fedora Core 4 update kernel

2007-01-23 Thread Sunil Mushran
wrote: On 1/23/07, Sunil Mushran [EMAIL PROTECTED] wrote: This was the lvb issue that was fixed long ago. In the 1.2 tree, it was fixed in 1.2.2. 2.6.18 should definitely have the fix for this. it seems it's even more recent: /var/log/messages.4:Dec

Re: [Ocfs2-users] unable to configure O2CB_HEARTBEAT_THRESHOLD

2007-01-24 Thread Sunil Mushran
The o2cb script fix is in ocfs2-tools 1.2.2 released Oct 2006. Ping SUSE for the update. [EMAIL PROTECTED] wrote: Using SuSE SP2 Linux running V1.0.8 of OCFS2 and the tools/console that comes with SP2 distribution. I am unable to set the O2CB_HEARTBEAT_THRESHOLD parameter in the

Re: [Ocfs2-users] ocfs2 kernel bug in Fedora Core 4 update kernel

2007-01-24 Thread Sunil Mushran
This is not a fs issue. As in, the file must be all right. This is a dlm issue. The fs is asking the dlm to free the lock and the dlm is stuck. How many nodes do you have? We've fixed a bunch of dlm bugs since what you appear to be running. davide rossetti wrote: I rebooted the two faulty nodes.

[Ocfs2-users] OCFS2 1.2.4-2 released

2007-02-02 Thread Sunil Mushran
All, We are pleased to announce the release of OCFS2 1.2.4-2. This release addresses the lowmem consumption issue that has plagued many users. It also addresses a few races in the dlm relating to lockres migration. The complete list of changes post 1.2.3 is available here:

Re: [Ocfs2-users] OCFS2 mount problem

2007-02-05 Thread Sunil Mushran
It could be that the device name is not the same across the two nodes. Do: # mounted.ocfs2 -d on both nodes. Match the device using the uuid. As in, you should see a device with the same uuid on both nodes. If not, then the device is not shared. If you do see the device on both nodes but with
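
A minimal sketch of the UUID matching described above, assuming the output of `mounted.ocfs2 -d` from each node has been saved to a file and that it has a header line followed by whitespace-separated Device, FS, UUID, and Label columns. The column layout and the function name `shared_uuids` are assumptions, not from the original message.

```shell
#!/bin/sh
# Print every UUID that appears in both saved listings, followed by the
# device name it has on node 1 and on node 2.
# Usage: shared_uuids NODE1_LISTING NODE2_LISTING
shared_uuids() {
    awk '
        FNR == 1     { next }                    # skip each header line
        NR == FNR    { seen[$3] = $1; next }     # first file: UUID -> device
        ($3 in seen) { print $3, seen[$3], $1 }  # second file: report matches
    ' "$1" "$2"
}
```

For example, after running `mounted.ocfs2 -d > node1.txt` on one node and `mounted.ocfs2 -d > node2.txt` on the other, `shared_uuids node1.txt node2.txt` lists the volumes both nodes can see. No output would suggest the device is not shared; a matching UUID with different device names shows that the name differs across nodes.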

Re: [Ocfs2-users] OCFS2 mount problem

2007-02-05 Thread Sunil Mushran
The device needs to be shared. As in, both nodes need to be able to see the same device concurrently. Refer to iscsi, fiber channel, aoe, etc. aibolit 66 wrote: -Original Message- From: Sunil Mushran [EMAIL PROTECTED] To: aibolit 66 [EMAIL PROTECTED] Date: Mon, 05 Feb 2007 12:46:26

Re: [Ocfs2-users] OCFS2 1.2.4-2 released

2007-02-06 Thread Sunil Mushran
That's the source. Randy Ramsdell wrote: Mark Fasheh wrote: On Tue, Feb 06, 2007 at 10:18:51AM -0500, Randy Ramsdell wrote: Is source available? http://oss.oracle.com/projects/ocfs2/dist/files/source/v1.2/ocfs2-1.2.4.tar.gz --Mark -- Mark Fasheh Senior

Re: [Ocfs2-users] ocfs2-tools-1.2.2 compile.

2007-02-06 Thread Sunil Mushran
The following patch will address this issue. The fix will be provided with the next tools release. Index: libocfs2/include/ocfs2.h === --- libocfs2/include/ocfs2.h(revision 1269) +++ libocfs2/include/ocfs2.h(revision 1270)

Re: [Ocfs2-users] 1.3.3 mount problem

2007-02-07 Thread Sunil Mushran
The datavolume code is not in mainline. But you should be able to get Oracle RDBMS to work with it. Ensure the init.ora parameter filesystemio_options is set to directIO. Ivo Maya wrote: Hi, I need to mount ocfs2 with datavolume option on open SuSE 10.2 Machines. ocfs2 is 1.3.3 version and

Re: [Ocfs2-users] 1.2.4 symbols

2007-02-09 Thread Sunil Mushran
What does dmesg say? Randy Ramsdell wrote: Hi, Everything compiled correctly for the ocfs2 package, but so far the modules will not load with the well known module symbol error. FATAL: Error inserting ocfs2 (/lib/modules/2.6.16.27-0.6-smp/kernel/fs/ocfs2/ocfs2.ko): Unknown symbol in module,
