Re: [Ocfs2-users] RedHat Node Panic Weekly

2006-08-16 Thread Sunil Mushran
This specific problem was fixed in 1.2.2. The latest release is 1.2.3. Upgrade to 1.2.3. Jim Erb wrote: See earlier post - May 10th "Node Panic" Can anyone tell me what might be happening here? I have a 3 node cluster running under RH AS 4 (2.6.9-34.ELsmp) with ocfs2 v. 1.2.1. I've upgraded t

Re: [Ocfs2-users] Heartbeat and OCFS2

2006-08-17 Thread Sunil Mushran
ocfs2 currently bundles its own heartbeat (o2hb) and does not depend on any external package. Milind Dumbare wrote: Hi all, Is it necessary to have heartbeat (/etc/init.x/heartbeat) service working for mounting and unmounting ocfs2 volumes?

Re: [Ocfs2-users] OCFS2 over DRBDv8

2006-08-17 Thread Sunil Mushran
As far as ocfs2 is concerned, bio_add_page() is failing. The one thing that springs to mind is that o2hb sets bio->bi_sector to 512 bytes and not the block size. Kilian CAVALOTTI wrote: Hi all, I'm new to OCFS2, but not so new to DRBD. I'd like to use the new primary/primary feature of DRBDv8

Re: [Ocfs2-users] ocfs2 and device-mapper

2006-08-18 Thread Sunil Mushran
Definitely increase the hb timeout (O2CB_HEARTBEAT_THRESHOLD). http://oss.oracle.com/projects/ocfs2/dist/documentation/ocfs2_faq.html#HEARTBEAT That's for starters. To figure out why it dies, I will need to see the oops dump. But my hunch is that it is hb related and that upping the default time

Re: [Ocfs2-users] Heartbeat and OCFS2

2006-08-18 Thread Sunil Mushran
Which distro are you using? If you are on SLES10, it allows you to choose the cluster stack. Milind Dumbare wrote: So is there any way to stop OCFS2's heartbeat? Could there be any problems due to existence of both OCFS's and native heartbeat? I experienced some problems with these. I tried O

Re: [Ocfs2-users] Postfix cluster with OCFS2 mboxes?

2006-08-21 Thread Sunil Mushran
That error message is not only harmless but was silenced in OCFS2 1.2.1. Means you are running a fairly old release. Robert Edmonds wrote: Hi, I'd appreciate hearing from anyone who is running a Postfix cluster with shared mboxes mounted on an OCFS2 volume. (It looks like directory entries are

Re: [Ocfs2-users] ocfs2-users@oss.oracle.com

2006-08-21 Thread Sunil Mushran
You are running 2.6.5-7.244-smp. Upgrade to 2.6.5-7.257 at least. Kai Nielsen wrote: Hi! Is this the place to report/discuss ocfs2 related kernel bugs? We just had some file system corruption on a two node SLES9SP3 cluster with shared ocfs2 filesystem. /var/log/messages told me: Aug 17 15:56

Re: [Ocfs2-users] re: question on adding a node to RAC cluster and o2cb

2006-08-21 Thread Sunil Mushran
y it didn't like node number 2 .. Any ideas ? BTW, this time I was able to mount the ocfs2 filesystem without a problem... this was the whole reason for this thread.. previously I had tried to add a node with ocfs2console and propagation .. but it didn't work until I shutdown o2cb. -

Re: [Ocfs2-users] re: question on adding a node to RAC cluster and o2cb

2006-08-21 Thread Sunil Mushran
lem... ? I've since changed the virtual IP - -peter Sunil Mushran wrote: Check the contents of /etc/ocfs2/cluster.conf. Are you sure there is no node number 2 in it? Else email me a copy of the config file to see if I can duplicate it in house. BTW, the directions for adding nodes

Re: [Ocfs2-users] out of memory... doing heavy IO on ocfs2 is wasting (low) memory?!

2006-08-23 Thread Sunil Mushran
The patch fix is in testing. We are aiming to release 1.2.4 sometime late September. Peter McMahon wrote: Kurt we are facing the exact same problem. We use OCFS2 ocfs2-2.6.9-34.ELsmp-1.2.3-1 for a SHARED APPL_TOP in an 11i env. We hit the problem when trying to do backups! Any timeframe on t

Re: [Ocfs2-users] Wrong dm device used

2006-08-29 Thread Sunil Mushran
Well, mounted.ocfs2 is dumb... as in, it just scans /proc/partitions. We have to teach it new tricks. :) Fabio Corazza wrote: Hi there, I've just setup an EVMS cluster with Heartbeat 2.0.7 and OCFS2. Everything seems to be working fine except this: [EMAIL PROTECTED] photos]# mounted.ocfs2 -d

Re: [Ocfs2-users] Wrong dm device used

2006-08-30 Thread Sunil Mushran
ut journal-options? Thanks for your attention, highly appreciated. Fabio Sunil Mushran wrote: Well, mounted.ocfs2 is dumb... as in, it just scans /proc/partitions. We have to teach it new tricks. :) Fabio Corazza wrote: Hi there, I've just setup an EVMS cluster with Heartbeat

Re: [Ocfs2-users] nbd RAID and ocfs2

2006-09-11 Thread Sunil Mushran
As far as ocfs2 is concerned, you can use debugfs.ocfs2 to check whether you are seeing the same "physical device" on the two nodes. # echo "stats" | debugfs.ocfs2 -n /dev/sdX Compare the UUIDs. # echo "ls -l /path" | debugfs.ocfs2 -n /dev/sdX List some dir. Carsten Hensiek wrote: Hi there,

Re: [Ocfs2-users] self fencing and system panic problem after forced reboot

2006-09-14 Thread Sunil Mushran
Not sure why a power outage should cause this. Do you have the full stack of the oops? It will show the times taken in the last 24 operations in the hb thread. That should tell us as to what is up. Holger Brueckner wrote: i just discovered the ls, cd, dump and rdump commands in debugfs.ocfs2.

Re: [Ocfs2-users] self fencing and system panic problem after forced reboot

2006-09-15 Thread Sunil Mushran
. Bernd -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Holger Brueckner Sent: Friday, September 15, 2006 10:21 AM To: Sunil Mushran Cc: ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] self fencing and system panic problem after forced reboot i guess i

Re: [Ocfs2-users] self fencing and system panic problem after forced reboot

2006-09-15 Thread Sunil Mushran
fore going thru). - Original Message - From: "Holger Brueckner" <[EMAIL PROTECTED]> To: "Sunil Mushran" <[EMAIL PROTECTED]> Cc: Sent: Friday, September 15, 2006 1:20 AM Subject: Re: [Ocfs2-users] self fencing and system panic problem after forced reboot i guess i fo

Re: [Ocfs2-users] ocfs2 - disk usage inconsistencies

2006-09-20 Thread Sunil Mushran
Did you remove some large files recently? If so, check the orphan_dir and truncate_log for all the slots. 1. Start debugfs: # debugfs.ocfs2 /dev/sdX 2. List system directory: > ls -l // 3. List files in all orphan_dir(s): > ls -l //orphan_dir: If there are files, means some process in the

Re: [Ocfs2-users] ocfs2 - disk usage inconsistencies

2006-09-20 Thread Sunil Mushran
.. 4301446 -rw-r--r-- 0 503 500 0 12-Aug-2006 10:40 0041a286 -----Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Wednesday, September 20, 2006 12:32 PM To: Matthew Flusche Cc: Ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users]

Re: [Ocfs2-users] ocfs2 - disk usage inconsistencies

2006-09-20 Thread Sunil Mushran
would be to stop every process accessing this file system? Possibly umount/mount the file system? would fsck.ocfs2 clean this up? -Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Wednesday, September 20, 2006 1:52 PM To: Matthew Flusche Cc: Ocfs2-users@oss.oracl

Re: [Ocfs2-users] Error mounting ocfs2 partition...

2006-09-20 Thread Sunil Mushran
0x83 is ok. Why is "node_count = 2"? Shouldn't it be 1? Fix that, restart the o2cb cluster and do the mount. Let us know how it goes. Jeff Vincent wrote: Hi, I'm new to this list and to the ocfs2 system and to clustering in general (3 strikes?--ack!). I have some problems with my first atte

Re: [Ocfs2-users] ocfs2 fencing on reboot of 2nd node

2006-09-21 Thread Sunil Mushran
What is your O2CB_HEARTBEAT_THRESHOLD set to? For more, refer to: http://oss.oracle.com/projects/ocfs2/dist/documentation/ocfs2_faq.html#HEARTBEAT [EMAIL PROTECTED] wrote: I'm performing some testing with ocfs2 on 2 nodes with Red Hat AS4 Update 4 (x86_64) and (multipath included in the 2.6 kern

Re: [Ocfs2-users] ocfs2 mounted: now what?

2006-09-22 Thread Sunil Mushran
Yes, though I am not too sure about DRBD. I would look at iSCSI at least. Fabio Corazza wrote: Andreas Hasenack wrote: Ok, I guess I need some sort of SAN which ocfs2 doesn't provide. I was confused by the competition's GNBD feature (from gfs). You need the same block device exported t

Re: [Ocfs2-users] ocfs2 fencing on reboot of 2nd node

2006-09-22 Thread Sunil Mushran
s or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from any computer. *Sunil Mushran <[EMAIL PROTECTED]>* Sent by: [EMAIL PROTECTED] 09/21/2006 08:04 PM To [EMAIL PRO

Re: [Ocfs2-users] ocfs2 - disk usage inconsistencies

2006-09-25 Thread Sunil Mushran
24, 2006 3:06 PM To: Sunil Mushran Cc: Ocfs2-users@oss.oracle.com Subject: RE: [Ocfs2-users] ocfs2 - disk usage inconsistencies I unmounted the file system from each node and ran the fsck.ocfs2 on it. That cleaned up the orphan_dirs but I did not get my 17 GB of space back. Any other suggestions?

Re: [Ocfs2-users] ocfs2 - disk usage inconsistencies

2006-09-25 Thread Sunil Mushran
ew Flusche wrote: I unmounted the file system from each node and ran the fsck.ocfs2 on it. That cleaned up the orphan_dirs but I did not get my 17 GB of space back. Any other suggestions? Am I going to have to backup,format,restore to fix this? -Original Message----- From: Sunil Mushr

Re: [Ocfs2-users] ocfs2 fencing on reboot of 2nd node

2006-09-25 Thread Sunil Mushran
2006 06:14 PM To "Sunil Mushran" <[EMAIL PROTECTED]>, <[EMAIL PROTECTED]> cc [EMAIL PROTECTED], ocfs2-users@oss.oracle.com Subject Re: [Ocfs2-users] ocfs2 fencing on reboot of 2nd node Looks as you have 2 hosts on a SINGLE SCSI cont

Re: [Ocfs2-users] ocfs2 - disk usage inconsistencies

2006-09-25 Thread Sunil Mushran
0 2-May-2006 16:11 truncate_log:0002 39 -rw-r--r-- 1 0 0 0 2-May-2006 16:11 truncate_log:0003 debugfs: -Original Message----- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Monday, September 25, 2006 5:09 PM To: Matthew Flusche

Re: [Ocfs2-users] ocfs2 on mailserver lock up

2006-09-25 Thread Sunil Mushran
You may want to ping Novell to get the 1.2.3 drop of OCFS2. That's because it is the latest and greatest. Having said that we'll need more information. As in, what syscall did strace show as taking time. What is the memory usage like? cat /proc/meminfo, cat /proc/slabinfo That is, under productio

Re: [Ocfs2-users] ocfs2 on mailserver lock up

2006-09-26 Thread Sunil Mushran
I will need at least "strace -tt -T" and /proc/meminfo, /proc/slabinfo to proceed. write() could be slow for a lot of reasons. [EMAIL PROTECTED] wrote: On 9/26/06, *Sunil Mushran* <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>> wrote: You may want to ping Novell to

Re: [Ocfs2-users] Initramfs environment, o2cb prereq's

2006-09-26 Thread Sunil Mushran
Christopher Hawkins wrote: Hello, I am trying to start an ocfs2 cluster and mount a filesystem in an initramfs environment. Networking is up, hostname set, and the o2cb init script runs with no errors. Now the odd behavior starts... The mount of the ocfs2 filesystem fails, and I get dropped in to

Re: [Ocfs2-users] How do I remove old devices

2006-09-28 Thread Sunil Mushran
ocfs2console just scans the partitions in /proc/partitions and looks for the superblock on the 3rd block. Easiest would be to use bvi (or any binary editor), search for the signature OCFSV2, and change it. # bvi -s 10240 /dev/sda1 /OCFSV2 Philippe Andries wrote: Hi, How do I remove 'ghost de
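The signature check described in the reply above can also be scripted. What follows is a minimal read-only sketch in Python, not the logic ocfs2console itself uses. It assumes the superblock sits in the third block (block number 2), that the signature OCFSV2 is at the very start of that block, and that the block size is one of 512, 1K, 2K or 4K. The function name `find_ocfs2_signatures` is hypothetical.

```python
SIGNATURE = b"OCFSV2"
# Assumed candidate block sizes; the third block starts at 2 * block size.
BLOCK_SIZES = (512, 1024, 2048, 4096)


def find_ocfs2_signatures(path):
    """Return the byte offsets at which SIGNATURE starts the third block.

    Opens the device or image read-only and changes nothing on disk.
    """
    hits = []
    with open(path, "rb") as dev:
        for block_size in BLOCK_SIZES:
            offset = 2 * block_size
            dev.seek(offset)
            if dev.read(len(SIGNATURE)) == SIGNATURE:
                hits.append(offset)
    return hits
```

An empty result suggests the device would no longer be listed as an OCFS2 volume under these assumptions; any offset returned is where a binary editor would find the signature to overwrite.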

Re: [Ocfs2-users] How do I remove old devices

2006-09-29 Thread Sunil Mushran
bvi-1.3.2.src.tar.gz > gunzip: bvi-1.3.2.src.tar.gz: not in gzip format > > So I am stuck. :-( > > Any other idea welcomed. Another binary editor (could not find any > on redhat)? Another solution? > > Thanks in advance, > > Philippe Andries > > > -O

Re: [Ocfs2-users] Use of OCFS2 file systems.

2006-09-29 Thread Sunil Mushran
Yes. Bill Wells wrote: All, Can someone comment on whether it is recommended to use the OCFS2 file system for the admin directories of a RAC database. Specifically, for bdump, udump, cdump, etc. This is being considered on RHEL4-U4 with 10gR2 on a 3 node cluster. Thanks much, Bill Wells -

[Ocfs2-users] informal headcount

2006-09-29 Thread Sunil Mushran
uture releases. If you are uncomfortable posting this information to a mailing list, you can email me privately. Thank you Sunil Mushran 1. In how many Clusters are you using OCFS2? 2. How many Nodes (approx) do the Clusters have? 3. How many

Re: [Ocfs2-users] re problem with cluster

2006-09-29 Thread Sunil Mushran
From the logs, you appear to be running < 1.2.1 release of ocfs2. Upgrade to 1.2.3. Andrew Brunton wrote: Hi Guys, I’ve got a different error this time with ocfs2, or is this the same thing ? I’ve also noticed a few kernel: mbox_read: Bad State which looks like its something to do with the

Re: [Ocfs2-users] ocfs2_follow_link error

2006-10-02 Thread Sunil Mushran
The errors are harmless. The dangling symlink (ENOENT -2) has been silenced in mainline and ocfs2 1.2.3. Maybe we should silence ELOOP (-40) too. davide rossetti wrote: dear all, Today I realized I had such stuff in my logs: (2502,0):__dlm_print_nodes:377 Nodes in my domain ("41AE1AA4C5534E5

[Ocfs2-users] Re: FW: Use of OCFS2 file systems.

2006-10-03 Thread Sunil Mushran
://oss.oracle.com/osswiki/OCFS2. If you wish to contribute code, as this is an open source project, feel free to ping me or the ocfs2-devel@oss.oracle.com mailing list. Thanks Sunil Mushran Hi Sunial, What are your thoughts about this message on the mailing lists? Thanks! Sanjeet

Re: [Ocfs2-users] Re: FW: Use of OCFS2 file systems.

2006-10-04 Thread Sunil Mushran
File a bug on bugzilla (oss.oracle.com/bugzilla) with the full oops trace and any other information that seems relevant. Galan Merchan, Martin wrote: Hello, I’m working with OCFS2 on Red Hat Advanced Server 4 Patch 3 and I had kernel panics too. I use OCFS2 only for RAC archive logs and RMAN b

Re: [Ocfs2-users] Re: FW: Use of OCFS2 file systems.

2006-10-04 Thread Sunil Mushran
erver rebooted by OCFS and other by another part of the cluster (HA or RAC) - but result is exactly this - _all_ OCFSv2 panic on a short network/san outage, in all cases. - Original Message - From: "Sunil Mushran" <[EMAIL PROTECTED]> To: "ocfs2-users" Sent: Tu

Re: [Ocfs2-users] Resizing mountpoint in ocfs2

2006-10-05 Thread Sunil Mushran
Yes, the last patch to add this feature is in review. We will release this as part of ocfs2-tools 1.2.2. Kerr-Sheppard, Stephen wrote: Has anyone had to resize a mountpoint in ocfs2. In ocfs version 1 it was a case of unmounting and using the resizeocfs command. Is this still the same for ocf

Re: [Ocfs2-users] Resizing mountpoint in ocfs2

2006-10-06 Thread Sunil Mushran
- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: 05 October 2006 19:23 To: Kerr-Sheppard, Stephen Cc: ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] Resizing mountpoint in ocfs2 Yes, the last patch to add this feature is in review. We will release this as part of ocfs2-tools 1.2.2

Re: [Ocfs2-users] 2 Node cluster, and nodes OS hang

2006-10-06 Thread Sunil Mushran
tcpdump -i eth1 -C 10 -W 15 -s 1 -Sw /tmp/`hostname -s`_tcpdump.log -ttt 'port ' & Do this on both nodes before mounting on the second node. Ping me with the path to the logs. [EMAIL PROTECTED] wrote: Hello All, I have a NAS that I would like to use ocfs2 on. Currently there are thr

Re: [Ocfs2-users] Getting Started with ocfs2

2006-10-11 Thread Sunil Mushran
1. Do "/etc/init.d/o2cb load" Appears you have hand loaded the modules. Use the script. It needs to do some mounting too. 2. Run "ocfs2console". No -N option. Which distro are you on? Martin Evans wrote: John Peeken, Linux OS Support wrote: Martin, http://oss.oracle.com/projects/ocfs2/document

Re: [Ocfs2-users] Getting Started with ocfs2

2006-10-11 Thread Sunil Mushran
Martin J. Evans wrote: fine but on selecting cluster/configure nodes I still get dialogue saying "Could not query the state of the cluster stack. This must be resolved before any OCFS2 filesystem can be mounted." Could be because the script is installed as o2cb and not o2cb.init. Fedora Co

Re: [Ocfs2-users] out of memory... doing heavy IO on ocfs2 is wasting (low) memory?!

2006-10-11 Thread Sunil Mushran
Still in testing. It is a larger patch than normal and thus requires more time/effort. Once we are comfortable with it, we will look into releasing the patch for others to test before releasing 1.2.4. Jonah H. Harris wrote: What's the status on this? I've researched Bugzilla, SVN, and the lists

[Ocfs2-users] disk heartbeat timeout poll

2006-10-11 Thread Sunil Mushran
Thanks for all the replies in the previous usage poll. One of the chief concerns expressed was the (very) low default disk heartbeat timeout setting. Well, we want to bump it up but to what? Here are some qs the answers to which will help us determine that value. 1. What is your disk heartb

Re: [Ocfs2-users] kernel BUG at :58347!

2006-10-12 Thread Sunil Mushran
The ocfs2 shipping with that kernel is missing a few dlm patches. I'll put together some patches. There is a bugzilla logged on this. Bleeding Edge wrote: I've gotten the error below several times on different builds on different hardware: The setup is a bit different from the norm, it's a xen

Re: [Ocfs2-users] kernel BUG at :58347!

2006-10-18 Thread Sunil Mushran
Or a path I can explore to begin getting this kernel up to speed with the correct ocfs2-dlm? Or possibly are the correct ocfs2-dlm files in the kernel source tree for later kernels that I could upgrade to? Thanks for the replies so far! On 10/12/06, *Sunil Mushran* <[EMAIL PROTECTED] <ma

Re: [Ocfs2-users] oscfs2_extend_file:789 ERROR

2006-10-19 Thread Sunil Mushran
This issue was resolved in 1.2.2. Contact your SuSE rep to get a hold of a PTF kernel with the latest OCFS2 patches (1.2.3). Pickel, Gerd - Betax-Systems GmbH (IT) wrote: Hi all, last night one of our cluster nodes running SUSE SLES9/SP3 stopped working. We searched the internet for a solution,

Re: [Ocfs2-users] (no subject)

2006-10-19 Thread Sunil Mushran
Please could you explain what you are trying to do. SUVANKAR MOITRA wrote: dear team, I want to access files from ocfs2 to ext3 in Oracle 10g RAC , how can i ? regards Suvankar __ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam

Re: [Ocfs2-users] kernel BUG at :58347!

2006-10-19 Thread Sunil Mushran
at I could upgrade to? Thanks for the replies so far! On 10/12/06, *Sunil Mushran* <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>> wrote: The ocfs2 shipping with that kernel is missing few dlm patches. I'll put together some patches. There is a bugzilla logged on thi

Re: [Ocfs2-users] Fragmentation problem: Archive logs on ocfs1 and ocfs2

2006-10-19 Thread Sunil Mushran
The fragmentation issue seen in OCFS (Release 1) does not apply to OCFS2. The only thing in common between OCFS and OCFS2 are the four letters in the name. ;) Malhotra, Pradeep (GTI) wrote: Hello All, I have few questions around our use of ocfs1/2 for archive logs on 10G RAC. Is there an articl

Re: [Ocfs2-users] SUSE Patches

2006-10-20 Thread Sunil Mushran
Ping Novell. They issue interim PTF SLES kernels with the required fix(es) to help users tide over until the formal release. Needless to add, you need to have Novell Support. Andy Kipp wrote: Hello all, I am running SLES9 with the latest kernel patches (2.6.5-7.282-bigsmp) and ocfs2 version (1

[Ocfs2-users] OCFS2 Tools 1.2.2 released

2006-10-20 Thread Sunil Mushran
All, We are pleased to announce the release of OCFS2 TOOLS 1.2.2. This release is fully compatible with OCFS2 1.2.1+. The summary of changes in this release is as follows: * tunefs.ocfs2: Volume offline resize (growth) * tunefs.ocfs2: Regenerate volume uuid * debugfs.ocfs2: Commands bmap, findp

Re: [Ocfs2-users] RHEL 4 hotfix RPMs?

2006-10-23 Thread Sunil Mushran
# ./configure --with-kernel=/usr/src/kernels/2.6.9-42.X.EL-smp-i686/ # make rhel4_2.6.9-42.X.EL_rpm The rpms will be in the rpmdir as specified in ~/.rpmmacros. ~$ cat .rpmmacros %_topdir /rpmbuild/user %_tmppath /rpmbuild/user/tmp %_sourcedir /rpmbuild/user/SOURCES %_specdir

Re: [Ocfs2-users] RHEL 4 hotfix RPMs?

2006-10-23 Thread Sunil Mushran
It does make sense. :) Brian Long wrote: Thanks, Sunil, for the quick response. Thanks for the latest ocfs2- tools offline resize support, btw. Would it make sense to add this to the FAQ? /Brian/ On Mon, 2006-10-23 at 12:37 -0700, Sunil Mushran wrote: # ./configure --with-kernel=/usr

Re: [Ocfs2-users] 1.2.2 dump issue

2006-10-25 Thread Sunil Mushran
As the ocfs2 home page suggests, when building 1.2.x against mainline 2.6.14 and above, specify GENERIC_DELETE_INODE_NOT_TRUNCATES=1. Peter Larsen wrote: I'm running 1.2.2 here - compiled from source, and while I can read files, trying to delete a file on my OCFS2 volume produces the following:

Re: [Ocfs2-users] lvm2 not cluster aware - okay, so how should I stripe my LUNs?

2006-10-25 Thread Sunil Mushran
Fabio Corazza wrote: Last but not least.. a question for Sunil if he's gonna read this.. when OCFS2 will support data-on-inode would we need to reformat the file systems or will the new module be compatible with the <1.4 on-disk data? I am envisioning a compat flag to be added on existing volu

Re: [Ocfs2-users] OCFS2 Fencing and Locking MSA500 Array: Help

2006-10-25 Thread Sunil Mushran
Oct 11 05:15:28 vhaispora01 kernel: cciss0: unsolicited abort f7000250 Oct 11 05:15:28 vhaispora01 kernel: cciss0: retrying f7000250 That's where the problem begins. The cciss driver is unable to complete the I/Os, maybe due to a bus reset. Ping HP or whoever your contact is for the MSA500.

Re: [Ocfs2-users] BUG: unable to handle kernel NULL pointer dereference

2006-10-27 Thread Sunil Mushran
Please file a bugzilla with the details provided. It is easier to manage bugs that way. Thanks Christian Schlittchen wrote: Thanks to synchronous writes on the log-files I finally managed to get a log of the regular panics we experience. The setup is as follows: Three blades (IBM HS20) acces

Re: [Ocfs2-users] Unexpected reboot / crash

2006-10-27 Thread Sunil Mushran
The first issue could be because you don't have ocfs2-tools 1.2.2. The earlier version was missing a line in the ocfs2 init script. Rafal Maliszewski wrote: Hi guys I installed ocfs2 on 4 node (redhat 4u3) on shared FC devices ( EMC storage ). So I've noticed several problems: 1. When I r

Re: [Ocfs2-users] Interesting Error

2006-10-30 Thread Sunil Mushran
Which version of OCFS2? Did you run fsck.ocfs2 -f on that device? Do: # echo stat <6518860> | debugfs.ocfs2 -n /dev/sdX >/tmp/ext.out Email ext.out. Andy Kipp wrote: Anybody have any idea what this error involves? Or how to resolve it? Oct 30 05:11:24 groupwise-1-mht kernel: (8494,0):ocfs2_ex

Re: [Ocfs2-users] ocfs2 error messages

2006-10-31 Thread Sunil Mushran
Are you using NFS by any chance? I am looking into bug#790 that also encounters the same error (ESTALE). Matthew Flusche wrote: I received the following error messages in the system logs. Is this anything to be concerned with? kernel: (4074,0):ocfs2_populate_inode:234 ERROR: Invalid dino

Re: [Ocfs2-users] Interesting Error

2006-10-31 Thread Sunil Mushran
Replace sdX with the device on which the ocfs2 fs exists. You can use "mount | grep ocfs2" to find that volume. If the inode on disk is good, one explanation for the issue could be the lvb bug which was fixed in 1.2.2. Ping Novell to get a PTF kernel with ocfs2 1.2.3. Andy Kipp wrote: > Which

Re: [Ocfs2-users] ocfs2 error messages

2006-10-31 Thread Sunil Mushran
So it is bug#790. It just may be a case of unnecessary error messages for you. I am still investigating it. Matthew Flusche wrote: Yes, one of the clustered file systems is shared with nfs. -Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 31

Re: [Ocfs2-users] Ocfs2 and low memory

2006-10-31 Thread Sunil Mushran
To monitor ocfs2 memory usage, do: # cat /proc/slabinfo | egrep 'ocfs|dlm|size-256 |size-32 ' ocfs2_lock16226 16 2261 : tunables 120 60 0 : slabdata 1 1 0 ocfs2_inode_cache 22 24 115231 : tunables 24 12 0 : slabdata 8

Re: [Ocfs2-users] Newbie questions -- is OCFS2 what I even want?

2006-11-03 Thread Sunil Mushran
You are probably looking for a distributed file system. Check out afs and/or v9fs. Thad Beier wrote: Dear Sirs and Madams, I run a small visual effects production company, Hammerhead Productions. We'd like to have an easily extensible inexpensive relatively high-performance storage network u

Re: [Ocfs2-users] ocfs2/small files problem

2006-11-06 Thread Sunil Mushran
512 blocksize is almost never recommended. :) Use at least 1K. The FAQ has the explanation of the terms blocksize and clustersize. tao.ma wrote: Brian Sieler wrote: I have an ocfs2 file system with 4K blocks and 128K clusters--using it as a general-purpose clustered file system. Problem: Large

Re: [Ocfs2-users] About heartbeat threshold and drbd ...

2006-11-06 Thread Sunil Mushran
Refer to the ocfs2 FAQ. Sébastien CRAMATTE wrote: Hi I've set up drbd 0.8pre6 over OCFS2 and I've got some stability problems ... I've submitted some errors as bugs on ocfs2 bugzilla but they don't appear; I suppose they are already referenced. I've sent this to the drbd ml and "Lars" talk a

Re: [Ocfs2-users] Stopping O2CB failed / kernel panic on shutdown .... ???

2006-11-06 Thread Sunil Mushran
The shutdown ordering is incorrect. Shut down ocfs2 first, meaning all ocfs2 volumes should be unmounted, followed by o2cb, followed by drbd, followed by network. Sébastien CRAMATTE wrote: Hi I've set up ocfs2 1.2.3 + drbd 0.8; it seems to work except ... that when I shut down I obtain this ... Note
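The ordering given in the reply above amounts to stopping services in the reverse of the order they were started. A small sketch of that rule in Python follows. The service names are only labels, the helper names are hypothetical, and the startup position of drbd (between network and o2cb) is an assumption inferred from the shutdown order, not something the reply states.

```python
# Assumed startup order; drbd's position is inferred, not stated in the thread.
STARTUP_ORDER = ["network", "drbd", "o2cb", "ocfs2"]


def shutdown_order(startup=STARTUP_ORDER):
    """Services are stopped in the reverse of their startup order."""
    return list(reversed(startup))


def is_valid_shutdown(sequence, startup=STARTUP_ORDER):
    """True if the known services in `sequence` stop in reverse startup order."""
    rank = {service: index for index, service in enumerate(startup)}
    ranks = [rank[service] for service in sequence if service in rank]
    return ranks == sorted(ranks, reverse=True)
```

A sequence that stops network before the ocfs2 volumes are unmounted is reported as invalid, which is the situation the reply describes.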

Re: [Ocfs2-users] according to FAQ, I need some explanation about O2CB_HEARTBEAT_THRESHOLD setting...

2006-11-06 Thread Sunil Mushran
http://oss.oracle.com/projects/ocfs2/dist/documentation/ocfs2_faq.html#HEARTBEAT # How can one change the parameter value of O2CB_HEARTBEAT_THRESHOLD? This parameter value could be changed by adding it to /etc/sysconfig/o2cb and RESTARTING the O2CB cluster. This value should be the SAME on ALL
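Because the value has to be the same on all nodes, it can help to compare the setting across copies of /etc/sysconfig/o2cb before restarting the cluster. The following is a rough sketch in Python, assuming the file holds plain KEY=VALUE lines. The helper names are hypothetical, and the fallback of 7 for an unset value is an assumption about the default in releases of this era.

```python
def read_threshold(sysconfig_text, default=7):
    """Extract O2CB_HEARTBEAT_THRESHOLD from the text of /etc/sysconfig/o2cb.

    `default` (assumed to be 7) is returned when the key is absent or empty.
    """
    value = default
    for line in sysconfig_text.splitlines():
        line = line.strip()
        if line.startswith("#") or "=" not in line:
            continue  # skip comments and lines that are not assignments
        key, _, raw = line.partition("=")
        raw = raw.strip().strip("\"'")
        if key.strip() == "O2CB_HEARTBEAT_THRESHOLD" and raw:
            value = int(raw)
    return value


def thresholds_match(configs):
    """configs maps node name -> sysconfig text; True if every node agrees."""
    return len({read_threshold(text) for text in configs.values()}) <= 1
```

How the file contents are collected from each node (scp, ssh, configuration management) is left open here.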

Re: [Ocfs2-users] DMesg error on startup ...

2006-11-06 Thread Sunil Mushran
The startup order should be... ==> network, o2cb, ocfs2. Sébastien CRAMATTE wrote: Hi When my system boot I've got this error in dmesg : mount.ocfs2: Unable to access cluster service Cannot initialize cluster -- Mounting local filesystems... kjournald starting. Commit interval 5 seco

Re: [Ocfs2-users] OCFS2 Block / Clustersize with Oracle 10gR2

2006-11-09 Thread Sunil Mushran
Why are you looking at iops and not the io thruput? What is the actual io thruput? Please could you share some iostat numbers with us. In all our tests, we've seen very little difference in the actual io thruput between raw and ocfs2. Clustersize will mainly affect the alloc/dealloc performance.

Re: [Ocfs2-users] about 2 nodes enviroment and metalink note 394827.1

2006-11-09 Thread Sunil Mushran
I would imagine you are using RHEL4. If so, upgrade the ocfs2-tools to 1.2.2. The previous version of the ocfs2 init script did not always umount ocfs2 volumes on clean shutdowns leading to this problem. [EMAIL PROTECTED] wrote: Hi to all: In 2 nodes environment I've 'suffered' the 'reboot 1st

Re: [Ocfs2-users] OCFS2 Block / Clustersize with Oracle 10gR2

2006-11-09 Thread Sunil Mushran
3,400 IO/sec while the same benchmark with the same data will max out at 7K+ IO/sec on RAW. I'll grab the iostat data which we've kept over time and try to make some sense of it before posting anything additional. Thanks. /Brian/ On Thu, 2006-11-09 at 10:20 -0800, Sunil Mushran wrot

Re: [Ocfs2-users] OCFS2 Block / Clustersize with Oracle 10gR2

2006-11-10 Thread Sunil Mushran
Yes, it is required to enable asynch io. Brian Long wrote: Sunil, My DBA had a question. Are they supposed to use Filesystemio_options = setall? This enables async and direct IO. I guess they didn't enable it by default since they were originally testing raw devices. /Brian/ _

Re: [Ocfs2-users] Soft and Hard Readonly?

2006-11-13 Thread Sunil Mushran
To mount readonly, do: mount -o ro /dev/sdX /dir The difference between soft and hard is that in the latter the heartbeat is not started. Meaning no need to join a dlm domain. The latter only works on actual readonly devices. Sutterfield, Geary L. wrote: I've seen some discussions in the past

Re: [Ocfs2-users] Soft and Hard Readonly?

2006-11-13 Thread Sunil Mushran
The MITRE Corporation 703-983-5667 -Original Message----- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Monday, November 13, 2006 4:39 PM To: Sutterfield, Geary L. Cc: ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] Soft and Hard Readonly? To mount readonly, do: mount -o ro /de

Re: [Ocfs2-users] oom-killer patch: OCFS2 1.2.4

2006-11-13 Thread Sunil Mushran
Yes, it is being reviewed and being tested by us and our partners. We can make the patch available for open testing when it passes the review hopefully within a week. Michał Wilkowski wrote: Hello, can you estimate the date when the oom-killer patch (1.2.4) will be available? Is your patch

Re: [Ocfs2-users] out of memory problem concerning 64-bits?

2006-11-13 Thread Sunil Mushran
Are you sure it is ocfs2 that is eating memory? # egrep 'ocfs|dlm|size-256 |size-32 ' /proc/slabinfo # cat /proc/fs/ocfs2_dlm/*/stat Email the outputs. Michał Wilkowski wrote: Hello, we are currently running in production the system based on Redhat Enterprise Linux 4 Update 3, ver. 32-bits. W
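The egrep filter in the reply above can be turned into a rough per-cache memory estimate. The sketch below is in Python and assumes the slabinfo 2.x column layout (name, active_objs, num_objs, objsize, ...). The function name `ocfs2_slab_usage` is hypothetical, and num_objs times objsize is only an approximation of the memory each cache holds.

```python
import re

# Same pattern as the egrep above; the trailing spaces keep size-256(DMA)
# and size-32768 from matching.
PATTERN = re.compile(r"ocfs|dlm|size-256 |size-32 ")


def ocfs2_slab_usage(slabinfo_text):
    """Return {cache_name: approx_bytes} for the caches matching PATTERN."""
    usage = {}
    for line in slabinfo_text.splitlines():
        if not PATTERN.search(line):
            continue
        fields = line.split()
        if len(fields) < 4:
            continue  # not a data row
        name, num_objs, objsize = fields[0], int(fields[2]), int(fields[3])
        usage[name] = num_objs * objsize
    return usage
```

Running it on the contents of /proc/slabinfo at intervals gives a crude picture of whether the ocfs2 and dlm caches are the ones growing.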

Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 problem

2006-11-13 Thread Sunil Mushran
Considering o2net only cares whether it is connected to the other node or not, it should not make a difference whether one unplugs node 0 or node 1. The result should be the same. Node 1 should fence in both cases. Do you see messages indicating that the node(s) have lost connectivity? If so, c

Re: [Ocfs2-users] frozen ocfs2 filesystem under heavy webserver load

2006-11-13 Thread Sunil Mushran
None of these locks are busy. So they should not be the cause of the problem. Start with the version of ocfs2. Also, which kernel? What does top say? Is some process spinning? Also, what does this stresstest entail? Stephan Hendl wrote: Hi, I use a cluster of 4 nodes with ocfs2 as a webserve

Re: [Ocfs2-users] out of memory problem concerning 64-bits?

2006-11-14 Thread Sunil Mushran
kills occur regularly (once within a few days). I suppose that OCFS2 eats memory because since OCFS2 filesystem has been mounted, the LowMemory is decreasing and the number of OCFS2 locks is increasing (as you can see in the output). Can you give the hint what I can read in slabinfo? Regards Mi

Re: [Ocfs2-users] Ocfs2 errors on 3 node cluster

2006-11-14 Thread Sunil Mushran
It will be easier if you file a bug on oss.oracle.com/bugzilla with all the details. Like messages files from all nodes, etc. Why are you using 1.2.1? 1.2.3 has been out for a few months now. Randy Ramsdell wrote: Hi, Maybe someone could elaborate on these recurring ocfs2 errors that always re

Re: [Ocfs2-users] Bad magic number in inode

2006-11-15 Thread Sunil Mushran
The quick detect just looks for the superblock which is in the third block of the device. The full detect looks up the superblock and then the system directory. In your case it fails to locate the latter. This is one of the quirks when using an unpartitioned disk and later partitioning it. The pa

Re: [Ocfs2-users] OCFS2 Block / Clustersize with Oracle 10gR2

2006-11-15 Thread Sunil Mushran
If your aim is to compare db performance on raw v ocfs2, look at the actual thruput. As in, transactions per sec. Use whatever tool you want or that corresponds best to your workload. The detail statspack data provided by enterprise manager is useful in tuning the db itself. Say, for sizing the b

Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 problem

2006-11-15 Thread Sunil Mushran
will fence and node 0 will replay the journal and stay online. Any input is greatly appreciated. Thanks, Colin Farley Network Administrator E-Care Contact Center Services Phone:(204) 940-6244 Fax:(204) 940-7394

Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 problem

2006-11-15 Thread Sunil Mushran
anding IO on cluster file system). Well known problem with OCFSv2. One solution is to add a 3rd node and use interface bonding (be sure that the interface convergence time is less than the o2cb timeout). - Original Message - From: <[EMAIL PROTECTED]> To: "Sunil Mushran" <[EMAIL

Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 problem

2006-11-15 Thread Sunil Mushran
everything will change. - Original Message - From: "Sunil Mushran" <[EMAIL PROTECTED]> To: "Alexei_Roudnev" <[EMAIL PROTECTED]> Cc: <[EMAIL PROTECTED]>; Sent: Wednesday, November 15, 2006 11:03 AM Subject: Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 p

Re: [Ocfs2-users] re: o2hb_write_timeout:270 ERROR: Heartbeat write timeout

2006-11-16 Thread Sunil Mushran
On nodes db01 and db03 hb timed-out at 17:12:49. However, the nodes did not fully panic. As in, the network was shutdown but the hb thread was still going strong for some reason. Within 10 secs of that, by 17:12:59, db02 detected loss of network connectivity with both nodes db01 and db03. However

Re: [Ocfs2-users] re: o2hb_write_timeout:270 ERROR: Heartbeat write timeout

2006-11-16 Thread Sunil Mushran
BTW, you appear to be running with the default 12 sec timeout. From what I've gathered, most users appear to be using a higher 60 sec timeout. 12 secs does not leave much room for error. We are thinking of increasing the default in the next drop. Sunil Mushran wrote: On nodes db0
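For reference, the 12 sec and 60 sec figures above map onto O2CB_HEARTBEAT_THRESHOLD values. A minimal sketch of that arithmetic follows, assuming the relationship given in the OCFS2 FAQ (threshold = timeout / 2 + 1, with a heartbeat every 2 seconds). The function names are made up for illustration and are not part of any ocfs2 tool.

```python
# Sketch only: converts between O2CB_HEARTBEAT_THRESHOLD and the
# heartbeat timeout in seconds. Assumes the FAQ relationship
#   threshold = (timeout_secs / 2) + 1
# i.e. one heartbeat iteration every 2 seconds. Function names are
# hypothetical helpers, not an ocfs2 API.

HB_INTERVAL_SECS = 2  # assumed heartbeat interval


def threshold_to_timeout_secs(threshold: int) -> int:
    """Return the timeout in seconds implied by a threshold value."""
    if threshold < 2:
        raise ValueError("threshold must be at least 2")
    return (threshold - 1) * HB_INTERVAL_SECS


def timeout_secs_to_threshold(timeout_secs: int) -> int:
    """Return the threshold needed for a desired timeout in seconds."""
    if timeout_secs < HB_INTERVAL_SECS:
        raise ValueError("timeout must be at least one heartbeat interval")
    return timeout_secs // HB_INTERVAL_SECS + 1


if __name__ == "__main__":
    for threshold in (7, 31, 61):
        print(threshold, "->", threshold_to_timeout_secs(threshold), "secs")
```

Under that assumption, a threshold of 7 corresponds to the 12 sec default and 31 to the 60 sec timeout mentioned above.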

Re: [Ocfs2-users] re: o2hb_write_timeout:270 ERROR: Heartbeat write timeout

2006-11-22 Thread Sunil Mushran
down." I have increased my O2CB_HEARTBEAT_THRESHOLD to 61, but where is this message getting "10 seconds" from? Also this message is displayed because db02 was not able to check into the heartbeat filesystem, right? - -peter Sunil Mushran wrote: On nodes db01 and db03 hb tim

Re: [Ocfs2-users] Oracle 9i RAC on OCFS2

2006-11-27 Thread Sunil Mushran
Refer to CDSL (Context Dependent Symbolic Links) in the OCFS2 user's guide. Marcel Savelkoul wrote: Hi, I'm setting up a 2-node Oracle 9i RAC on OCFS2. But I have some problems with understanding how the shared Oracle_Home is being used. For instance there is the *$ORACLE_HOME/oracm/admin/cmc

Re: [Ocfs2-users] two nodes not joining the same cluster / domain

2006-11-27 Thread Sunil Mushran
Are you sure you have a shared disk? # mounted.ocfs2 -d /dev/sdX Do it on both nodes. The UUID should be the same. rain c wrote: hi, i am new to ocfs2 and i think this is a newbie problem ;-) i have a 2-node ocfs2 cluster, but when i mount the ocfs2-partition on both nodes they both use slot

Re: [Ocfs2-users] OCFS 1.2.4 and extended attributes

2006-12-01 Thread Sunil Mushran
So we hope to pre-release 1.2.4 next week. The main patch looks ready. We are adding one more small patch to it. Fingers crossed. Extended Attrs is being targeted for 1.4... sometime next year. Cline, Ernest wrote: Hello, I'd also like to throw my hat in as a willing tester of OCFS 1.2.4, if p

Re: [Ocfs2-users] OCFS2 file and inode limits

2006-12-04 Thread Sunil Mushran
Number of files/inodes in the entire fs is limited only by space. However, on a per-directory basis, while the real limit is still very large, the practical limit is in 10s of thousands. Matt Soccio wrote: I am evaluating OCFS2 for a simple load balanced cluster setup for mail, web, and file se

Re: [Ocfs2-users] Re: Ocfs2 - unable to load configfs ?

2006-12-04 Thread Sunil Mushran
Where did you get this kernel from? Prashant Tambe wrote: Hi, Did you ever get this error about ocfs2console could not start cluster stack resolved? Because I am having the same issue and I can't find any documentation about it. If you found the solution, can you email me? [EMAIL PROTECTED] I h

Re: [Ocfs2-users] OCFS2 file and inode limits

2006-12-05 Thread Sunil Mushran
Not till we implement some sort of directory indexing. Nohez wrote: Is there any timeframe set for removing the 32,000 directories limit ? We were considering OCFS2 deployment but cannot because of this limitation. Thanks Nohez On Mon, 4 Dec 2006, Sunil Mushran wrote: Number of files

Re: [Ocfs2-users] OCFS2 and berkeley database files

2006-12-05 Thread Sunil Mushran
You are on a very old release of OCFS2. The OCFS2 homepage and FAQ both list a SLES9 kernel version newer than the one you are using. But that may not be the reason for the error. My bet is that bdb is attempting to create a shared writeable mmap that ocfs2 1.2 does not support. [EMAIL PROTECT

Re: [Ocfs2-users] Oracle Application Server 10.1.2.0.2 Install on OCFS2

2006-12-06 Thread Sunil Mushran
strace apache. That may provide us with some clues. [EMAIL PROTECTED] wrote: Hello all, Has anyone installed Oracle Application Server 10.1.2.0.2 Infrastructure tier including the preseeded 10.1.0.4 database (High Availability option otherwise known as a cold failover cluster) on OCFS2 wher
