Re: [Ocfs2-users] Linux kernel crash due to ocfs2

2011-09-15 Thread Sunil Mushran
7487795/root/o2image.ppc.dbg 1006-1009 rwxp 1006 00:00 0 [heap] f768-f7ff rw-p f768 00:00 0 ff9a-ffaf rw-p ff9a 00:00 0 [stack] Aborted (core dumped) On Thu, 2011-09-08 at 12:10 -0700, Sunil Mushran wrote: http://oss.oracle.com

Re: [Ocfs2-users] The mounting of too many OCFS2 volumes (i.e. 50 or more) per cluster

2011-09-15 Thread Sunil Mushran
That's very old. We have users having 50+ mounts. The one disadvantage is that the o2cb stack heartbeats on all mounts. That problem will be addressed in 1.8 (the tools will be released soon), with global heartbeat (hb volumes are user-configurable). Having said that, the number of volumes

Re: [Ocfs2-users] Trouble getting node to re-join two node cluster (OCFS2/DRBD Primary/Primary)

2011-09-15 Thread Sunil Mushran
open(/dev/drbd0, O_RDONLY|O_DIRECT) = -1 EMEDIUMTYPE (Wrong medium type) drbd_open() ... if (mdev-state.role != R_PRIMARY) { if (mode FMODE_WRITE) rv = -EROFS; else if (!allow_oos) rv = -EMEDIUMTYPE;

Re: [Ocfs2-users] Trouble getting node to re-join two node cluster (OCFS2/DRBD Primary/Primary)

2011-09-15 Thread Sunil Mushran
-F does not run the full fsck. -f does. But I would not recommend running fsck as this corruption is not normal. The inodes in the system directory have been overwritten. That typically means a storage issue. The fs does not create/remove inodes in sysdir. Only the tools do that. You may want

Re: [Ocfs2-users] disable heartbeat nic caused ocfs2 errors

2011-09-12 Thread Sunil Mushran
ocfs2 uses disk heartbeat to detect node liveness. It uses net heartbeat to detect link liveness. Both need to operate for the cluster to function. If the network link between two nodes snaps, then one of the two nodes is fenced. The stack below indicates that the two nodes are not able to

Re: [Ocfs2-users] node_count=0

2011-09-12 Thread Sunil Mushran
It is wrong config. On 09/09/2011 10:15 PM, Hai Tao wrote: I have a two node ocfs2 cluster, and in the /etc/ocfs2/cluster.conf file, the node_count=0 rather than 2. Does this have to be a wrong config, and how would this affect the cluster? Thanks. Hai Tao

Re: [Ocfs2-users] mount type heartbeat=local

2011-09-09 Thread Sunil Mushran
That's mount type. Yes, we should not have overloaded the term local. On 09/09/2011 07:53 PM, Hai Tao wrote: but this is what I saw in the guide OCFS2 - A Cluster File System For Linux http://oss.oracle.com/projects/ocfs2/dist/documentation/v1.4/ocfs2-1_4-usersguide.pdf *

Re: [Ocfs2-users] Linux kernel crash due to ocfs2

2011-09-08 Thread Sunil Mushran
.el5.ppc On Wed, 2011-09-07 at 09:13 -0700, Sunil Mushran wrote: version of ocfs2-tools? On 09/07/2011 09:10 AM, Betzos Giorgos wrote: Hello, I tried what you suggested but here is what I got: # o2image /dev/mapper/mpath0 /files_shared/u02.o2image *** glibc detected *** o2image: corrupted

Re: [Ocfs2-users] (mount.ocfs2, 3315, 4):ocfs2_global_read_info:403 ERROR: status = 24

2011-09-06 Thread Sunil Mushran
harmless. the message needs to be silenced. On 09/06/2011 01:31 PM, Stefan Priebe - Profihost AG wrote: Hi List, i've upgraded some machines to linux kernel from 2.8.38 to 3.0.4. Now i'm always seeing this message when mounting an ocfs2 volume: [ 38.745584]

Re: [Ocfs2-users] dlm locking bug?

2011-09-02 Thread Sunil Mushran
Log what you have in a bz. I can take a look. I doubt you will be able to attach that file though. You'll need to provide me with a link. On 09/02/2011 07:28 AM, Sérgio Surkamp wrote: Hello, We have got a problem this morning with our cluster. Cluster setup: Servers: * Two R800 Dell

Re: [Ocfs2-users] Max number of files in OCFS2 file system

2011-08-31 Thread Sunil Mushran
There is no such limit. You are running into a bug that has been fixed in mainline kernel 2.6.35 and is available with the UEK kernel. Upgrade to that kernel, install ocfs2-tools 1.6 and enable the discontig-bg feature. On 08/31/2011 03:15 PM, Omega Xtreme wrote: Hi All, Please I would like to

Re: [Ocfs2-users] Slow OCFS2 file creation / how to make ocfs2 generally faster?

2011-08-22 Thread Sunil Mushran
On 08/22/2011 09:08 AM, Stefan Priebe - Profihost AG wrote: HI, here are all values. Just a side note all machines had a fresh reboot. So these values are not right after the test. Network latency: cat /sys/kernel/debug/o2net/stats

Re: [Ocfs2-users] Lost ocfs mount point on one node between two shared servers

2011-08-22 Thread Sunil Mushran
On 08/22/2011 09:22 AM, Kalra, Pratima wrote: Hello All, Is it normal to lose a shared node in OCFS shared servers? We have either lost the whole ocfs mount point or lost mount point on one of the shared nodes couple of times. Is this due to some misconfiguration? Define lose? Could it

Re: [Ocfs2-users] Lost ocfs mount point on one node between two shared servers

2011-08-22 Thread Sunil Mushran
The user's guide explains all that. On 08/22/2011 09:34 AM, Kalra, Pratima wrote: It could be possible that it wasn't auto-mounted on reboot. Is there a separate setting for that? *From:*Sunil Mushran [mailto:sunil.mush...@oracle.com] *Sent:* Monday, August 22, 2011 9:27 AM *To:* Kalra

Re: [Ocfs2-users] IO performance appears slow

2011-08-19 Thread Sunil Mushran
o2net not equivalent to a simple ping between the hosts? Is my reported latency too great for OCFS2 to function well? Thanks for your assistance. -Nick *From:*Sunil Mushran [mailto:sunil.mush...@oracle.com] *Sent:* Thursday, August 18, 2011 10:26 PM *To:* Nick Geron *Cc:* ocfs2-users

Re: [Ocfs2-users] IO performance appears slow

2011-08-19 Thread Sunil Mushran
world? Is there a sweet spot for network latency that I should strive for? The user guide only makes mention of 'low latency' but lacks figures save for heartbeat and timeouts. -nick *From:*Sunil Mushran [mailto:sunil.mush...@oracle.com] *Sent:* Friday, August 19, 2011 2:30 PM *To:* Nick Geron *Cc

Re: [Ocfs2-users] sudden crash, possibly OCFS was the cause?

2011-07-22 Thread Sunil Mushran
The log is not complete. It is best to configure netconsole/kdump/etc to capture the full oops trace. Having said that, the following patch fits the issue best. Available in releases after 1.4.7. http://oss.oracle.com/git/?p=ocfs2-1.4.git;a=commitdiff;h=adbd097b5bdc15c999bc04b16c6fba379cd5d3f2

Re: [Ocfs2-users] Slow umounts on SLES10 patchlevel 3 ocfs2

2011-07-14 Thread Sunil Mushran
Well, half a million on its own does not account for the time. But if one were to add heavily loaded servers, slower interconnect, high% of shared resources, the numbers could add up. I mean, this is a fairly old release. We have made improvements since then. Having said that, the biggest

Re: [Ocfs2-users] reset all ocfs2 data

2011-07-11 Thread Sunil Mushran
If you've rebooted, then there is not much more to do. # /sbin/lsmod | grep ocfs2 # egrep ocfs2|dlm /proc/slabinfo After shutting down o2cb, run the above commands. The first one lists the modules. The second lists the slabs. Both should show no entries. Did you file a bugzilla for this? If

Re: [Ocfs2-users] Slow umounts on SLES10 patchlevel 3 ocfs2

2011-07-07 Thread Sunil Mushran
It was designed to run in prod envs. On 07/07/2011 12:21 AM, Marc Grimme wrote: Sunil, can I query those figures during runtime of a productive cluster? Or might it influence the availability performance what ever? Thanks for your help. Marc. - Sunil Mushransunil.mush...@oracle.com

Re: [Ocfs2-users] inotify

2011-06-30 Thread Sunil Mushran
This is because we have not hooked up inotify to the cluster stack. On 06/30/2011 07:26 AM, Jeroen Koekkoek wrote: Hi, I'm running a 2 node OCFS2 + DRBD cluster to host maildirs. The IMAP server (Dovecot) uses inotify to track changes to the maildir, and informs the client when changes

Re: [Ocfs2-users] OCFS2 Crash

2011-06-29 Thread Sunil Mushran
You should ping your kernel vendor. While this does not look ocfs2 related, even if it did, you will be first asked to upgrade to a more recent kernel, etc. And all those bits will come from the vendor. On 06/29/2011 02:20 PM, B Leggett wrote: Sunril, After that first attempt I tried severla

Re: [Ocfs2-users] GPF when mounting second device in same cluster

2011-06-29 Thread Sunil Mushran
Strange. Both udevd and mount thread encountered issue in memory allocation routine. I would suggest you ping the kernel vendor. This looks more than just the fs. On 06/28/2011 04:36 PM, Richard Pickett wrote: Gents, OK, back to the single cluster, 2-node, w/ 3 devices. Here's my cluster.conf

Re: [Ocfs2-users] multiple cluster doesn't work

2011-06-28 Thread Sunil Mushran
On 06/28/2011 08:07 AM, Richard Pickett wrote: 1 Terabyte. We have 3 1Terabyte drives. They are already being replicated by the lower-layer architecture, so we don't need to raid them. We'd like to be able to use all three devices at the same time for archive purposes. I'm surprised to find

Re: [Ocfs2-users] multiple cluster doesn't work

2011-06-28 Thread Sunil Mushran
, Jr. www.MyHaitianAdoption.org http://www.MyHaitianAdoption.org P.S. Have you downloaded the journal from my trip to Haiti: http://www.myhaitianadoption.org/trips/journal-earthquake-rescue-jan-23-feb-2/ On Mon, Jun 27, 2011 at 9:05 PM, Sunil Mushran

Re: [Ocfs2-users] how to do rolling upgrade the ocfs2 2 node cluster along with new kernel without application outage.

2011-06-28 Thread Sunil Mushran
You have to be more specific than that. Maybe best if you ping support. On 06/28/2011 09:26 AM, veeraa bose wrote: Hi ALL, I have to do rolling upgrade on two node ocfs2 cluster, patch the server one by one, with out application outage. I tested in pre-prod 2 node cluster, once the DB is

Re: [Ocfs2-users] Heartbeat stays active stops o2cb shutdown

2011-06-28 Thread Sunil Mushran
@mercer.com_ mailto:chris.sh...@mercer.com _www.mmc.com_ http://www.mmc.com/ Working Hours: Mon-Fri: 8:00am-4:00pm AEST *From:* Sunil Mushran [mailto:sunil.mush...@oracle.com] *Sent:* Tuesday, 28 June 2011 3:16 AM

Re: [Ocfs2-users] multiple cluster doesn't work

2011-06-27 Thread Sunil Mushran
Whereas the cluster.conf allows users to define multiple clusters, only one cluster can be active at any time. The bug you ran into has probably been fixed. The link has been posted in the bz. Why do you need multiple clusters active concurrently? On 06/27/2011 04:44 PM, Richard Pickett wrote:

Re: [Ocfs2-users] Kernel oops: ocfs2_read_blocks

2011-06-24 Thread Sunil Mushran
How many nodes? Does it happen on all the nodes or one in particular? Are you running the same kernel version on all nodes? Did this issue start reproducing after some update? How often does it happen? Maybe best if you file a bugzilla on oss.oracle.com/bugzilla and answer the qs there. This

Re: [Ocfs2-users] ocfs2 with cman luster stack

2011-06-23 Thread Sunil Mushran
So this is ubuntu 11.04. The qs is is anyone using that distro/version gotten this to work. If not, then one possibility is a build issue. Maybe file a bug with ubuntu to see if they have tested it with their binaries. On 06/23/2011 01:55 PM, charles wrote: hello, i opened a bug on the ocfs2

Re: [Ocfs2-users] number of nodes is LUN dependent or cluster group dependent ?

2011-06-22 Thread Sunil Mushran
On 6/21/2011 9:41 PM, Thomas Lau wrote: As title, because LUN A on cluster group 1 is using N=8, if I have new LUN B, can I set number of node to something bigger and join same cluster group 1? Yes. ___ Ocfs2-users mailing list

Re: [Ocfs2-users] Unable to umount a filesystem - OCFS still thinks it has it mounted?‏

2011-06-17 Thread Sunil Mushran
Check /proc/mounts. That's the kernels view of the mounts. mount looks at /etc/mtab. And ocfs2 1.2 adds and removes entries /proc/fs/ocfs2 during mount/umount. Also, see if there are relevant errors in dmesg. On 06/16/2011 07:16 PM, Neil Campbell wrote: Hi all, Not sure what has happened

Re: [Ocfs2-users] Any suggestions how to copy between two OCFS2 volumes faster?

2011-06-15 Thread Sunil Mushran
Try dd bs=1M iflag=direct on few files. See if that helps. On 06/15/2011 01:00 PM, Ulf Zimmermann wrote: I need to copy a number of volumes from one SAN to another SAN. Most of our volumes are snapclone based, so moving those has been easy. But we got several 700GB volumes, which I can't

Re: [Ocfs2-users] ocfs2 slow write performance on Linux 2.6.38

2011-06-13 Thread Sunil Mushran
What type of writes are these... sequential or random? On 6/12/2011 5:37 PM, fibrer...@gmail.com wrote: Hello all, I am benchmarking OCFS2 in a single node environment to see how its performance stacks up against other Linux file systems. My hardware is dual CPU, 6-cores per CPU, 2.4GHz

Re: [Ocfs2-users] ocfs2 writing files bigger than 4MB

2011-06-08 Thread Sunil Mushran
Upgrade to a more recent kernel (2.6.35+). Upgrade ocfs2-tools to 1.6.x. Run tunefs.ocfs2 and enable feature discontig-bg. This will address this issue. On 06/08/2011 02:16 PM, Osvaldo Alvarez Pozo wrote: Hi I can not copy files bigger than 4Mbytes! I have an ocfs2 cluster with 4 nodes using

Re: [Ocfs2-users] Errors about a hole in an inode, not fixed by fsck.ocfs2

2011-06-08 Thread Sunil Mushran
On 06/08/2011 03:36 PM, Herman wrote: Hi all, Using: RHEL 6 / DRBD 8.3.10-2 kmod from ElRepo / OCFS2 compiled from Redhat's kernel source 2.6.32-71.18.2.el6.x86_64 I have a system running DRBD with OCFS2. The OCFS2 filesystem is not being used for databases. I had a split-brain due to

Re: [Ocfs2-users] cannot write to filesystem, permission denied?

2011-06-07 Thread Sunil Mushran
On 06/07/2011 05:01 AM, Sven Karlsson wrote: Hello, We have installed Fedora 15 to get the latest ocfs2 release nicely packaged in a 2.6.38 kernel and ocfs2-tools 1.6.3. Setup went fine, mkfs.ocfs2 went fine, a cluster was created and the local node added: # o2cb_ctl -C -i -n mycluster -t

Re: [Ocfs2-users] Problems with descriptions.

2011-06-02 Thread Sunil Mushran
That's the number of files open on the system. So this looks like an app problem. Some app has many files open. On 06/01/2011 10:37 PM, Vasyl S. Kostroma wrote: Hi guys! I can’t find an answer in google, so my last hope

Re: [Ocfs2-users] mkfs.ocfs2 optimal options for web server

2011-06-01 Thread Sunil Mushran
On 06/01/2011 02:03 AM, Alex Sobrino wrote: We're planning a three web server cluster based on OCFS2. Basically, it will handle a huge CMS, with lots of PHP code, and some file uploads (but mainly file reads). Initially, I was thinking in: - Block size 4K - Cluster size 4K - Node slots 3

Re: [Ocfs2-users] Large Files Hang Server

2011-05-25 Thread Sunil Mushran
If your apps do not care about atime, then noatime is helpful. data=writeback should performs better than data=ordered. But there is a small chance that files having trailing nulls if a node were to reboot after a journal commit but before a data flush. This is documented in the manpages and the

Re: [Ocfs2-users] Large Files Hang Server

2011-05-24 Thread Sunil Mushran
Repeat the same test but with volumes mounted with data=writeback mount option. mount -o data=writeback /dev/sdX /path On 05/24/2011 07:11 AM, Keith W wrote: Hello list. Apologies in advance, this may be a bit long. Just trying to give as much info as I can at the outset. I have a two node

Re: [Ocfs2-users] Large Files Hang Server

2011-05-24 Thread Sunil Mushran
/2011 11:45 AM, Keith W wrote: No change in behavior. My mount options /dev/sdj1 /u03ocfs2 _netdev,noatime,data=writeback,nointr 0 0 +---+ + Keith + +---+ On Tue, 24 May 2011, Sunil Mushran wrote

Re: [Ocfs2-users] Ocfs and ASM

2011-05-24 Thread Sunil Mushran
There should be no conflict. On 05/24/2011 11:32 AM, Keith W wrote: I have a lab system that is currently running Oracle RAC 11g with ASM volumes and grid infrastructure Is it possible to have an ocfs2 cluster running and accessing a different disk as well as the oracle clustering for RAC

Re: [Ocfs2-users] Large Files Hang Server

2011-05-24 Thread Sunil Mushran
the transfer. I am starting to think this is expected behaviour. Am I correct? +---+ + Keith + +---+ On Tue, 24 May 2011, Sunil Mushran wrote: Writeback will help if the writes are on one node and the ls on another

Re: [Ocfs2-users] OCFS2 1.6 for RHEL?

2011-05-17 Thread Sunil Mushran
On 05/17/2011 06:00 AM, Sven Karlsson wrote: * Joel Becker wrote: The latest ocfs2 is always in mainline. Any distro building a recent kernel an including ocfs2 will have it. As Sunil says, this includes Fedora, Debian, Ubuntu, and OpenSuSE. Whether ocfs2 is enabled in the CentOS kernel is

Re: [Ocfs2-users] 答复: 答复: hi,if i can post ocfs2-dmesg to you?i have trouble on ocfs2

2011-05-13 Thread Sunil Mushran
40f165f416bde747d85cdf71bc9dde700912f71f tags/v2.6.35-rc6~34^2~13] Would you give me a patch or URL. I only know kernel.org packag thanks *发件人:*Sunil Mushran [mailto:sunil.mush...@oracle.com] *发送时间:*2011年5月13日10:09 *收件人:*Longguang Yue *抄送:*ocfs2-users@oss.oracle.com *主题:*Re: 答复: hi,if i can post ocfs2-dmesg to you?i

Re: [Ocfs2-users] OCFS2 1.6 for RHEL?

2011-05-13 Thread Sunil Mushran
On 05/13/2011 03:13 AM, Kristian Jörg wrote: Hello! When is it planned ocfs2 1.6 will be available for RHEL? /Kristian No plans. Only OL/UEK. ___ Ocfs2-users mailing list Ocfs2-users@oss.oracle.com

Re: [Ocfs2-users] OCFS2 1.6 for RHEL?

2011-05-13 Thread Sunil Mushran
Americas Shared Solutions Architecture (SSA) Hewlett-Packard Company 281 475 8632 / Tel kei...@hp.com / Email Reach the team at s...@hp.com -Original Message- From: ocfs2-users-boun...@oss.oracle.com [mailto:ocfs2-users-boun...@oss.oracle.com] On Behalf Of Sunil Mushran Sent

Re: [Ocfs2-users] fsck.ocfs2

2011-05-13 Thread Sunil Mushran
On 05/13/2011 11:44 AM, Xavier Diumé wrote: Hello, Is it possible to fsck a mounted filesystem. When one of the cluster nodes reboots because a kernel panic, the device requires fsck.ocfs2 because in mounted.ocfs2 -f rebooted node is shown. If mounted.ocfs2 -f shows the rebooted node, that

Re: [Ocfs2-users] OCFS2 1.6 for RHEL?

2011-05-13 Thread Sunil Mushran
. /Kristian Sunil Mushran skrev 2011-05-13 18:46: Support is a whole different ballgame. I am only talking about availability. And I interpreted that qs to be asking whether ocfs2 1.6 be available for the standard rhel kernels. And the answer is no. It will be available only for uek. On 05/13/2011 08

Re: [Ocfs2-users] How to change node name ?

2011-05-12 Thread Sunil Mushran
It is a manual process until 1.6. The upcoming release of tools (1.8) will allow online modification and removal. On 05/12/2011 05:02 AM, Thomas Lau wrote: Guys, how could I change node name and delete nodes after add into cluster? /Connected by MOTOBLUR™/

Re: [Ocfs2-users] 答复: hi,if i can post ocfs2-dmesg to you?i have trouble on ocfs2

2011-05-12 Thread Sunil Mushran
On 05/12/2011 06:50 PM, Longguang Yue wrote: Thank you first of all totally, there are 5 kinds of error occur. Spinlock leads to cpu lockup, o2net modules panic, kernel BUG at mm/slub.c:2969, BUG unable to handle kernel paging request at addr My environment: kernel-2.6.32.23 + xen +

Re: [Ocfs2-users] Server hang after error

2011-05-09 Thread Sunil Mushran
Your config is sufficient. Hard to say why it did not reboot. Ping the debian mailing list to see if there are reports of the same on whatever kernel you are on. As far as the reason for it goes, there should have been a message just prior to the Kernel Panic message. Most likely reason is that

Re: [Ocfs2-users] read/write performance across cluster

2011-05-04 Thread Sunil Mushran
On 05/04/2011 09:56 AM, Florin Andrei wrote: On 05/04/2011 09:44 AM, Srinivas Eeda wrote: Yes, there is locking involved. Extending a file needs an exclusive lock. Grepping a file needs read lock. If the same node(lets call it writer node) does extending and grepping, then grep already has a

Re: [Ocfs2-users] Kernel Feature List?

2011-05-03 Thread Sunil Mushran
man mkfs.ocfs2 is better. On May 3, 2011, at 6:24 PM, Tiger Yang tiger.y...@oracle.com wrote: On 05/02/2011 03:57 PM, Stefan Priebe - Profihost AG wrote: Hi, is there a list available which ocfs2 feature is available at which vanilla kernel version? Stefan Hi, There is one list for

Re: [Ocfs2-users] can't mount device

2011-04-22 Thread Sunil Mushran
Is this during boot or is the mount manual? Does it succeed on second attempt? On 04/22/2011 06:33 AM, Christophe BOUDER wrote: Hello, i'm running ocfs2 on 27 nodes with 2 devices ( 2 fiber channel disk array storage) on debian system vanilla kernel 2.6.38.2 ocfs2-tools1.6.3-1

Re: [Ocfs2-users] How long for an fsck?

2011-04-21 Thread Sunil Mushran
On 04/21/2011 06:43 AM, Josep Guerrero wrote: I have a cluster with 8 nodes, all of them running Debian Lenny (plus some additions so multipath and Infiniband works), which share an array of 48 1TB disks. Those disks form 22 pairs of hardware RAID1, plus 4 spares). The first 21 pairs are

Re: [Ocfs2-users] Problem mounting volume on second node after uncontrolled reboot.

2011-04-13 Thread Sunil Mushran
Could be that the device has been claimed by some other entity. Like lvm, multipath, etc. On Apr 13, 2011, at 4:08 AM, Asle Næss asle.na...@telio.no wrote: Hi, I have the following setup: 2 linux nodes connected to shared SAN, both nodes have shared OCFS2 volume as /dev/sdb1 Both

Re: [Ocfs2-users] o2cb stack and kernel = 2.6.37

2011-04-08 Thread Sunil Mushran
On 04/08/2011 08:01 AM, Werner Flamme wrote: Joel, I think I encountered otherwise :-( In our OCFS2 cluster there are up to 15 active nodes. 7 of them were running yesterday (2 with Oracle Linux, 5 with SLES 11 SP1 + SLE HAE SP1). When applying the last patches of the SLE HAE, the patched

Re: [Ocfs2-users] Error when fsck.ocfs2

2011-04-08 Thread Sunil Mushran
On 04/08/2011 09:25 AM, Mauro Parra wrote: Hello, I'm getting this error: $fsck.ocfs2 /dev/mapper/360a98000572d434e4e6f6335524b396f_part1 fsck.ocfs2: File not found by ocfs2_lookup while locking down the cluster Any idea or hint? This means it failed to lookup the journal in the system

Re: [Ocfs2-users] Error when fsck.ocfs2

2011-04-08 Thread Sunil Mushran
So that's appears to be ok. How about a strace. strace -o /tmp/fsck.out fsck.ocfs2 /dev/mapper/360... On 04/08/2011 10:24 AM, Mauro Parra wrote: debugfs.ocfs2 -R ls -l // /dev/mapper/360a98000572d434e4e6f6335524b396f_part1 514 drwxr-xr-x 6 0 04096

Re: [Ocfs2-users] Error when fsck.ocfs2

2011-04-08 Thread Sunil Mushran
Ahh... journal: is missing. And so is local_alloc:. That's inode#s 532 and 536. OCFS2 does not remove system files from the kernel. The sysfiles are removed only by tunefs.ocfs2 and that too when one is reducing the number of slots. But then too it reduces the higher slot numbers. is

Re: [Ocfs2-users] Nodes losing connection

2011-04-05 Thread Sunil Mushran
A 7 second timeout is very low. We default to 30 secs. And depending on your setup, you could easily increase it to 60 secs. On 04/05/2011 12:20 AM, Marc Kowal wrote: Hi all, we are currently running a three node Moodle/Apache cluster with OCFS2 as upload directory. Everything is fine, but

Re: [Ocfs2-users] o2cb stack and kernel = 2.6.37

2011-04-04 Thread Sunil Mushran
What problem are you encountering? That protocol change is backward compatible. Atleast was in our testing. On Apr 4, 2011, at 6:20 AM, Jacek Stępniewski jacek.stepniew...@agora.pl wrote: Hello Is there anybody who uses o2cb stack with mixed kernels (2.6.37 or more and less than

Re: [Ocfs2-users] o2cb stack and kernel = 2.6.37

2011-04-04 Thread Sunil Mushran
On 04/04/2011 10:08 AM, Jacek Stępniewski wrote: At protocol negotiation 1.0 is selected (rest of nodes are pre 2.6.37 with dlm 1.0). But connecting node (with kernel= 2.6.37) checks builtin version, not negotiated one. I have found this code in dlmdomain.c:1615 (function:

Re: [Ocfs2-users] Anomaly when writing to OCFS2

2011-04-01 Thread Sunil Mushran
Are you using any mount options? On 04/01/2011 01:12 AM, John Gardner wrote: Hi everyone We have a problem when writing a some files to OCFS2 and I hope perhaps some of you may be able to help. I'll outline our system first. We have a Oracle RAC system about to go into service. There are

Re: [Ocfs2-users] Node Recovery locks I/O in two-node OCFS2 cluster (DRBD 8.3.8 / Ubuntu 10.10)

2011-04-01 Thread Sunil Mushran
I believe this is a pacemaker issue. There was a time it required a qdisk to continue working as a single node in a 2 node cluster when one node died. if pacemaker people don't jump in, you may want to try your luck in the linux-cluster mailing list. On 04/01/2011 11:44 AM, Mike Reid wrote: I

Re: [Ocfs2-users] Anomaly when writing to OCFS2

2011-04-01 Thread Sunil Mushran
On 04/01/2011 11:42 AM, John Gardner wrote: Sunil Are you using any mount options? Just the default _netdev option. I did have some more options but remounted with a minimal _netdev to try and solve the problem. John So the file is written to one directory and copied to another. While

Re: [Ocfs2-users] Anomaly when writing to OCFS2

2011-04-01 Thread Sunil Mushran
On 04/01/2011 12:42 PM, John Gardner wrote: Sunil So the file is written to one directory and copied to another. While it is ok in the first location, it is seen to be 0 bytes in the second location. Is it copy or move? Who is doing this? Some script? Oracle software? Are you running short

Re: [Ocfs2-users] Anomaly when writing to OCFS2

2011-04-01 Thread Sunil Mushran
On 04/01/2011 01:14 PM, John Gardner wrote: So everything works fine as long as the temp file is being written to. The out file is the issue. Not exactly, the temp file is always written to whether the temp file is located on OCFS2 or sym linked to the EXT3 partition. The problem occurs

Re: [Ocfs2-users] Best practice to add new node

2011-03-30 Thread Sunil Mushran
On 03/29/2011 07:15 PM, Thomas Lau wrote: I am trying to add node from one of the active node, after use o2cb_ctl, node added successfully, but even I transfer cluster.conf to another active node, do I need to restart o2cb to make sure all other active nodes are being receive the new node add

Re: [Ocfs2-users] nfsd hanging with ocfs2 1.4.7...

2011-03-25 Thread Sunil Mushran
Are you mount with nordirplus? For more refer to this email. http://oss.oracle.com/pipermail/ocfs2-announce/2008-June/25.html On 03/25/2011 08:49 AM, James Abbott wrote: Hello, I've recently setup an ocfs2 volume via a 4Gb/s SAN which is directly mounted on two CentOS 5.5 machines

Re: [Ocfs2-users] bug resolve yet for export OCFS2 volume to NFS client ?

2011-03-22 Thread Sunil Mushran
On 03/21/2011 11:52 PM, Thomas Lau wrote: I found this from ocfs1.4 document: g) NFS OCFS2 volumes can be exported as NFS volumes. This support is limited to NFS version 3, which translates to Linux kernel version 2.4 or later. Users must mount the NFS volumes on the clients using the

Re: [Ocfs2-users] Different disksizes on same disks between cluster nodes

2011-03-22 Thread Sunil Mushran
Can you run df without any arguments? Could be a rounding issue in df. Still unsure how ocfs2 install/deinstall could trigger that. On 03/22/2011 01:52 AM, Kristiansen Morten wrote: Hi, Resently we resinstalled our four node cluster and tried to upgrade ocfs2 from 1.2.9 to 1.4.7. When two

Re: [Ocfs2-users] bug resolve yet for export OCFS2 volume to NFS client ?

2011-03-22 Thread Sunil Mushran
Last I checked readdirplus was not in nfs4. But you may want to reconfirm. readdirplus is a rpc call initiated by the client. So has to be disabled on the nfs client. On 03/22/2011 06:13 PM, Thomas Lau wrote: is it limited to NFSv3 or NFSv4 or both NFS? out of topic question, how could I

Re: [Ocfs2-users] Mysterious server reboot

2011-03-18 Thread Sunil Mushran
This specific bug (associated with the message) has been fixed here. http://oss.oracle.com/git/?p=ocfs2-1.4.git;a=commit;h=1f667766cb67ed05b4d706aa82e8ad0b12eaae8b This should result in an oops and thus panic. But just on this node. If other nodes are rebooting then I suspect some sysctl values

Re: [Ocfs2-users] Mysterious server reboot

2011-03-18 Thread Sunil Mushran
On 03/18/2011 04:56 PM, Nikola Savic wrote: Sunil Mushran wrote: This specific bug (associated with the message) has been fixed here. http://oss.oracle.com/git/?p=ocfs2-1.4.git;a=commit;h=1f667766cb67ed05b4d706aa82e8ad0b12eaae8b This should result in an oops and thus panic. But just

Re: [Ocfs2-users] What could cause slow down betwen OCFS2 1.2.9 and 1.4.4

2011-03-11 Thread Sunil Mushran
Verifying the journal mode is easy enough. Remount with data=writeback. It can be done one node at a time. But since you upgraded from 4.5 to 5.5, you may have to cast a wider net considering the entire kernel also changed. On 03/11/2011 02:22 PM, Ulf Zimmermann wrote: We upgraded our

Re: [Ocfs2-users] node eviction

2011-03-04 Thread Sunil Mushran
This is from one of the surviving node. As to why a node died will be known by looking at the netconsole logs of the dead node. On 03/04/2011 02:01 PM, Garcia, Raymundo wrote: Hello... I wonder if someone have had similar problem like this... a node evicts almost in a weekly basis and I have

Re: [Ocfs2-users] OCFS2 1.4 + DRBD + iSCSI problem with DLM

2011-03-02 Thread Sunil Mushran
This is not a coherent shared disk environment. The iscsi target has no idea that the device is also being updated from another source. On 03/02/2011 04:30 PM, Nikola Savic wrote: Hello, I have 3 node setup using CentOS 5.5 and OCFS2 1.4. Disks from two nodes (node1 and node2) are

Re: [Ocfs2-users] Problem with mkfs.ocfs2

2011-03-02 Thread Sunil Mushran
The qs was regarding mkfs not fsck. On 03/01/2011 11:41 PM, Thomas Lau wrote: Does it means OCFS2 don't require to be offline to do fsck ? On 02/03/2011 2:43 PM, Sunil Mushran wrote: The real error should be in dmesg. Guessing it was unable to connect to all nodes. On Mar 1, 2011, at 9:23

Re: [Ocfs2-users] Problem with mkfs.ocfs2

2011-03-01 Thread Sunil Mushran
The real error should be in dmesg. Guessing it was unable to connect to all nodes. On Mar 1, 2011, at 9:23 PM, Curley, Rob rob.cur...@windriver.com wrote: When I try and run the mkfs.ocfs2 command after I have updated all of the other nodes in the cluster with the new node information in

Re: [Ocfs2-users] How to fsck.ocfs2

2011-02-22 Thread Sunil Mushran
- Hash: SHA1 Werner Flamme [22.02.2011 13:55]: Sunil Mushran [22.02.2011 03:40]: If fsck completed successfully, then o2image is not required. ---snip--- # fsck.ocfs2 -fy /dev/sdf 21 | tee /ocfs2/sap02/fsck_sap01_hot.txt fsck.ocfs2 1.4.3 Checking OCFS2 filesystem in /dev/sdf: Label

Re: [Ocfs2-users] How to fsck.ocfs2

2011-02-21 Thread Sunil Mushran
If fsck completed successfully, then o2image is not required. On 02/21/2011 08:24 AM, Werner Flamme wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Werner Flamme [21.02.2011 16:01]: Sunil, finally, fsck.ocfs2 finished. The size of the protocol is 8404218 bytes, and in my mind I do

Re: [Ocfs2-users] How to fsck.ocfs2

2011-02-18 Thread Sunil Mushran
Run with -fn. It is non destructive. Also use the latest fsck. 1.4.4 I believe. On Feb 18, 2011, at 8:46 AM, Werner Flamme werner.fla...@ufz.de wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Hi everyone, simple question: how can I execute a fsck.ocfs2 /dev/sdf? All I get is

Re: [Ocfs2-users] How to fsck.ocfs2

2011-02-18 Thread Sunil Mushran
size? Regards, Werner Sunil Mushran [18.02.2011 17:59]: Run with -fn. It is non destructive. Also use the latest fsck. 1.4.4 I believe. On Feb 18, 2011, at 8:46 AM, Werner Flammewerner.fla...@ufz.de wrote: Hi everyone, simple question: how can I execute a fsck.ocfs2 /dev/sdf? All I get

Re: [Ocfs2-users] Question about eth communication for ocfs2

2011-02-11 Thread Sunil Mushran
. whereas it was starting when using ocfs2 rpms 1.4.3-3 with same releases of pacemaker and corosync. Any idea ? Thanks a lot. Alain Sunil Mushran a écrit : On 02/08/2011 01:32 AM, Alain.Moulle wrote: OK but what I wonder now is : is OCFS2 really capable of fencing an adjacent node

Re: [Ocfs2-users] Determining which version of ocfs2 tools a filesystem was created with.

2011-02-08 Thread Sunil Mushran
On 02/07/2011 10:41 PM, Mikey Austin wrote: We have been using the ocfs2 tools (version 1.4.4) on gentoo (2.6.18-164.15.1 kernel) for around 6 months now. We would like to re-create all of the ocfs2 filesystems that were not created with mkfs.ocfs2 version 1.4.4, however we are not able to

Re: [Ocfs2-users] Question about eth communication for ocfs2

2011-02-08 Thread Sunil Mushran
On 02/08/2011 01:32 AM, Alain.Moulle wrote: OK but what I wonder now is : is OCFS2 really capable of fencing an adjacent node ? or is it only capable of node self-fencing ? I thought that ocfs2 was only capable of node self-fencing because there is no configuration of any fencing device (i.e.

Re: [Ocfs2-users] Ofcs2 Questions!

2011-02-02 Thread Sunil Mushran
version? distro? This workload will benefit a lot with the indexed directories available in ocfs2 1.6 (and mainline and sles11). The other thing to check is the amount memory in the virtual machines. File systems need memory to cache the inodes. If memory is lacking, the inodes are freed and

Re: [Ocfs2-users] Ofcs2 Questions!

2011-02-02 Thread Sunil Mushran
? []'sf.rique On Wed, Feb 2, 2011 at 11:54 PM, Sunil Mushran sunil.mush...@oracle.com mailto:sunil.mush...@oracle.com wrote: # grep Resources /sys/kernel/debug/o2dlm/UUID/dlm_state Lock Resources: 20713 (1139459) The first number is number of live resources. Second the number

Re: [Ocfs2-users] crash on mount

2011-01-29 Thread Sunil Mushran
fsck.ocfs2 once the file system is unmounted. do i need an other fsck.ocsf2 directly ? On Fri, 28 Jan 2011 15:04:52 -0800, Sunil Mushran wrote: Dump the following: debugfs.ocfs2 -R stat /dev/sdX This will save the errant inode. Then run fsck -fy. Hopefully that will fix

Re: [Ocfs2-users] node reboot during rsync

2011-01-28 Thread Sunil Mushran
Known issue. Was fixed many moons ago. Ping support for a new kernel with the fix. On 01/28/2011 12:14 AM, Cristian Gae wrote: Hello One of our 5 nodes cluster has rebooted during a rsync process between two ocfs2 v1.6 volumes, both mounted on this node. All nodes have the same OS, kernel,

Re: [Ocfs2-users] Backup strategy for ucm on ocfs shared file system

2011-01-28 Thread Sunil Mushran
A popular way is to use ro snapshots provided by the storage arrays. On 01/28/2011 03:09 PM, Kalra, Pratima wrote: Hello, What would be the suggested backup strategy for taking a backup of ucm ocfs mount point on a regular basis? Thanks, *Pratima Kalra* (916) 795-3833

Re: [Ocfs2-users] extreme Mod time differences

2011-01-19 Thread Sunil Mushran
This is mentioned in the doc/FAQ. Ocfs2 does not update the mtime on disk for non-extending directio writes. The correct times you are seeing are only in cache. We do this to allow multiple nodes to r/w concurrently to the same file. Having said that, in mainline we have recently added a mount

Re: [Ocfs2-users] ping_pong using o2cb and cman

2011-01-18 Thread Sunil Mushran
ping_long tests the fcntl() user locks. ocfs2 supports clustered fcntl() locking with cman and pacemaker cluster stacks. Not with o2cb. ocfs2 supports clustered flock() with all stacks. o2cb, cman and pacemaker. On 01/17/2011 07:25 AM, Dan Warner wrote: I was testing ocfs2 on a 2 node cluster

Re: [Ocfs2-users] Unable to build RPM ocfs2 1.2.9 for rhel4

2011-01-18 Thread Sunil Mushran
Do: # ./configure --with-kernel=/path/to/2.6.9-89.33.1.EL/devel/tree before # make rhel4_2.6.9-89.33.1.EL_rpm On 01/17/2011 07:36 AM, David Coulson wrote: I'm trying to build 1.2.9 for RHEL4 using the supplied make files, but it is not working for some reason - The src.rpm it is creating does

Re: [Ocfs2-users] ping_pong using o2cb and cman

2011-01-18 Thread Sunil Mushran
() locking within o2cb, on the wishlist of features to be implemented at a future date? Is it even a priority to do? Thanks, Dan On 18 January 2011 18:00, Sunil Mushran sunil.mush...@oracle.com mailto:sunil.mush...@oracle.com wrote: ping_long tests the fcntl() user locks. ocfs2 supports

Re: [Ocfs2-users] recive error while mounting linux partation using ocfs2

2010-12-19 Thread Sunil Mushran
Which distro? The datavolume mount option is only available for enterprise distros. On Dec 18, 2010, at 6:52 PM, zeeshan jahangir zeeshan4...@gmail.com wrote: hi, mount -t ocfs2 -o datavolume,nointr -L oracrsfile /u02 when i mount linux partation using above command i recieve the

Re: [Ocfs2-users] recive error while mounting linux partation using ocfs2

2010-12-19 Thread Sunil Mushran
what does dmesg say? On 12/19/2010 07:42 PM, zeeshan jahangir wrote: well i am using red hat enterprise linux 5.1 (kernel 2.6.18-128.el5xen) and use openfiler 2 as a storage in which i have created partation and map it. On 12/20/10, Sunil Mushransunil.mush...@oracle.com wrote: Which

<    1   2   3   4   5   6   7   8   9   10   >