Re: [Ocfs2-users] re: ocfs2 filesystem and space usage?

2007-04-20 Thread Sunil Mushran
# wget http://oss.oracle.com/~smushran/.debug/scripts/stat_sysdir.sh # chmod +x stat_sysdir.sh # ./stat_sysdir.sh -d sdb1 /tmp/sdb1.out Email sdb1.out Peter Santos wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Actually, I just found some small log files on that same file system that

Re: [Ocfs2-users] Single node setup

2007-04-20 Thread Sunil Mushran
Yes, this would be a good list for ocfs2 related qs. Why are the voting disk and ocr on different ocfs2 volumes/disks? They can exist on the same volume. As in , it appears you are using the raw paradigm of file == partition and you don't have to. Now say you have different volumes for both

Re: [Ocfs2-users] Can't mount OCFS2 filesytem

2007-04-18 Thread Sunil Mushran
Do: # cat /proc/partitions # cat /etc/fstab # mounted.ocfs2 -d Email the outputs. Wang Steven wrote: Hi All, I can't mount one of my OCFS2 filesystem even though the other three are running fine. Is there a easy way to get the filesystem mounted and then I can overwrite the files? OS:

Re: [Ocfs2-users] Can't mount OCFS2 filesytem

2007-04-18 Thread Sunil Mushran
ocfs2 2907032b-ca76-4754-a60e-b6ca1e5dc670 origcrsdata [EMAIL PROTECTED] ~]# -Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Thursday, 19 April 2007 9:11 AM To: Wang Steven Cc: ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] Can't mount OCFS2 filesytem

Re: Antw: Re: [Ocfs2-users] Bugzilla 856

2007-04-17 Thread Sunil Mushran
]$ make rhel4_2.6.9-42.0.10.ELsmp_rpm make: *** No rule to make target `rhel4_2.6.9-42.0.10.ELsmp_rpm'. Stop. [EMAIL PROTECTED] ocfs2-1.2.5]$ make rhel4_2.6.9-42.0.10.EL_rpm make: *** No rule to make target `rhel4_2.6.9-42.0.10.EL_rpm'. Stop. [EMAIL PROTECTED] ocfs2-1.2.5]$ Sunil Mushran [EMAIL

Re: [Ocfs2-users] RPM packages for RHEL 5

2007-04-17 Thread Sunil Mushran
It's being tested. Daniel wrote: Hello When can we expect RPM packages for RHEL 5? -- Daniel ___ Ocfs2-users mailing list Ocfs2-users@oss.oracle.com

Re: [Ocfs2-users] mount.ocfs2 blah

2007-04-17 Thread Sunil Mushran
fs_generation is randomly generated during mkfs and is added to each inode. fsck uses it to identify current inodes (and not confuse with inodes existing from before the mkfs). The error suggests that the root inode has a different fs_generation that what the superblock suggests. Unsure how

Re: [Ocfs2-users] O2CB in ocfs2tools 1.2.4

2007-04-13 Thread Sunil Mushran
, /etc/sysconfig/o2cb and /etc/init.d/o2cb are different. Latter is the init script whereas the former is the config file. File a bugzilla. We'll see what we can do. Randy Ramsdell wrote: Sunil Mushran wrote: Have you installed ocfs2-tools? o2cb init script is included in it. BTW 1.2.2

Re: [Ocfs2-users] OCFS2 Fencing, then panic

2007-04-11 Thread Sunil Mushran
Use private. enohi ibekwe wrote: The IP address on the cluster.conf file is the public IP address for the nodes. Original Message Follows From: Sunil Mushran [EMAIL PROTECTED] To: enohi ibekwe [EMAIL PROTECTED] CC: [EMAIL PROTECTED], [EMAIL PROTECTED], ocfs2-users@oss.oracle.com

Re: [Ocfs2-users] Catatonic nodes under SLES10

2007-04-10 Thread Sunil Mushran
Patches are welcome. :) Alexei_Roudnev wrote: If I know, that FS was not active for the last (for example) 30 seconds, and all buffers are written back more than 30 secoinds ago, I DO KNOW that there is not any IO in the pipelines. At least it can be configurable. 30 seconds - ok, may be

Re: [Ocfs2-users] Get error on mount.ocfs2 No such file or directory while mounting ....

2007-04-09 Thread Sunil Mushran
mkdir /u01 or /u02 As in, it appears you are missing the mount directory. Zosen Wang wrote: I try to install 2 nodes RAC in Linux 2.6.9.-22.EL by using Jeffery Hunter’s paper. I am getting problem to mount the ocfs2 file system. The following is mount command error output: [EMAIL PROTECTED]

Re: [Ocfs2-users] Catatonic nodes under SLES10

2007-04-09 Thread Sunil Mushran
, wait 1 more minute and switch to passive mode. - Original Message - From: Sunil Mushran [EMAIL PROTECTED] To: David Miller [EMAIL PROTECTED] Cc: ocfs2-users@oss.oracle.com Sent: Monday, April 09, 2007 3:18 PM Subject: Re: [Ocfs2-users] Catatonic nodes under SLES10 For io fencing

Re: [Ocfs2-users] Catatonic nodes under SLES10

2007-04-09 Thread Sunil Mushran
failures anyway. And reboots on each _ap chi_ cause more problems then bring benefits (except when OCFSv2 is used for critical data in the 100% time write mode). - Original Message - From: Sunil Mushran [EMAIL PROTECTED] To: Alexei_Roudnev [EMAIL PROTECTED] Cc: David Miller [EMAIL

Re: [Ocfs2-users] High on buffers and deep on swap

2007-04-09 Thread Sunil Mushran
Not sure whether I would call that high. It is a 5G box afterall. It is always hard to figure out memory related issues based on one output. Call Oracle support. They will give you an oswatcher script that monitors various stats giving us a much better view into the running system.

Re: [Ocfs2-users] OCFS2 Fencing, then panic

2007-04-06 Thread Sunil Mushran
You will have to provide more information. If you have a netconsole server configured, it would have the details. Else, I would recommend you configure one to catch the messages during fence. We have to see the deduce for the fence to determine the actual problem. enohi ibekwe wrote: Is this

Re: [Ocfs2-users] HowTo recover ocfs2 in a 10g four node cluster

2007-03-30 Thread Sunil Mushran
ocfs2 init script is mounting devices listed in /etc/fstab. Check the device names. If you are mounting by device name, the name may have changed. If so, fix the device name and also look up mount by label in the docs. John E wrote: Hi All, I needed to rebuild the operating system on one of

Re: [Ocfs2-users] 1.2.4 still eating memory??

2007-03-29 Thread Sunil Mushran
How long does it take for the node to die? File a new bugzilla with the following info. date/tmp/info.txt iostat -x 1 3 /tmp/info.txt vmstat 1 3 /tmp/info.txt top -b -n 1 | head -50 /tmp/info.txt ps -elf /tmp/info.txt cat /proc/meminfo

Re: [Ocfs2-users] ocfs2_file_sendfile: 372 ERROR

2007-03-26 Thread Sunil Mushran
#define ECONNRESET 104 /* Connection reset by peer */ #define EPIPE 32 /* Broken pipe */ Harmless. But do file a bug. oss.oracle.com/bugzilla. We should not be printing the ERROR. It should be handled by the userspace. Stephan Hendl wrote: Hi, I'm using a 4 node

Re: [Ocfs2-users] make error on 2.6.20

2007-03-22 Thread Sunil Mushran
The kernel has changed much. 1.2 will not build against 2.6.20. We have updated autogen to handle rhel5, but that's still 2.6.18. What are you trying to achieve? Why not use the ocfs2 modules shipped natively with fc6? If you want to run database, specify filesystemio_options=odirect in

Re: [Ocfs2-users] OCFS2 + DRBD 0.81 Casch

2007-03-20 Thread Sunil Mushran
The patch fix for this missed the 2.6.20 window. The following link has all the relevant patches atop 2.6.20. http://git.kernel.org/?p=linux/kernel/git/mfasheh/ocfs2.git;a=log;h=2.6.20_fixes Apply all in order starting from one after the official 2.6.20. Incidentally the fix you require is the

Re: [Ocfs2-users] ocfs2 is still eating memory

2007-03-16 Thread Sunil Mushran
in memory? That's the entire point of caches. If we start OOM killing processes due to the caches taking all the memory, that's absolutely a bug. Here is what Sunil Mushran from Oracle had to say about the issue: Well, kswapd is supposed to flush the caches. As in, the vm controls

Re: [Ocfs2-users] ocfs2 v.1.2.5 question

2007-03-15 Thread Sunil Mushran
Yes. 1.2.5 will have the configurable network timeout. Randy Ramsdell wrote: Hi, We are planning an upgrade of an ocfs2 cluster and I wanted to clarify something first. Is 1.2.5 going to include a variable to set the network timeout? This seems to be important as we have had to move processes

Re: [Ocfs2-users] ocfs2 cluster becomes unresponsive

2007-03-13 Thread Sunil Mushran
Have you tried to do alt-sysrq-t on the dead node? The stack traces will be of great help. Also, even though this could be the same as #819, I would still recommend filing a new bug with all the messages files. Even though that will take some of your time, it will be much easier to keep track

Re: [Ocfs2-users] data volume option, is it present in current version of ocfs2

2007-03-13 Thread Sunil Mushran
holds support for it?Any other ways of using ocfs2? thanks for the response regards, Nirmal Tom. From: Sunil Mushran [EMAIL PROTECTED] To: nirmal tom [EMAIL PROTECTED] CC: ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] data volume option,is it present in current version of ocfs2

Re: [Ocfs2-users] ocfs2 cluster becomes unresponsive

2007-03-10 Thread Sunil Mushran
The config error I would imagine would be that you defined two different clusters, each not having the other node, and that the two nodes have the same node number in both clusters. If so, the disk hb would have detected this error. It would have spewed error messages indicating that some other

Re: [Ocfs2-users] ocfs2 is still eating memory

2007-03-09 Thread Sunil Mushran
harsh and disrespectful to say the least. Which is never really appreciated. A little bit of respect and more constructive feedback usually goes a very long way. Everyone is tryong their best. -Original Message- From: Alexei_Roudnev [EMAIL PROTECTED] To: Sunil Mushran [EMAIL PROTECTED

Re: [Ocfs2-users] ocfs2 is still eating memory

2007-03-08 Thread Sunil Mushran
If you are running a prod shop, you should looking into buying support. John Lange wrote: On Mon, 2007-03-05 at 13:46 -0800, Sunil Mushran wrote: Well, kswapd is supposed to flush the caches. As in, the vm controls the lifetime of the inodes in the inode_cache not ocfs2. All ocfs2 can do

[Ocfs2-users] OCFS2 Tools 1.2.3 released

2007-03-02 Thread Sunil Mushran
All, We are pleased to announce the release of OCFS2 Tools 1.2.3. This release is fully compatible with the OCFS2 1.2.x and the OCFS2 bundled with the mainline Linux kernel 2.6.20 (and earlier). The summary of changes in this release are as follows: * Backup super block support added * Local

Re: [Ocfs2-users] growing a ocfs2 filesystem

2007-02-28 Thread Sunil Mushran
Means that the volume is in use on at least one node in the cluster. If you were using the native o2cb heartbeat, the following command would have shown the heartbeating node. # watch -d -n2 debugfs.ocfs2 -R \hb\ /dev/sdX Ping SUSE to find out the details when using ocfs2 with linux-ha.

Re: [Ocfs2-users] dlm timeouts and following errors -112

2007-02-26 Thread Sunil Mushran
Yes, the messages are related. -112 is EHOSTDOWN. Sebastian Reitenbach wrote: Hi list, I am experimenting with ocfs2 (rpm package: 1.2.2-0.2), using linux-ha 2.0.8 (all running on a SLES 10 x86-64, rpm packages from linux-ha.org) for the heartbeat. The three nodes are connected on a gigabit

Re: [Ocfs2-users] High on buffers and deep on swap

2007-02-26 Thread Sunil Mushran
is 2.6.9-34.ELsmp (RedHat 4.0) as this is our production cluster and we did not update it recently. Regards, Luis */Sunil Mushran [EMAIL PROTECTED]/* wrote: Hmmm... the last time I saw your numbers, ocfs2's foot print was 15M. You'll have to do better than that. Anycase, Luis problem

Re: [Ocfs2-users] Problems with ocfs2 when rebooting the first node.

2007-02-26 Thread Sunil Mushran
Check out this bug: http://oss.oracle.com/bugzilla/show_bug.cgi?id=854 José Costa wrote: Hello, I'm using 2.6.16.41-SLES10_SP1_BRANCH_20070220135926-smp with OCFS2 1.2.4. If I start the node1 and then the node2... everything works. If I reboot the node1, it gives this error to node2 and I

Re: {Spam?} Re: [Ocfs2-users] Performance Problems while reading

2007-02-23 Thread Sunil Mushran
Egon Burgener wrote: And you are convinced that drdb's primary-primary is not the cause for the slowdown. ?? Yes, writing a file is fast. Reading a file has no influence on drbd. We noticed, that reading a big file on one node while the other node opened that file in RW mode but without

Re: [Ocfs2-users] relation between ocfs2 1.2.4-2 and kernel.org GIT HEAD

2007-02-20 Thread Sunil Mushran
OCFS2 has two trees. The 1.2 tree and the git tree. All new development happens on git head. All bug fixes are typically worked on the tree that it was detected on. Later, the bug fix is applied to the other tree. As most of our users are using the 1.2 tree, almost all bug fixes flow from the

Re: [Ocfs2-users] ocfs2 with user based heartbeat

2007-02-16 Thread Sunil Mushran
That's probably dlm communication. You should be able to confirm that using ethereal/wireshark. http://oss.oracle.com/~smushran/.debug/wireshark/ Sebastian Reitenbach wrote: Hi list, I just have a quick question. We are experimenting with ocfs2 and linux heartbeat, using user based heartbeat

Re: [Ocfs2-users] Re: [Linux-HA] OCFS2 - Memory hog?

2007-02-15 Thread Sunil Mushran
Fixed in 1.2.4. SUSE has the patch-fix. The patch has also been added to mainline. John Lange wrote: Yes, the clients are doing lots of creates. But my question is, if this is a memory leak, why does ocfs2 eat up the memory as soon as the clients start accessing the filesystem. Within about

Re: [Ocfs2-users] 2 OCFS2 clusters that affect each other

2007-02-15 Thread Sunil Mushran
Do you have the full oops trace? Nathan Ehresman wrote: I have a strange OCFS2 problem that has been plaguing me. I have 2 separate OCFS2 clusters, each consisting of 3 machines. One is an Oracle RAC, the other is used as a shared DocumentRoot for a web cluster. All 6 machines are in an

Re: [Ocfs2-users] segfault on 1.2.4

2007-02-14 Thread Sunil Mushran
I meant the solution mentioned by Mark is listed on the ocfs2 home page. Randy Ramsdell wrote: I configured a new cluster using 1.2.4-2. This was a custom install by compiling source. Any ideas or questions? The segmentation fault produced with rm -rf or any other rm switch : NOTE: This does

Re: [Ocfs2-users] segfault on 1.2.4

2007-02-14 Thread Sunil Mushran
Yes, this is mentioned on the ocfs2 home page. Randy Ramsdell wrote: I configured a new cluster using 1.2.4-2. This was a custom install by compiling source. Any ideas or questions? The segmentation fault produced with rm -rf or any other rm switch : NOTE: This does not happen on a local

Re: [Ocfs2-users] ocfs2console

2007-02-13 Thread Sunil Mushran
It's probably because you are missing some package. See the FAQ for the list of packages it is dependent on. Randy Ramsdell wrote: Hi, I see that the ocfs2console app for 1.2.2 doesn't have the same menu items as does the 1.1.0 package. Is the propagate config, check and repair going to be

Re: [Ocfs2-users] 1.2.4 symbols

2007-02-09 Thread Sunil Mushran
What does dmesg say? Randy Ramsdell wrote: Hi, Everything compiled correctly for the ocfs2 package, but so far the modules will not load with the well known module symbol error. FATAL: Error inserting ocfs2 (/lib/modules/2.6.16.27-0.6-smp/kernel/fs/ocfs2/ocfs2.ko): Unknown symbol in module,

Re: [Ocfs2-users] 1.2.4 symbols

2007-02-09 Thread Sunil Mushran
Appears ocfs2 fs module is not compatible with the other modules ocfs2_dlm.ko, ocfs2_nodemanager.ko, ocfs2_dlmfs.ko, configfs.ko. When you build the modules, ensure you copy all of them in /lib/modules/... before running depmod -a. Randy Ramsdell wrote: Sunil Mushran wrote: What does

Re: [Ocfs2-users] Does OCFS2 support these features?

2007-02-09 Thread Sunil Mushran
No. Lin Shen (lshen) wrote: Hi, Can someone let me know if OCFS2 support the following features. 1. Quota. 2. POSIX ACL 3. Clustered volume manager Lin ___ Ocfs2-users mailing list Ocfs2-users@oss.oracle.com

Re: [Ocfs2-users] 1.2.4 symbols

2007-02-09 Thread Sunil Mushran
I do 2. You could also look into adding the modules to /lib/modules/`uname -r`/update. I believe depmod searches that patch before the others. Randy Ramsdell wrote: Sunil Mushran wrote: Appears ocfs2 fs module is not compatible with the other modules ocfs2_dlm.ko

Re: [Ocfs2-users] 1.3.3 mount problem

2007-02-07 Thread Sunil Mushran
The datavolume code is not in mainline. But you should be able to get Oracle RDBMS to work with it. Ensure the init.ora paramater filesystemio_options is set to direct_io. Ivo Maya wrote: Hi, I need to mount ocfs2 with datavolume option on open SuSE 10.2 Machines. ocfs2 is 1.3.3 version and

Re: [Ocfs2-users] OCFS2 1.2.4-2 released

2007-02-06 Thread Sunil Mushran
That's the source. Randy Ramsdell wrote: Mark Fasheh wrote: On Tue, Feb 06, 2007 at 10:18:51AM -0500, Randy Ramsdell wrote: Is source available? http://oss.oracle.com/projects/ocfs2/dist/files/source/v1.2/ocfs2-1.2.4.tar.gz --Mark -- Mark Fasheh Senior

Re: [Ocfs2-users] ocfs2-tools-1.2.2 compile.

2007-02-06 Thread Sunil Mushran
The following patch will address this issue. The fix will be provided with the next tools release. Index: libocfs2/include/ocfs2.h === --- libocfs2/include/ocfs2.h(revision 1269) +++ libocfs2/include/ocfs2.h(revision 1270)

Re: [Ocfs2-users] OCFS2 mount problem

2007-02-05 Thread Sunil Mushran
It could be that the device name is not the same across the two nodes. Do: # mounted.ocfs2 -d on both nodes. Match the device using the uuid. As in, you should see a device with the same uuid on both nodes. If not, then the device is not shared. If you do see the device on both nodes but with

Re: [Ocfs2-users] OCFS2 mount problem

2007-02-05 Thread Sunil Mushran
The device needs to be shared. As in, both nodes need to be able to see the same device concurrently. Refer to iscsi, fiber channel, aoe, etc. aibolit 66 wrote: -Original Message- From: Sunil Mushran [EMAIL PROTECTED] To: aibolit 66 [EMAIL PROTECTED] Date: Mon, 05 Feb 2007 12:46:26

[Ocfs2-users] OCFS2 1.2.4-2 released

2007-02-02 Thread Sunil Mushran
All, We are pleased to announce the release of OCFS2 1.2.4-2. This release addresses the lowmem consumption issue that has plagued many users. It also addresses few races in the dlm relating to the lockres migration. The complete list of changes post 1.2.3 is available here:

Re: [Ocfs2-users] unable to configure O2CB_HEARTBEAT_THRESHOLD

2007-01-24 Thread Sunil Mushran
The o2cb script fix is in ocfs2-tools 1.2.2 released Oct 2006. Ping SUSE for the update. [EMAIL PROTECTED] wrote: Using SuSE SP2 Linux running V1.0.8 of OCFS2 and the tools/console that comes with SP2 distribution. I am unable to set the* O2CB_HEARTBEAT_THRESHOLD* parameter in the

Re: [Ocfs2-users] ocfs2 kernel bug in Fedora Core 4 update kernel

2007-01-24 Thread Sunil Mushran
This is not a fs issue. As in the file must be alright. This is a dlm issue. The fs is asking the dlm to free the lock and the dlm is stuck. How many nodes do you have? We've fixed a bunch of dlm bugs since what you appear to be running. davide rossetti wrote: I rebooted the two faulty nodes.

Re: [Ocfs2-users] ocfs2 kernel bug in Fedora Core 4 update kernel

2007-01-23 Thread Sunil Mushran
This was the lvb issue that was fixed long ago. In the 1.2 tree, it was fixed in 1.2.2. 2.6.18 should definitely have the fix for this. davide rossetti wrote: OS: Fedora Core release 4 (Stentz) KERNEL: Linux rack1.ape 2.6.17-1.2142_FC4smp #1 SMP Tue Jul 11 22:57:02 EDT 2006 i686 i686 i386

Re: [Ocfs2-users] ocfs2 kernel bug in Fedora Core 4 update kernel

2007-01-23 Thread Sunil Mushran
wrote: On 1/23/07, *Sunil Mushran* [EMAIL PROTECTED] mailto:[EMAIL PROTECTED] wrote: This was the lvb issue that was fixed long ago. In the 1.2 tree, it was fixed in 1.2.2. 2.6.18 should definitely have the fix for this. it seems it's even more recent: /var/log/messages.4:Dec

Re: [Ocfs2-users] ocfs2_cdsl_follow_link errors

2007-01-22 Thread Sunil Mushran
#define EACCES 13 /* Permission denied */ The messages are harmless. Patch to silence them has already been checked into the 1.2 repo and mainline git. Matthew Flusche wrote: I’m seeing the following errors in my two node cluster. Is this anything to be concerned with? Host information:

Re: [Ocfs2-users] kernel panic - not syncing

2007-01-22 Thread Sunil Mushran
o2net timeout cannot cause the o2hb panic. The two are totally different. From the outputs, I would guess o2hb is timing out but I cannot say for sure till I don't see the full logs. Andy Phillips wrote: Its worth pointing out that the o2net idle timer is triggering on the network heartbeat,

Re: [Ocfs2-users] kernel panic - not syncing

2007-01-22 Thread Sunil Mushran
:38 -0800, Sunil Mushran wrote: o2net timeout cannot cause the o2hb panic. The two are totally different. From the outputs, I would guess o2hb is timing out but I cannot say for sure till I don't see the full logs. Andy Phillips wrote: Its worth pointing out that the o2net idle timer

Re: [Ocfs2-users] ocfs2 keeps fencing all my nodes

2007-01-18 Thread Sunil Mushran
1. In SLES10, the /config has been moved to /sys/kernel/config. That's how it is on mainline. 2. To monitor heartbeat do: # watch -d -n2 debugfs.ocfs2 -R hb /dev/sdX This comand will work if you have ocfs2-tools 1.2.2. (Not sure whether sles10 ships with 1.2.2 or 1.2.1.) If 1.2.1, do: # watch

[Ocfs2-users] ocfs2-1.2.4 RC2 released

2007-01-17 Thread Sunil Mushran
All, http://oss.oracle.com/~smushran/.ocfs2-1.2.4-0.2/ The final 1.2.4 should look very close to this drop. We still have one slippery issue open that we are working on. But, other than that, this drop is looking good. The list of patches added post 1.2.4-0.1 is as follows: r2948: fs - Allow

Re: [Ocfs2-users] OCFS2 crash

2007-01-16 Thread Sunil Mushran
Looks to be running out of lowmem. # date # cat /proc/meminfo # cat /proc/slabinfo Run a script that dumps the above every 1 to 5 mins. That should help explain the cause. Brian Sieler wrote: Using 2-node clustered file system on DELL/EMC SAN/RHEL 2.6.9-34.0.2.ELsmp x86_64. Config:

Re: [Ocfs2-users] mount error

2007-01-09 Thread Sunil Mushran
You are using two different versions of ocfs2 on the two nodes. Different enough that they are not network compatible. It is working as designed. Consulente3 wrote: Hi, I'm new to ocfs2, and in my test's environment, i have: 2 node, becks and vaix becks can mount ocfs2 fs, but vaix can't.

Re: [Ocfs2-users] Kernel panic - not syncing: ocfs2 is very sorry

2007-01-05 Thread Sunil Mushran
Lot of ink has been spilled on this subject. ;) Check out the heartbeat section in the FAQ. One easy solution is to increase the hb timeout to 60 secs... O2CB_HEARTBEAT_THRESHOLD = 31 We will leaning towards making that number the default in the 1.4 release. George Liu wrote: Both systems

Re: [Ocfs2-users] Problem installing OCFS 1.2.3

2007-01-04 Thread Sunil Mushran
depmod -a ? Lin Shen (lshen) wrote: Switched the kernel to 2.6.9-42.Elsmp, still got the same error. [EMAIL PROTECTED] Desktop]# uname -a Linux cfs2 2.6.9-42.ELsmp #1 SMP Wed Jul 12 23:27:17 EDT 2006 i686 i686 i386 GNU/Linux -Original Message- From: Sunil Mushran [mailto:[EMAIL

Re: [Ocfs2-users] Problem installing OCFS 1.2.3

2007-01-04 Thread Sunil Mushran
code is pretty well contained and isolated. while we have discussed tipc, not sure if we ever gave it a serious look. lin -Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Thursday, January 04, 2007 1:21 PM To: Lin Shen (lshen) Cc: ocfs2-users

Re: [Ocfs2-users] Problem installing OCFS 1.2.3

2007-01-04 Thread Sunil Mushran
theoretically yes... but for practical usage go with atleast iscsi Lin Shen (lshen) wrote: So w/o shared disk, is it possible to make OCFS2 to work by utilizing GNBD or etc? lin -Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Thursday, January 04, 2007 2

Re: [Ocfs2-users] update on o2net_idle_timer

2007-01-04 Thread Sunil Mushran
That and also we've seen similar issues with Broadcom TG3 drivers. We use Intel E1000 mostly and thus did not experience the same issue. As far as the configurable net timeouts goes, the patch was added into mainline on Dec 4th. So it will be available with ocfs2 1.4. We are still seeing if we

Re: [Ocfs2-users] re: is it possible for the o2cb stack to monitor multiple clusternames on the same box

2006-12-20 Thread Sunil Mushran
, they will never be part of that domain. Sunil Mushran wrote: Currently it supports only one cluster. Peter Santos wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Folks, When I installed ocfs2 the first time and setup oracle to work with it, the clustername defaulted to ocfs2. We

Re: [Ocfs2-users] Oracle Application Server 10.1.2.0.2 Install on OCFS2

2006-12-06 Thread Sunil Mushran
strace apache. That may provide us with some clues. [EMAIL PROTECTED] wrote: Hello all, Has anyone installed Oracle Application Server 10.1.2.0.2 Infrastructure tier including the preseeded 10.1.0.4 database (High Availability option otherwise known as a cold failover cluster) on OCFS2

Re: [Ocfs2-users] OCFS2 and berkeley database files

2006-12-06 Thread Sunil Mushran
ocfs2 supports private mmap r/w and shared mmap readonly. Shared mmap writeable is the only piece missing. We should have that by 1.4. Alexei_Roudnev wrote: There was a clear answer, WHY it did not worked on OCFSv2: - BerkleyDB and LDAP uses mmap to the files; - OCFSv2 don't implement it

Re: [Ocfs2-users] OCFS2 and berkeley database files

2006-12-05 Thread Sunil Mushran
You are on a very old release of OCFS2. The OCFS2 homepage and FAQ both list a SLES9 kernel version newer than the one you are using. But that may not be the reason for the error. My bet is that bdb is attempting to create a shared writeable mmap that ocfs2 1.2 does not support. [EMAIL

Re: [Ocfs2-users] Oracle 9i RAC on OCFS2

2006-11-27 Thread Sunil Mushran
Refer to CDSL (Conext Dependent Symbolic Links) in the OCFS2 user's guide. Marcel Savelkoul wrote: Hi, I'm setting up a 2-node Oracle 9i RAC on OCFS2. But I have some problems with understanding how the shared Oracle_Home is being used. For instance there is the

Re: [Ocfs2-users] re: o2hb_write_timeout:270 ERROR: Heartbeat write timeout

2006-11-22 Thread Sunil Mushran
is this message getting 10 seconds from? Also this message is displayed because dbo2 was not able to check into the hearbeat filesystem right ? - -peter Sunil Mushran wrote: On nodes db01 and db03 hb timed-out at 17:12:49. However, the nodes did not fully panic. As in, the network

Re: [Ocfs2-users] Bad magic number in inode

2006-11-15 Thread Sunil Mushran
The quick detect just looks for the superblock which is in the third block of the device. The full detect looks up the superblock and then the system directory. In your case it fails to locate the latter. This is one of the quirks when using an unpartitioned disk and later partitioning it. The

Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 problem

2006-11-15 Thread Sunil Mushran
input is greatly appreciated. Thanks, Colin Farley Network Administrator E-Care Contact Center Services Phone:(204) 940-6244 Fax:(204) 940-7394 Sunil Mushran

Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 problem

2006-11-15 Thread Sunil Mushran
system). Well known problem with OCFSv2. One solution is to add 3-d node and use interface bonding (be sure that interface convergeency time is less that o2cb timeout). - Original Message - From: [EMAIL PROTECTED] To: Sunil Mushran [EMAIL PROTECTED] Cc: ocfs2-users@oss.oracle.com Sent

Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 problem

2006-11-15 Thread Sunil Mushran
and everything will change. - Original Message - From: Sunil Mushran [EMAIL PROTECTED] To: Alexei_Roudnev [EMAIL PROTECTED] Cc: [EMAIL PROTECTED]; ocfs2-users@oss.oracle.com Sent: Wednesday, November 15, 2006 11:03 AM Subject: Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 problem You

Re: [Ocfs2-users] Ocfs2 errors on 3 node cluster

2006-11-14 Thread Sunil Mushran
It will be easier if you file a bug on oss.oracle.com/bugzilla with all the details. Like messages files from all nodes, etc. Why are you using 1.2.1? 1.2.3 has been out for few months now. Randy Ramsdell wrote: Hi, Maybe someone could elaborate on these re-occuring ocfs2 errors that always

Re: [Ocfs2-users] frozen ocfs2 filesystem under heavy webserver load

2006-11-13 Thread Sunil Mushran
None of these locks are busy. So they should not be the cause of the problem. Start with the version of ocfs2. Also, which kernel? What does top say? Is some process spinning? Also, what does this stresstest entail? Stephan Hendl wrote: Hi, I use a cluster of 4 nodes with ocfs2 as a

Re: [Ocfs2-users] about 2 nodes enviroment and metalink note 394827.1

2006-11-09 Thread Sunil Mushran
I would imagine you are using RHEL4. If so, upgrade the ocfs2-tools to 1.2.2. The previous version of the ocfs2 init script did not always umount ocfs2 volumes on clean shutdowns leading to this problem. [EMAIL PROTECTED] wrote: Hi to all: In 2 nodes environment I've 'suffered' the 'reboot 1st

Re: [Ocfs2-users] OCFS2 Block / Clustersize with Oracle 10gR2

2006-11-09 Thread Sunil Mushran
of 3,400 IO/sec while the same benchmark with the same data will max out at 7K+ IO/sec on RAW. I'll grab the iostat data which we've kept over time and try to make some sense of it before posting anything additional. Thanks. /Brian/ On Thu, 2006-11-09 at 10:20 -0800, Sunil Mushran wrote: Why

Re: [Ocfs2-users] Newbie questions -- is OCFS2 what I even want?

2006-11-03 Thread Sunil Mushran
You are probably looking for a distributed file system. Check out afs and/or v9fs. Thad Beier wrote: Dear Sirs and Madams, I run a small visual effects production company, Hammerhead Productions. We'd like to have an easily extensible inexpensive relatively high-performance storage network

Re: [Ocfs2-users] ocfs2 error messages

2006-10-31 Thread Sunil Mushran
Are you using NFS by any chance? I am looking into bug#790 that also encounters the same error (ESTALE). Matthew Flusche wrote: I received the following error messages in the system logs. Is this anything to be concerned with? kernel: (4074,0):ocfs2_populate_inode:234 ERROR: Invalid

Re: [Ocfs2-users] Interesting Error

2006-10-31 Thread Sunil Mushran
Replace sdX with the device on which the ocfs2 fs exists. You can use mount | grep ocfs2 to find that volume. If the inode on disk is good, one explanation for the issue could be the lvb bug which was fixed in 1.2.2. Ping Novell to get a PTF kernel with ocfs2 1.2.3. Andy Kipp wrote: Which

Re: [Ocfs2-users] ocfs2 error messages

2006-10-31 Thread Sunil Mushran
So it is bug#790. It just may be a case of unnecessary error messages for you. I am still investigating it. Matthew Flusche wrote: Yes, one of the clustered file systems is shared with nfs. -Original Message- From: Sunil Mushran [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 31

Re: [Ocfs2-users] Ocfs2 and low memory

2006-10-31 Thread Sunil Mushran
To monitor ocfs2 memory usage, do: # cat /proc/slabinfo | egrep 'ocfs|dlm|size-256 |size-32 ' ocfs2_lock16226 16 2261 : tunables 120 60 0 : slabdata 1 1 0 ocfs2_inode_cache 22 24 115231 : tunables 24 12 0 : slabdata 8

Re: [Ocfs2-users] Interesting Error

2006-10-30 Thread Sunil Mushran
Which version of OCFS2? Did you run fsck.ocfs2 -f on that device? Do: # echo stat 6518860 | debugfs.ocfs2 -n /dev/sdX /tmp/ext.out Email ext.out. Andy Kipp wrote: Anybody have any idea what this error involves? Or how to resolve it? Oct 30 05:11:24 groupwise-1-mht kernel:

Re: [Ocfs2-users] BUG: unable to handle kernel NULL pointer dereference

2006-10-27 Thread Sunil Mushran
Please file a bugzilla with the details provided. It is easier to manage bugs that a way. Thanks Christian Schlittchen wrote: Thanks to syncronous writes on the log-files I finally managed to get a log of the regular panics we experience. The setup is as follows: Three blades (IBM HS20)

Re: [Ocfs2-users] Unexpected reboot / crash

2006-10-27 Thread Sunil Mushran
The first issue could be because you don't have ocfs2-tools 1.2.2. The earlier version was missing a line in the ocfs2 init script. Rafal Maliszewski wrote: Hi guys I installed ocfs2 on 4 node (redhat 4u3) on shared FC devices ( EMC storage ). So I've noticed several problems: 1. When I

Re: [Ocfs2-users] 1.2.2 dump issue

2006-10-25 Thread Sunil Mushran
As the ocfs2 home page suggests, when building 1.2.x against mainline 2.6.14 and above, specify GENERIC_DELETE_INODE_NOT_TRUNCATES=1. Peter Larsen wrote: I'm running 1.2.2 here - compiled from source, and while I can read files, trying to delete a file on my OCFS2 volume produces the following:

Re: [Ocfs2-users] lvm2 not cluster aware - okay, so how should Istripe my LUNs?

2006-10-25 Thread Sunil Mushran
Fabio Corazza wrote: Last but not least.. a question for Sunil if he's gonna read this.. when OCFS2 will support data-on-inode would we need to reformat the file systems or will the new module be compatible with the 1.4 on-disk data? I am envisioning a compat flag to be added on existing

Re: [Ocfs2-users] OCFS2 Fencing and Locking MSA500 Array: Help

2006-10-25 Thread Sunil Mushran
Oct 11 05:15:28 vhaispora01 kernel: cciss0: unsolicited abort f7000250 Oct 11 05:15:28 vhaispora01 kernel: cciss0: retrying f7000250 That's where the problem begins. The cciss driver is unable to to complete the ios due to a bus reset maybe. Ping HP or whoever your contact is for the MSA500.

Re: [Ocfs2-users] RHEL 4 hotfix RPMs?

2006-10-23 Thread Sunil Mushran
# ./configure --with-kernel=/usr/src/kernels/2.6.9-42.X.EL-smp-i686/ # make rhel4_2.6.9-42.X.EL_rpm The rpms will be in the rpmdir as specified in ~/.rpmmacros. ~$ cat .rpmmacros %_topdir/rpmbuild/user %_tmppath /rpmbuild/user/tmp %_sourcedir /rpmbuild/user/SOURCES %_specdir

Re: [Ocfs2-users] SUSE Patches

2006-10-20 Thread Sunil Mushran
Ping Novell. They issue interim PTF SLES kernels with the required fix(es) to help users tide over until the formal release. Needless to add, you need to have Novell Support. Andy Kipp wrote: Hello all, I am running SLES9 with the latest kernel patches (2.6.5-7.282-bigsmp) and ocfs2 version

Re: [Ocfs2-users] Getting Started with ocfs2

2006-10-11 Thread Sunil Mushran
Martin J. Evans wrote: fine but on selecting cluster/configure nodes I still get dialogue saying Could not query the state of the cluster stack. This must be resolved before any OCFS2 filesystemcan be mounted. Could be because the script is installed as o2cb and not o2cb.init. Fedora

Re: [Ocfs2-users] out of memory... doing heavy IO on ocfs2 is wasting (low) memory?!

2006-10-11 Thread Sunil Mushran
Still in testing. It is a larger patch than normal and thus requires more time/effort. Once we are comfortable with it, we will look into releasing the patch for others to test before releasing 1.2.4. Jonah H. Harris wrote: What's the status on this? I've researched Bugzilla, SVN, and the

[Ocfs2-users] disk heartbeat timeout poll

2006-10-11 Thread Sunil Mushran
Thanks for all the replies in the previous usage poll. One of the chief concerns expressed was the (very) low default disk heartbeat timeout setting. Well, we want to bump it up but to what? Here are some qs the answers to which will help us determine that value. 1. What is the your disk

Re: [Ocfs2-users] 2 Node cluster, and nodes OS hang

2006-10-06 Thread Sunil Mushran
tcpdump -i eth1 -C 10 -W 15 -s 1 -Sw /tmp/`hostname -s`_tcpdump.log -ttt 'port ' Do this on both nodes before mounting on the second node. Ping me with the path to the logs. [EMAIL PROTECTED] wrote: Hello All, I have a NAS that I would like to use ocfs2 on. Currently there are

Re: [Ocfs2-users] Resizing mountpoint in ocfs2

2006-10-05 Thread Sunil Mushran
Yes, the last patch to add this feature is in review. We will release this as part of ocfs2-tools 1.2.2. Kerr-Sheppard, Stephen wrote: Has anyone had to resize a mountpoint in ocfs2. In ocfs version 1 it was a case of unmounting and using the resizeocfs command. Is this still the same for

Re: [Ocfs2-users] Re: FW: Use of OCFS2 file systems.

2006-10-04 Thread Sunil Mushran
File a bug on bugzilla (oss.oracle.com/bugzilla) with the full oops trace and any other information that seems relevant. Galan Merchan, Martin wrote: Hello, I’m working with OCFS2 on Radhat Advanced Server 4 Patch 3 and I had kernel panics too. I use OCFS2 only for RAC archive logs and RMAN

Re: [Ocfs2-users] Use of OCFS2 file systems.

2006-09-29 Thread Sunil Mushran
Yes. Bill Wells wrote: All, Can someone comment on whether it is recommended to use the OCFS2 file system for the admin directories of a RAC database. Specifically, for bdump, udump, cdump, etc. This is being considered on RHEL4-U4 with 10gR2 on a 3 node cluster. Thanks much, Bill Wells

<    4   5   6   7   8   9   10   >