Re: [Ocfs2-users] Reservation conflicts

2010-12-15 Thread Sunil Mushran
, 2010 at 3:25 PM, Sunil Mushran sunil.mush...@oracle.com mailto:sunil.mush...@oracle.com wrote: I meant repeats 60 secs at a stretch. If not, as it seems so, then the messages should be only annoying. VMFS uses SCSI Reservation to perform disk based locking. See if they have some

Re: [Ocfs2-users] loss of connection

2010-12-15 Thread Sunil Mushran
be kvm related. Maybe. I am guessing here. See the ubuntu bug db. Maybe they have another report of a similar issue. That may tell us more. On 12/14/2010 11:17 PM, Andreas Rittershofer wrote: Am 15.12.2010 um 08:04 schrieb Sunil Mushran: On 12/14/2010 10:59 PM, Andreas Rittershofer wrote: My

Re: [Ocfs2-users] Reservation conflicts

2010-12-14 Thread Sunil Mushran
sd 1:0:0:0: reservation conflict That's the cause of the error in the guest. You'll have to track the error to ESX's management domain. See the logs. Does this error come repeatedly? This error is only a problem for o2hb if it continues for the next 60 secs. Else it can be ignored. On

Re: [Ocfs2-users] Reservation conflicts

2010-12-14 Thread Sunil Mushran
):o2hb_do_disk_heartbeat:753 ERROR: status = -5 On Tue, Dec 14, 2010 at 11:38 AM, Sunil Mushran sunil.mush...@oracle.com mailto:sunil.mush...@oracle.com wrote: sd 1:0:0:0: reservation conflict That's the cause of the error in the guest. You'll have to track the error to ESX's

Re: [Ocfs2-users] loss of connection

2010-12-14 Thread Sunil Mushran
On 12/14/2010 10:59 PM, Andreas Rittershofer wrote: My log says suddenly: Dec 14 02:35:16 hp1 kernel: [1492482.232822] o2net: no longer connected to node hp2 (num 1) at 192.168.1.2: Dec 14 02:35:18 hp1 kernel: [1492483.960150] BUG: soft lockup - CPU#1 stuck for 61s! [kvm:32398] I

Re: [Ocfs2-users] Slow I/O on ocfs2 file system

2010-12-13 Thread Sunil Mushran
Can you email me the following: debugfs.ocfs2 -R stats /dev/mapper/3600a0b800067df8303ba4ba0f5ef debugfs.ocfs2 -R stat dd-1G /dev/mapper/3600a0b800067df8303ba4ba0f5ef This will tell us the fs params and the file layout. Next run the same with different blocksizes. Run the same with

Re: [Ocfs2-users] Extremely poor write performance, but read appears to be okay

2010-12-13 Thread Sunil Mushran
Please can you add this info to a bugzilla to make it easier to track. I delete emails freq and thus lose context. To that, attach the output of the following script. http://oss.oracle.com/~smushran/debug/scripts/stat_sysdir.sh Also, can you isolate the issue to a file. As in, even if you are

Re: [Ocfs2-users] servers blocked on ocfs2

2010-12-09 Thread Sunil Mushran
The interconnect is the problem. Don't use crossover cables. Use a gige link with a proper switch. That's what the world uses. On 12/09/2010 02:10 AM, frank wrote: Hi, we have recently started to use ocfs2 on some RHEL 5.5 servers (ocfs2-1.4.7) Some days ago, two servers sharing an ocfs2

Re: [Ocfs2-users] kernel BUG at fs/ocfs2/dlm/dlmthread.c:266

2010-12-08 Thread Sunil Mushran
http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.26.y.git;a=blob;f=fs/ocfs2/dlm/dlmthread.c;h=4060bb328bc8a08c22bbd77c59835d757ebdcda5;hb=refs/tags/v2.6.26.2#l265265 if (dlm_purge_lockres(dlm, lockres)) 266 BUG(); Known issue.

Re: [Ocfs2-users] problem mounting OCFS2 into 2 nodes

2010-12-08 Thread Sunil Mushran
ocfs2 is a shared-disk cluster file system. It does not do replication. If you want to use local disks, you could explore using drbd (replicating block device). Google drbd and ocfs2 for more on this. On 12/07/2010 11:11 PM, Linux Cook wrote: Hi! I've just setup DM-Multipath with OCFS2 for

Re: [Ocfs2-users] Extremely poor write performance, but read appears to be okay

2010-12-08 Thread Sunil Mushran
http://oss.oracle.com/git/?p=ocfs2-1.4.git;a=commitdiff;h=1f667766cb67ed05b4d706aa82e8ad0b12eaae8b That specific error has been addressed in the upcoming 1.4.8. Attach the logs and all other info to a bugzilla. On 12/08/2010 05:07 PM, Daniel McDonald wrote: Hello, I'm writing from the

Re: [Ocfs2-users] Two-node cluster often hanging in o2hb/jdb2

2010-12-07 Thread Sunil Mushran
Check the kernel stack of the D state processes. cat /proc/PID/stack The kernel stack will tell us where it is waiting. My guess is that the io stack is slow. Slow ios appear as temporary hangs to the users. On 12/07/2010 07:45 AM, Jan Wielemaker wrote: Hi, I'm pretty new to ocfs2 and a bit

Re: [Ocfs2-users] new release 1.4.8.1

2010-12-07 Thread Sunil Mushran
On 12/07/2010 10:36 AM, Ulf Zimmermann wrote: Sorry, I cannot comment on future product releases. Sunil, he was asking about RHEL 6, not 5. My answer was for RHEL6. ___ Ocfs2-users mailing list Ocfs2-users@oss.oracle.com

Re: [Ocfs2-users] Random node reboots

2010-12-07 Thread Sunil Mushran
:*Sunil

Re: [Ocfs2-users] new release 1.4.8.1

2010-12-06 Thread Sunil Mushran
On 12/05/2010 07:19 AM, Dag Wieers wrote: On Fri, 26 Nov 2010, veeraa bose wrote: I have found the new release of ocfs2 in http://oss.oracle.com/projects/ocfs2/files/RedHat/RHEL5/x86_64/1.4.8-1/ , where I find the specification about the new release. Yes, the website doesn't seem to have

Re: [Ocfs2-users] new release 1.4.8.1

2010-12-06 Thread Sunil Mushran
On 12/06/2010 03:34 PM, Dag Wieers wrote: OCFS2 1.4.8 is not released yet. Yes, some rpms have been uploaded. But it is still a work-in-progress. We will send out an announcement once all that is completed. Hopefully sometime this week. OCFS2 1.6 is not a separate package but is part of the

Re: [Ocfs2-users] Compiling ocfs2-tools-1.6.3 on slackware64-13.1

2010-11-30 Thread Sunil Mushran
On 11/30/2010 10:03 AM, pica pica wrote: Hello all, I'm having trouble compiling ocfs2-tools-1.6.3 on slackware64-13.1 These are my configure flags (the last 2 I added after googling this problem, but didn't help): ./configure \ --sbindir=/sbin \ --bin=/bin \

Re: [Ocfs2-users] Compiling ocfs2-tools-1.6.3 on slackware64-13.1

2010-11-30 Thread Sunil Mushran
It looks ok as far as the ocfs2-tools build scripts are concerned. Note no -static. But I don't know anything about slackware build environment. Easiest would be to rerun the same command with -lpthread. See if that works. # cd .../fsck.ocfs2 # gcc -O2 -fPIC -Wall -Wstrict-prototypes

Re: [Ocfs2-users] Compiling ocfs2-tools-1.6.3 on slackware64-13.1

2010-11-30 Thread Sunil Mushran
On 11/30/2010 07:47 PM, pica pica wrote: Some progress: I added -lpthread my build script (see bellow) and I got past fsck.ocfs2, but now get stuck at o2image :( It seems that -lpthread was not used here, but I could see that it was everywhere else. Is this right? make[1]: Entering

Re: [Ocfs2-users] How to overcome 32000 subdirs limit

2010-11-25 Thread Sunil Mushran
it: gcc make python-dev python-gtk2 pkg-config e2fslibs-dev uuid-dev libncurses5-dev libreadline-dev libglib2.0-dev 24 ноября 2010 г. 0:25 пользователь Sunil Mushran sunil.mush...@oracle.com mailto:sunil.mush...@oracle.com написал: Probably better if you deinstall the package. That way

Re: [Ocfs2-users] Understanding debugfs.ocfs2 output

2010-11-24 Thread Sunil Mushran
. Would it be safe to assume then, that only when -all- of the Contig numbers in the extent_alloc output fall below the Clusters per Group X Bits per Cluster number is the no space issue triggered? -Norm On Tue, 2010-11-23 at 14:39 -0800, Sunil Mushran wrote: The length of allocator chains

Re: [Ocfs2-users] How to overcome 32000 subdirs limit

2010-11-23 Thread Sunil Mushran
) and ready to install it. I'm going to use checkinstall to build package. So, do I need to uninstall my 1.4.1-1 ocfs2-tools or I can install it above? Thanks again. 20 ноября 2010 г. 20:41 пользователь Sunil Mushran sunil.mush...@oracle.com mailto:sunil.mush...@oracle.com написал: Whether

Re: [Ocfs2-users] Understanding debugfs.ocfs2 output

2010-11-23 Thread Sunil Mushran
The length of allocator chains in the global bitmap depends on the size of the volume and block/cluster sizes. It is created during format and only grows if the volume is grown. That's it. On 11/23/2010 11:04 AM, Norman P. B. Joseph wrote: This is related to the No space on OCFS2 volume error

Re: [Ocfs2-users] kernel BUG at fs/dlm/lowcomms.c:647!

2010-11-22 Thread Sunil Mushran
What does the stack of [ocfs2rec] thread look like? On 11/22/2010 07:31 AM, Welterlen Benoit wrote: Hi, I have a new question ;-) Our cluster is now working with sctp and pcmk stack. When a node is lost or the eth is down, pcmk is fencing the other node and a recovery is made and all

Re: [Ocfs2-users] How to overcome 32000 subdirs limit

2010-11-20 Thread Sunil Mushran
to work? Thanks. 2010/11/19 Sunil Mushran sunil.mush...@oracle.com mailto:sunil.mush...@oracle.com Yes, indexed dirs removes this limit. For that you'll need to upgrade the kernel/fs and the tools. Best if the kernel is .32 or higher. The tools should be 1.6.x. Debian has a drop

Re: [Ocfs2-users] How to overcome 32000 subdirs limit

2010-11-19 Thread Sunil Mushran
Yes, indexed dirs removes this limit. For that you'll need to upgrade the kernel/fs and the tools. Best if the kernel is .32 or higher. The tools should be 1.6.x. Debian has a drop of 1.6.3 tools. Check if they have built it. You can read about this and other features in the ocfs2 1.6 user's

Re: [Ocfs2-users] mount on fsck

2010-11-11 Thread Sunil Mushran
Did you notice this with the classic o2cb stack or are you using pacemaker/cman? On Nov 11, 2010, at 4:25 AM, Dzianis Kahanovich maha...@bspu.unibel.by wrote: drbd+ocfs2 on Gentoo (own modifyed ebuilds). Usually I run fsck.ocfs2 -fy on boot and it happened only on both nodes reboot, mount

Re: [Ocfs2-users] Pb with ocfs2 dlm on Fedora 13

2010-11-09 Thread Sunil Mushran
Is iptables running on node3? If so, stop it. On 11/9/2010 3:42 PM, Alain.Moulle wrote: Hi Tao, yes , on the three nodes theMax Node Slots is 8 echo 'stats'|debugfs.ocfs2 /dev/sdc1|grep Slots debugfs.ocfs2 1.4.3 Max Node Slots: 8 Regards, Alain Tao Ma a écrit : Hi Alain, On

Re: [Ocfs2-users] Problem with datavolume option

2010-10-31 Thread Sunil Mushran
The datavolume mount option is not necessary for mounts that hold the db files. You could instead set filesystemio_options in init.ora as explained in the user's guide. On 11/1/2010 12:37 AM, Rafa Carreres wrote: Hi, The facts I have a cluster running Oracle RAC with two nodes accesing a

Re: [Ocfs2-users] No space left on device error, older kernels?

2010-10-27 Thread Sunil Mushran
on 2.6.32... ...maybe there is a case for getting this sort of extremely useful feature into the 2.6.32 trunk? (The no space left on device problem is arguably a bug anyway...) Just a thought from the peanut gallery. - Pat On Wed, Oct 27, 2010 at 10:20 AM, Sunil Mushran sunil.mush

Re: [Ocfs2-users] ocfs2_delete_inode kernel bug

2010-10-26 Thread Sunil Mushran
You'll have to run fsck.ocfs2 -fy to clear the older orphaned inodes. The upgrade will prevent future occurrences of this issue. On 10/26/2010 02:43 AM, Andre Nathan wrote: On Thu, 2010-10-21 at 10:52 -0700, Sunil Mushran wrote: That said, the first issue (-17) is a known one that was fixed

Re: [Ocfs2-users] ocfs2_delete_inode kernel bug

2010-10-26 Thread Sunil Mushran
On 10/26/2010 09:59 AM, Andre Nathan wrote: I actually did that right after rebooting into the new kernel. The system ran fine for a day, but the kernel bug showed up after the backup script ran. My setup is the following: I have two servers sharing OCFS2 filesystems through one dedicated

Re: [Ocfs2-users] ocfs2_delete_inode kernel bug

2010-10-26 Thread Sunil Mushran
On 10/26/2010 10:39 AM, Andre Nathan wrote: On Tue, 2010-10-26 at 10:14 -0700, Sunil Mushran wrote: So the backup server is not part of the cluster but yet reading the same block device. As long as it is only reading, it should not affect the two nodes, but I will not trust the backup

Re: [Ocfs2-users] Reg: ocfs2 two node cluster crashed, node2 crashed, when I rebooted node1 for maintenance.

2010-10-25 Thread Sunil Mushran
Means that the reboot is not shutting down the services in order. Ensure ocfs2 fs is unmounting before the network shutdown. On 10/23/2010 11:27 AM, veeraa bose wrote: Hi All, We have ocfs2 node cluster with oracle 11G RAC running, The node2 got crashed automatically, when i rebooted node one

Re: [Ocfs2-users] ocfs2_delete_inode kernel bug

2010-10-21 Thread Sunil Mushran
This does not look like a 2.6.35. The stack trace says 2.6.32. But I could not match it to a stable 2.6.32 too. It looks more like 2.6.32 + patches. It is hard to diagnose a problem without the source. That said, the first issue (-17) is a known one that was fixed in 2.6.34. commit

Re: [Ocfs2-users] Periodic hangs

2010-10-15 Thread Sunil Mushran
Take a stack trace of the hang. If you are on 2.6.32, you could do: # find /proc -name stack | while read A ; do D=$(dirname $A); echo $A; cat $D/cmdline; echo ; cat $A; echo ; done; Attach the output to a bugzilla on oss.oracle.com. On 10/15/2010 08:16 AM, Emil Noether wrote: Hi, I have a

Re: [Ocfs2-users] Periodic hangs

2010-10-15 Thread Sunil Mushran
it is a prime time of my web and my customers are already quite upset. But I can try it tommorow morning. Regards, Emil Noether On 10/15/2010 07:22 PM, Sunil Mushran wrote: Take a stack trace of the hang. If you are on 2.6.32, you could do: # find /proc -name stack | while read A ; do D

Re: [Ocfs2-users] ocfs2 1.4.4 and 1.4.7 compatibility

2010-10-06 Thread Sunil Mushran
It should be fine... as far as the code is concerned. But I'll let someone who has actually run it in prod, confirm. On Thu, 2010-10-07 at 13:59 +1100, Ram Smith wrote: Hi Crew, After an aborted upgrade last night we are left with some nodes on our Oracle RAC cluster running ocfs2 1.4.4 and

Re: [Ocfs2-users] Reg: ocfs2_wait_for_mask [nfsd]

2010-10-04 Thread Sunil Mushran
Are you using the mount option nordirplus on the nfs client? The users guide has more on this. On Oct 4, 2010, at 7:23 AM, veeraa bose veeraa.b...@gmail.com wrote: Hi All, we have installed the below ocfs2 version and exported the ocfs2 FS over nfs to some other nfs clients. rpm -qa |

[Ocfs2-users] OCFS2 1.6 release

2010-09-28 Thread Sunil Mushran
All, We are pleased to announce the release of OCFS2 1.6. The file system is bundled with Unbreakable Enterprise Kernel that is available for Oracle Linux 5. The user's guide for this release is available at the following link.

Re: [Ocfs2-users] Server crash

2010-09-22 Thread Sunil Mushran
] - Am 21.09.2010 22:47, schrieb Sunil Mushran: There should have been another message possible just above the cut here saying possibly that the there were not enough credits, or something about a running or committing transaction. On 09/21/2010 12:38 AM, Georg Höllrigl wrote: Hello

Re: [Ocfs2-users] Server crash

2010-09-21 Thread Sunil Mushran
There should have been another message possible just above the cut here saying possibly that the there were not enough credits, or something about a running or committing transaction. On 09/21/2010 12:38 AM, Georg Höllrigl wrote: Hello, I got a crashed server when using ocfs2 on SLES10 with

Re: [Ocfs2-users] General protection fault

2010-09-17 Thread Sunil Mushran
It is oopsing 6-8 secs after the mount. ;) The stack trace does not show ocfs2/drbd. It is pointing to slub. But you have to read that with a pinch of salt. This just could be a case of some memory being scribbled. On 09/17/2010 11:39 AM, Andre Nathan wrote: Hello I have an active-active DRBD

Re: [Ocfs2-users] tunefs.ocfs2 -Q question

2010-09-16 Thread Sunil Mushran
On 09/16/2010 09:06 AM, Enrique Sanchez wrote: Currently a large number of OCFS2 clusters and while the paperwork trolls have been pretty good at keeping us in pretty decent shape, I've been trying to create dynamic documentation of the several servers we maintain, right now I am hitting the

Re: [Ocfs2-users] Extremely high memory usage and iowait times

2010-09-07 Thread Sunil Mushran
Can you describe your usage and your setup a bit? On 09/07/2010 11:55 AM, Dan Lark wrote: I know this has been discussed before, but I am seeing high iowait times and the occasional deadlock between my two node OCFS2 cluster. I will be turning on noatime for the mount on both nodes in a

Re: [Ocfs2-users] No space left on device

2010-09-07 Thread Sunil Mushran
Which kernel are you using? We have fixed this issue in mainline. We will soon have the same fix for production kernels. On 09/07/2010 02:06 PM, Todd Freeman wrote: From reading the archives I can see this issue has been hit before but I haven't found a resolution. I have a 50gb

Re: [Ocfs2-users] Servers reboot - may be OCFS2 related

2010-09-03 Thread Sunil Mushran
The stack points to netlink. ocfs2 does not use netlink. That it reproduced with ocfs2 may just mean that the particular load triggers it. That's it. On 09/03/2010 05:21 AM, Proskurin Kirill wrote: Hello. What we have: 2x Debian 5.0 x64 - 2.6.32-20~bpo50+1 from backports DRBD + OCFS2 1.4.1-1

Re: [Ocfs2-users] Doubts about OCFS2 Performance

2010-07-28 Thread Sunil Mushran
Can you attach some info in the bz. Like a iostat run for a minute or more. top output, showing active processes. If the slowdown is being observed by any one process, maybe a strace -p pid -T -ttt -o /tmp/out output. On 07/28/2010 05:28 AM, Jeronimo Bezerra wrote: Guys, comments or

Re: [Ocfs2-users] Too much journaling or not ?

2010-07-27 Thread Sunil Mushran
Have you tried mounting with data=writeback ? On Jul 27, 2010, at 9:31 PM, wanchat padungrat wancha...@pantip.com wrote: Dear all, Not realy sure whether this is bug or not, but we found that sometimes OCFS2 on our system do journaling a lot. (Please see screen shot below) As you can

Re: [Ocfs2-users] some beginner questions

2010-07-14 Thread Sunil Mushran
ocfs2 is a shared disk cfs. Meaning it expects the disk/vol to be accessible by all nodes. Using fiber channel, iscsi, etc. On 07/14/2010 12:03 PM, Alexander Nagel wrote: Hi, I'am new to ocfs2 filesystem and I have some questions about it. I installed three server according to the user guide

Re: [Ocfs2-users] dlmfs_unlink errors

2010-07-12 Thread Sunil Mushran
You are attempting to remove a lock resource that is still active. This is an app bug. On 07/06/2010 11:51 AM, Charlie Sharkey wrote: I'm seeing an occasional error on the nodes from a two node cluster. Is this something I should be concerned about ? Sles10 sp2 2.6.16.60-0.34-smp x86_64

Re: [Ocfs2-users] Not able to flashcopy ocfs2 mount point

2010-06-30 Thread Sunil Mushran
tunefs.ocfs2 --cloned-volume The man page has the details. Ensure you run the command only on the cloned volume. On Jun 30, 2010, at 3:19 AM, Devender Narula devendernar...@yahoo.com wrote: Hi guys i got one Ocfs2 mount point running on RHEL 5.4 .. we are about to configuring

Re: [Ocfs2-users] df showing wrong size

2010-06-28 Thread Sunil Mushran
On 06/28/2010 07:57 AM, Patrick J. LoPresti wrote: On Sun, Jun 27, 2010 at 11:17 PM, Garcia, Raymundo raymundo.gar...@philips.com wrote: Hello… it was put under my attention that a partition we have in one of our production system was displaying wrong size with df command…. 123 GB… but

Re: [Ocfs2-users] df showing wrong size

2010-06-28 Thread Sunil Mushran
On 06/28/2010 09:37 AM, Patrick J. LoPresti wrote: On Mon, Jun 28, 2010 at 9:29 AM, Sunil Mushransunil.mush...@oracle.com wrote: ocfs2 is a journaled file system. But it is also a clustered file system. So it cannot arbitrarily delete orphaned files because they could still be in use

Re: [Ocfs2-users] Non-clean fsck on almost-new filesystem

2010-06-16 Thread Sunil Mushran
For issues on sles, file a bz with Novell. Make sure you list out the fs features that had been enabled. On 06/15/2010 07:08 PM, Patrick J. LoPresti wrote: My O/S is Suse Linux Enteprise Server 11 Service Pack 1. My SCSI device is a hardware iSCSI RAID chassis. I have done a variety of reads

Re: [Ocfs2-users] Info on Version Upgrade

2010-06-16 Thread Sunil Mushran
As far as ocfs2 is concerned, the current version of ocfs2 1.2 is ocfs2 1.2.9. You will find the packages for your kernel on oss.oracle.com. The news section has the list of changes/bugs fixed. asmlib also has some updates. You can review the fixes to see whether an upgrade is warranted. For

Re: [Ocfs2-users] Diagnosing some OCFS2 error messages

2010-06-14 Thread Sunil Mushran
- bpkr...@gmail.com wrote: Patrick J. LoPresti lopre...@gmail.com 2010-06-13 19:14: Hello. I am experimenting with OCFS2 on Suse Linux Enterprise Server 11 Service Pack 1. I am performing various stress tests. My current exercise involves writing to files using a

Re: [Ocfs2-users] Diagnosing some OCFS2 error messages

2010-06-14 Thread Sunil Mushran
- lopre...@gmail.com wrote: Hello. I am experimenting with OCFS2 on Suse Linux Enterprise Server 11 Service Pack 1. I am performing various stress tests. My current exercise involves writing to files using a shared-writable mmap() from two nodes. (Each node mmaps and writes to

Re: [Ocfs2-users] OCFS2 and huge ( 50TB) partitions

2010-06-11 Thread Sunil Mushran
We could remove this check. If you want this in your sles kernel, the quickest route will be via Novell. We'll need both sles and mainline patched. On Jun 11, 2010, at 5:54 PM, Patrick J. LoPresti lopre...@gmail.com wrote: Hello. I am experimenting with OCFS2 on a brand new 10GigE iSCSI

Re: [Ocfs2-users] Heartbeat threshold and data corruption

2010-06-07 Thread Sunil Mushran
Other than a longer pause. No other negative side effects. Processes that have the locks at the required level will continue to do io. Processes that need to upconvert a lock and need the dead node to respond, will have to wait for the deadthreshold to expire before recovery can clean out

Re: [Ocfs2-users] ocfs2 tools, bug 1255

2010-06-04 Thread Sunil Mushran
You mean the next version of 1.4 tools? Not anytime soon I'm afraid. That patch needs to be pushed to the 1.4 branch before you can build it. On 06/04/2010 10:42 AM, Ulf Zimmermann wrote: Sunil, Tao, Can you tell me an estimation when a new tools package with this patch will be available, I

Re: [Ocfs2-users] debugfs.ocfs2 and Feature Incompat

2010-06-01 Thread Sunil Mushran
On 06/01/2010 04:06 AM, Stefan Priebe - allied internet ag wrote: snip Feature Compat: 3 backup-super strict-journal-super Feature Incompat: 8016 sparse extended-slotmap inline-data metaecc xattr indexed-dirs refcount Tunefs Incomplete: 0 Feature RO

Re: [Ocfs2-users] debugfs.ocfs2 and Feature Incompat

2010-06-01 Thread Sunil Mushran
was formatted using: mkfs.ocfs2 --fs-feature-level=max-features -L ocfs2disk -N 10 -T mail -v /dev/sdb So i thought that i can now use features like unwritten, inline-data and indexed-dirs. Stefan Sunil Mushran schrieb: On 06/01/2010 04:06 AM, Stefan Priebe - allied internet ag wrote: snip

Re: [Ocfs2-users] OCFS2 performance - disk random access time problem

2010-06-01 Thread Sunil Mushran
The kernel is old. We fixed this issue in 2.6.30. We have also backported it to the 1.4 production tree. The problem was that the inodes being created did not have locality leading to a directory having inodes that were spaced far apart from each other. The one place where it really affected

Re: [Ocfs2-users] debugfs.ocfs2 and Feature Incompat

2010-06-01 Thread Sunil Mushran
that was the bottleneck in the first place. On 06/01/2010 11:00 AM, Stefan Priebe - allied internet ag wrote: No not but bonnie is still as slow as before when creating and deleting files. I thought this should be fixed when using indexed-dirs. Stefan Sunil Mushran schrieb: Are you having any

Re: [Ocfs2-users] OCFS2 ERROR: status = - 107

2010-05-26 Thread Sunil Mushran
-107 means the node lost connection with the other node. The messages below appear cut-pastes and not in sequence. So I cannot tell for sure what happened next. What should have happened is that the node would then go into quorum mode followed by recovery mode. Sunil On 05/26/2010 05:38 AM,

Re: [Ocfs2-users] Failover testing problem and a heartbeat question

2010-05-26 Thread Sunil Mushran
When a node dies, the cluster ops pause for the node to be first declared dead followed by recovery. Threshold governs the time it takes to declare the node dead. The higher the value, the longer the pause. ocfs2 does not reset without a log message. Do you have netconsole setup? Messages logged

Re: [Ocfs2-users] Failover testing problem and a heartbeat question

2010-05-26 Thread Sunil Mushran
On 05/26/2010 01:39 PM, Daniel McDonald wrote: ocfs2 does not reset without a log message. Do you have netconsole setup? Messages logged a tick before reset can only be captured by netconsole/kdump etc. Unfortunately no. Here are the two lines in /var/log/message prior to the

Re: [Ocfs2-users] Support and Stability

2010-05-24 Thread Sunil Mushran
Fragmentation has been atop our dev priority list for sometime now. That is, both, reducing it and handling it better when it does get fragmented. Just last week we pushed patches for the same into the newly created 2.6.35. http://oss.oracle.com/pipermail/ocfs2-devel/2010-May/006511.html As

Re: [Ocfs2-users] tunefs.ocfs2 resize issue,

2010-05-20 Thread Sunil Mushran
Versions? On May 20, 2010, at 6:47 AM, stephane lomine stephane.lom...@fr.clara.net wrote: Hello Sorry if ask a question already seen before but i'm new to ocfs2 and was not able to find a proper answer on the web. We re using ocfs2 on a two nodes system with SAN disks. We need to

Re: [Ocfs2-users] fsck.ocfs2 using huge amount of memory?

2010-05-20 Thread Sunil Mushran
http://oss.oracle.com/projects/ocfs2-tools/news/article_8.html We did make a related change in fsck in that release. Do you mind creating a bugzilla for this? Do mention the arch. I can then send you a debug version of the tool that'll tell us why it is behaving like that on your machine. On

Re: [Ocfs2-users] dying ocfs2_wq thread

2010-05-17 Thread Sunil Mushran
The fs is oopsing when trying to remove a entry from the orphan dir. It could be that that orphaned inode is corrupted. You could try running fsck.ocfs2 - fy /dev/sdX. Better if you ping Novell support for assistance. On May 17, 2010, at 6:01 AM, Georg Höllrigl georg.hoellr...@xidras.com

Re: [Ocfs2-users] Kernel panic when deleting a file

2010-05-11 Thread Sunil Mushran
ocfs2 disks Start oracle cluster /on rest of the nodes This plan will need to be done in a quiet period of the system, otherwise the running node will have to much pressure on it. Regards Morten K -Opprinnelig melding- Fra: Sunil Mushran [mailto:sunil.mush...@oracle.com

Re: [Ocfs2-users] ocfs2-tools 1.6

2010-05-11 Thread Sunil Mushran
On 05/10/2010 11:53 PM, Stefan Priebe - allied internet ag wrote: Do you mean the trunk at all? http://oss.oracle.com/git/?p=ocfs2-tools.git;a=summary Yes. 2.) Is there a release date for 1.6? We are planning to do a beta release for el6. No dates as yet. Ping Novell for dates

Re: [Ocfs2-users] ocfs2-tools 1.6

2010-05-10 Thread Sunil Mushran
On 05/10/2010 01:33 PM, Stefan Priebe - allied internet ag wrote: 1.) Is ocfs2-tools 1.6 considered as stable? Mostly. We push changes to the tree only after it passes all the tests. 2.) Is there a release date for 1.6? We are planning to do a beta release for el6. No dates as yet.

Re: [Ocfs2-users] Kernel panic when deleting a file

2010-05-10 Thread Sunil Mushran
av* Sunil Mushran *Sendt:* 7. mai 2010 19:07 *Til:* ocfs2-users@oss.oracle.com *Emne:* Re: [Ocfs2-users] Kernel panic when deleting a file Unsure why you have to build the packages when they are downloadable. Please use the packages provided on oss. On 05/07/2010 06:26 AM, Kristiansen Morten

Re: [Ocfs2-users] Kernel panic when deleting a file

2010-05-10 Thread Sunil Mushran
On 05/10/2010 05:18 PM, Sunil Mushran wrote: Upgrade to el5 u4 kernel. 2.6.18-164.el5. ocfs2 1.2 is provided for all el5 kernels upto u4. Correction. It is provided on el5 upto u3. 2.6.18-128.el5. ___ Ocfs2-users mailing list Ocfs2-users

Re: [Ocfs2-users] Kernel panic when deleting a file

2010-05-07 Thread Sunil Mushran
Unsure why you have to build the packages when they are downloadable. Please use the packages provided on oss. On 05/07/2010 06:26 AM, Kristiansen Morten wrote: Hi again, This time I try attach a jpeg. Regards Morten K -Opprinnelig melding- Fra: ocfs2-users-boun...@oss.oracle.com

Re: [Ocfs2-users] Hardware error or ocfs2 error?

2010-04-29 Thread Sunil Mushran
Cannot say for sure. It could be a deadlock (bug) too. As in, I don't want to blame any one entity without knowing more. If it were up to me, I'd start with the dlm. See which node holds the lock that others are waiting on. Then see why that node is unable to downconvert that lock. As in, if the

Re: [Ocfs2-users] compile error on sles 11

2010-04-27 Thread Sunil Mushran
Werner Flamme wrote: Unfortunately, I do not run the Oracle RAC. The RAC runs with Oracle Unbreakable Linux. I am sorry for that decision, but I can't change it. SAP does not forbid to use Oracle's version of Linux. SAP only says that SAP systems are fully supported on RHEL or SLES only

Re: [Ocfs2-users] compile error on sles 11

2010-04-27 Thread Sunil Mushran
Werner Flamme wrote: this is what I do. But since the version on the RAC server is newer than mine, I cannot mount the filesystem (I quoted the error in a previous mail). That's why I try to compile the sources from Oracle. I'd rather deinstall the SLES version and switch over to the newer

Re: [Ocfs2-users] Problem after upgrade to 1.4.7-1 (Bad magic number in superblock while opening context for device)

2010-04-20 Thread Sunil Mushran
Your setup may not have persistent device naming. You can use blkid or mounted.ocfs2 to discover the ocfs2 devices. Marcus Alves Grando wrote: Hello Guys, After upgrade to 1.4.7-1 my FS does not mount anymore. Just after upgrade rpm, FS mount and works fine, but after reboot server, it does

Re: [Ocfs2-users] compile error on sles 11

2010-04-20 Thread Sunil Mushran
1.4 tree is only meant to be build against EL5 U2+. Not SLES nor any other kernel tree. SLES9/10/11 already includes ocfs2. Werner Flamme wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Hi, next VM, next trouble :-( Now I work inside a VM with SLES 11. Configuring ocfs2-1.4.7 (via

[Ocfs2-users] OCFS2 1.4.7-1 and OCFS2 Tools 1.4.4-1 released

2010-04-19 Thread Sunil Mushran
All, We are pleased to announce the release of OCFS2 1.4.7-1 and OCFS2 Tools 1.4.4-1 for Oracle's and Red Hat's Enterprise Linux 5 Update 2 and higher. Oracle's Unbreakable Linux Network users who are subscribing to the OCFS2 1.4 packages for Enterprise Linux 5 channel can upgrade to this

Re: [Ocfs2-users] processes in D State

2010-04-19 Thread Sunil Mushran
No NoNone On Fri, 19 Mar 2010 08:48:39 -0700 Sunil Mushran sunil.mush...@oracle.com wrote: In findpath lockname, the lockname needs to be in angular brackets. Did you manage to trap the oops stack trace of the crash? So the dlm on the master says that node 250 has a PR

Re: [Ocfs2-users] OCFS2 1.4.7-1 and OCFS2 Tools 1.4.4-1 released

2010-04-19 Thread Sunil Mushran
Of Sunil Mushran Sent: Monday, April 19, 2010 1:08 PM To: ocfs2-annou...@oss.oracle.com; ocfs2-users Subject: [Ocfs2-users] OCFS2 1.4.7-1 and OCFS2 Tools 1.4.4-1 released All, We are pleased to announce the release of OCFS2 1.4.7-1 and OCFS2 Tools 1.4.4-1 for Oracle's and Red Hat's Enterprise

Re: [Ocfs2-users] new ocfs2 release?

2010-04-15 Thread Sunil Mushran
We are hoping to release it anyday now. Have you filed a bug about your issue? I have no recollection of any reports of such an issue. Orphan scanning has not changed in 1.4.7. File a bz. We'll need to get more information to understand the problem you are encountering. Mailing List SVR wrote:

Re: [Ocfs2-users] new ocfs2 release?

2010-04-15 Thread Sunil Mushran
this could become an issue again. David -Original Message- From: ocfs2-users-boun...@oss.oracle.com [mailto:ocfs2-users-boun...@oss.oracle.com] On Behalf Of Sunil Mushran Sent: Thursday, April 15, 2010 12:14 PM To: li...@svrinformatica.it Cc: ocfs2-users@oss.oracle.com Subject: Re

Re: [Ocfs2-users] memory leak

2010-04-15 Thread Sunil Mushran
Joel Becker wrote: On Thu, Apr 15, 2010 at 12:31:02PM +0200, Kristiansen Morten wrote: I discovered our four node cluster running on RedHat EL5, Ocfs2 1.2.6 and Oracle 10.2.0.3 have memory leak. I suspect ocfs2, but I could be wrong. I suspect ocfs2 because when we run RMAN backup the free

Re: [Ocfs2-users] Ocfs2-users Digest, Vol 76, Issue 17

2010-04-08 Thread Sunil Mushran
Ping Oracle Support. They will be able to answer qs on OracleVM. You are seeing 4 paths because they might be multipathed. Check your iscsi configuration. Use blkid to determine which two paths are the same. e.g. /dev/sdf1: LABEL=label1 UUID=908a0229-88c3-4a0d-b6bc-38c43c6b1461 TYPE=ocfs2

Re: [Ocfs2-users] Kernel Panic, Server not coming back up

2010-04-05 Thread Sunil Mushran
It is having problems doing ios to the virtual devices. -5 is EIO. ke...@utahsysadmin.com wrote: I have a relatively new test environment setup that is a little different from your typical scenario. This is my first time using OCFS2, but I believe it should work the way I have it setup. All

Re: [Ocfs2-users] Kernel Panic, Server not coming back up

2010-04-05 Thread Sunil Mushran
files for the other nodes to see. If the partition was mounted read-only, would that cause another node to think that the read-only node has failed? Thanks, Kevin On Mon, 05 Apr 2010 14:03:52 -0700, Sunil Mushran sunil.mush...@oracle.com wrote: ocfs2 can handle multiple writers

Re: [Ocfs2-users] Ftp server... single file seems locked

2010-04-02 Thread Sunil Mushran
If fs_locks -B is empty, then the processes are not waiting on a cluster lock. Process pegged at 100% cpu means it is actively waiting to acquire a spinlock. Is the other process running? Unfortunately in EL5 there is no clean way to get the kernel stack for a process. echo t

Re: [Ocfs2-users] Odd error on FC12 with ocfs2

2010-03-30 Thread Sunil Mushran
any extra debug logging actually was created. David -Original Message- From: Sunil Mushran [mailto:sunil.mush...@oracle.com] Sent: Monday, March 29, 2010 10:23 PM To: Angelo McComis Cc: David Murphy; ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] Odd error on FC12 with ocfs2

Re: [Ocfs2-users] Probles resizing a partition

2010-03-30 Thread Sunil Mushran
What's the current size of the partition? The error indicates that the partition has not been resized. Mattia Gandolfi wrote: Hi all, I'm a new OCFS2 user, I'm trying to implement a 2-nodes cluster with a shared fs, and I'm facing an issue while trying to resize an existing ocfs2

Re: [Ocfs2-users] log files appended with NULL when node lost power

2010-03-30 Thread Sunil Mushran
Yes and without the trailing nulls. Florin Andrei wrote: So what would happen if we use 1.4 instead? Would the logger keep logging data and the user would be able to read the new lines normally while node A is down? On 03/30/2010 11:42 AM, Sunil Mushran wrote: What you are seeing

Re: [Ocfs2-users] Extended Attributes support

2010-03-29 Thread Sunil Mushran
I checked. The core support for xattr in tools was added in 1.4.2. We plan on pinging ubuntu to pick up tools 1.6 (when it is ready) that has xattr support enabled by default. Frank Lahm wrote: Hi, although official Oracle documentation indeed suggests that EA should not work if ocfs-tool is

Re: [Ocfs2-users] node B reboots when node A is isolated from the network

2010-03-29 Thread Sunil Mushran
If node A is a lower node number than node B, then the behavior is correct. In a 2 node cluster, if the two nodes cannot talk to each other, the higher node number will fence itself. Also, when a node mounts a volume, it initiates connections to other live nodes. If any connection fails, the mount

Re: [Ocfs2-users] Odd error on FC12 with ocfs2

2010-03-29 Thread Sunil Mushran
used 1.50 for the OCFS kernel module and CentOS 5.3/5.4 use 1.4.4-1. Am I correct in thinking this? David -Original Message- From: Sunil Mushran [mailto:sunil.mush...@oracle.com] Sent: Thursday, March 25, 2010 6:46 PM To: David Murphy Cc: ocfs2-users@oss.oracle.com Subject: Re

Re: [Ocfs2-users] Odd error on FC12 with ocfs2

2010-03-29 Thread Sunil Mushran
Connection to 192.168.102.141 port [tcp/cbt] succeeded! -Original Message- From: Sunil Mushran [mailto:sunil.mush...@oracle.com] Sent: Monday, March 29, 2010 5:08 PM To: David Murphy Cc: ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] Odd error on FC12 with ocfs2 What

<    1   2   3   4   5   6   7   8   9   10   >