Re: [Ocfs2-users] OVM: O2CB_HEARTBEAT_THRESHOLD recommended/max value

2015-10-21 Thread Srinivas Eeda
Which kernel? On 10/21/2015 06:53 AM, tomaz kosir wrote: Hi All, cst had an OVM crash 2 weeks ago due to a network outage. This has been somehow fixed now. Now we are searching for some hints on how to prevent such issues in the future. What we are thinking about is to increase the o2cb timeouts. If

Re: [Ocfs2-users] RBD with OCFS2

2015-09-24 Thread Srinivas Eeda
On 09/24/2015 04:27 AM, gjprabu wrote: Hi All, Can someone tell me what kind of error and issue is this. Regards Prabu GJ On Wed, 23 Sep 2015 18:26:13 +0530 *gjprabu >* wrote Hi All,

Re: [Ocfs2-users] OCFS2 Crash

2015-08-21 Thread Srinivas Eeda
On 08/21/2015 03:10 AM, Martin Lund wrote: Hello, We have a 3 node OCFS2 cluster, using: Kernel: 3.16.0-0.bpo.4-amd64 ii ocfs2-tools 1.6.4-1+deb7u1 amd64 tools for managing OCFS2 cluster filesystems Today two of the nodes out of the 3 had

Re: [Ocfs2-users] Out of space with plenty of it

2015-07-16 Thread Srinivas Eeda
Do you get an ENOSPC error if you try to create/cp a file directly on the mount? On 07/16/2015 02:03 PM, Sergey Bolbat wrote: Hi. I'm using an ocfs2 device with two nodes of 500Gb and I have 350Gb free. Everything works fine. However I always receive an out of disk space error in my web application

Re: [Ocfs2-users] Out of space with plenty of it

2015-07-16 Thread Srinivas Eeda
If 4 didn't help then go down to 2 and see if that helps. Thanks, --Srini On 07/16/2015 02:46 PM, Sergey Bolbat wrote: No, I don't. I can create files but write speed is very poor. 2015-07-17 3:15 GMT+06:00 Srinivas Eeda srinivas.e...@oracle.com: Do you

Re: [Ocfs2-users] Ocfs2-devel Digest, Vol 127, Issue 25

2014-10-09 Thread Srinivas Eeda
On 10/09/2014 12:53 AM, Joseph Qi wrote: On 2014/10/9 15:16, Guozhonghua wrote: Hi Joseph and Srinivas, We had merged and test the two patches: 1.ocfs2: o2net: set tcp user timeout to max value 8e9801dfe37c9e68cdbfcd15988df2187191864e 2.ocfs2: o2net: don't shutdown

Re: [Ocfs2-users] One node hangs up issue requiring goog idea, thanks

2014-09-26 Thread Srinivas Eeda
Hi Guozhonghua can you try pulling following two fixes from mainline to fix your problem ? 1.ocfs2: o2net: set tcp user timeout to max value 8e9801dfe37c9e68cdbfcd15988df2187191864e 2.ocfs2: o2net: don't shutdown connection when idle timeout

Re: [Ocfs2-users] Multiple Filesystems per Node

2014-08-13 Thread Srinivas Eeda
Make sure that cluster.conf has all the nodes that mount an ocfs2 fs. After that you can mount multiple fs(LUNs) on multiple nodes (different combinations) On 08/12/2014 08:27 PM, Jacob Godin wrote: Hi all, We're looking to have 2 separate iSCSI LUNs serve up two separate OCFS2 filesystems

Re: [Ocfs2-users] Cluster lockup when one node fails

2014-06-30 Thread Srinivas Eeda
Can you please describe what you mean by fails ... did the node panic? What are the timeout values ? What kernel version are you using ? On 06/30/2014 10:54 AM, Sitansu Mohanty wrote: Hi, what is the answer to the below post, i am facing the same issue.

Re: [Ocfs2-users] OCFS2 issues during high availability testing

2014-01-21 Thread Srinivas Eeda
On 01/20/2014 11:05 PM, Thiruselvam Velayutham wrote: Dear Experts, Our DBA team is facing following problem. We did high availability testing and when we crash DB

Re: [Ocfs2-users] Deadlocks on OCFS2 file I/O with Ruby Application

2014-01-09 Thread Srinivas Eeda
can you please email a complete messages file with all process stacks (sysrq-t output). I want to see how a process ended up from jbd2 layer to o2net/ocfs2 layer. It might be due to memory pressure, but that should be avoided. Was your system running low on memory when it happened? On

Re: [Ocfs2-users] OCFS2 tuning, fragmentation and localalloc option. Cluster hanging during mix read+write workloads

2013-07-15 Thread Srinivas Eeda
I am not entirely sure about significant slowdown and cluster outage. But from your description and information you provided, you are seeing fragmentation related issues. What is the ocfs2/kernel version and what is the cluster size/block size of these volumes? On 07/15/2013 01:33 PM, Gavin

Re: [Ocfs2-users] Unable to set the o2cb heartbeat to global

2013-06-04 Thread Srinivas Eeda
https://oss.oracle.com/git/?p=ocfs2-tools.git;a=summary You can find source rpms at https://oss.oracle.com/el5/SRPMS-updates/ Global heartbeat was added in 2.6.37, so you need a kernel higher than that. On 06/04/2013 11:04 AM, Vineeth Thampi wrote: Hi Sunil, Thanks for the update, could you pass

Re: [Ocfs2-users] Avoid node fence and fail gracefully

2013-05-31 Thread Srinivas Eeda
The reason nodes are fenced during network failures is because we need to guarantee that no i/o's are going to happen from this fenced node. If you just change the fs to read-only we still cannot guarantee that there are no inflight-io's from this node from previous writes. On 05/31/2013

Re: [Ocfs2-users] 答复: [Ocfs2-devel] Patch request reviews, for node reconnecting with other nodes whose node number is little than local, thanks a lot.

2013-05-10 Thread Srinivas Eeda
On 05/09/2013 11:59 PM, Guozhonghua wrote: Thank you, but I have some questions about it. The IP address of network used by o2net is different with that used by o2hb, such as the o2net use 192.168.0.7, but the storage network is 192.168.10.7. So the tcp of o2net disconnected while the o2hb

Re: [Ocfs2-users] Kernel panic due to ocfs2

2013-02-25 Thread Srinivas Eeda
This is due to a race in lock mastery/purge. I have recently fixed this problem but haven't yet submitted the patch to mainline. Please file a Service request with Oracle to get a one-off fix. On 02/25/2013 10:07 PM, Ramappa, Ravi (NSN - IN/Bangalore) wrote: Hi, In a 13 node cluster, the

Re: [Ocfs2-users] Need help with global heartbeat.

2013-01-17 Thread Srinivas Eeda
Robinson, Jayson wrote: Any work arounds that manually can be done? No, you need at least the patch https://oss.oracle.com/pipermail/ocfs2-tools-devel/2013-January/004097.html You may not need https://oss.oracle.com/pipermail/ocfs2-tools-devel/2013-January/004096.html and if you need you can

Re: [Ocfs2-users] Avoid node reboot on timeout

2012-12-05 Thread srinivas eeda
If you run ocfs2 file system in cluster mode, then all nodes have to heartbeat to each other on network and storage within a timeout value. You can increase the timeout values to tolerate huge delays. On 12/5/2012 1:47 AM, Sébastien RICCIO wrote: Hi OCFS2 list :) We are currently using XCP

Re: [Ocfs2-users] OCFS2 hanging on writes

2012-10-25 Thread srinivas eeda
I believe the problem could be due to fragmentation. 1) Can you run the following script and email me the output https://oss.oracle.com/~seeda/misc/stat_sysdir.sh run it as stat_sysdir.sh -d dev 2) can you also do the following and provide me the fs state mount -t debugfs debugfs

Re: [Ocfs2-users] null pointer dereference

2012-08-22 Thread srinivas eeda
The crash looks similar to what the patch https://oss.oracle.com/pipermail/ocfs2-devel/2012-January/008469.html is trying to address. The fix is not yet accepted because, as explained in the patch description, we need to fix the master node to skip sending BAST after receiving an unlock message. regarding

[Ocfs2-users] [ocfs2-announce] OCFS2 1.4.10-1 released

2012-06-14 Thread Srinivas Eeda
All, We are pleased to announce the release of OCFS2 1.4.10-1 and OCFS2 tools 1.6.3-2 for Oracle Linux 5 Update 7 and higher and Redhat Enterprise Linux 5 Update 7 and higher. Oracle's Unbreakable Linux Network users who are subscribing to the OCFS2 1.4 packages for Enterprise Linux 5 channel

Re: [Ocfs2-users] ocfs2 cluster fails every few days

2012-04-03 Thread srinivas eeda
In OVM, the ocfs2 kernel module is part of the kernel rpm itself and hence you don't need a separate ocfs2 package for the kernel module. The errors suggest that your cluster configuration might be a problem. The corruption, and another node heartbeating in my slot, could be because of a misconfiguration

Re: [Ocfs2-users] OCFS2 certification query

2012-01-27 Thread srinivas eeda
It should be, but please contact database support for certification questions. On 1/27/2012 5:46 AM, Thiruselvam Velayutham wrote: Hello, ·We are currently on 11gR1 RAC(11.1.0.7) on RHEL 5, using ocfs2 (1.2.9) for datafiles, redo ,archive , ocr and voting disk. ·We plan to upgrade to

Re: [Ocfs2-users] ocfs2 - Kernel panic on many write/read from both

2012-01-03 Thread Srinivas Eeda
and password. I do sysrq t,s,b and this is what I get: https://wizja2.tktelekom.pl/ocfs2/2012.01.03-3.1.6/ Anything else you need? Cheers! -Original Message- From: srinivas eeda Sent: Friday, December 23, 2011 10:52 PM To: Marek Królikowski Cc: ocfs2-users@oss.oracle.com Subject

Re: [Ocfs2-users] ocfs2 - Kernel panic on many write/read from both

2011-12-23 Thread srinivas eeda
... but there is no OOPS or anything like this on screen. I have not restarted either server; tell me what to do now. Thanks -Original Message- From: srinivas eeda Sent: Thursday, December 22, 2011 9:12 PM To: Marek Królikowski Cc: ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] ocfs2 - Kernel

Re: [Ocfs2-users] ocfs2 - Kernel panic on many write/read from both

2011-12-22 Thread srinivas eeda
But not working -Original Message- From: Srinivas Eeda Sent: Wednesday, December 21, 2011 8:43 PM To: Marek Królikowski Cc: ocfs2-users@oss.oracle.com Subject: Re: [Ocfs2-users] ocfs2 - Kernel panic on many write/read from both Those numbers look good. Basically with the fixes

Re: [Ocfs2-users] ocfs2 - Kernel panic on many write/read from both

2011-12-21 Thread Srinivas Eeda
-trigger 2. debugfs.ocfs2 -l ENTRY EXIT DLM_GLUE QUOTA INODE DISK_ALLOC EXTENT_MAP allow 3. wait for 10 minutes 4. debugfs.ocfs2 -l ENTRY EXIT DLM_GLUE QUOTA INODE DISK_ALLOC EXTENT_MAP off 5. echo t > /proc/sysrq-trigger Thanks, --Srini Thanks -Original Message- From: Srinivas Eeda

Re: [Ocfs2-users] ocfs2 - Kernel panic on many write/read from both

2011-12-20 Thread Srinivas Eeda
Marek Królikowski wrote: Sorry i don`t copy everything: TEST-MAIL1# echo ls //orphan_dir:|debugfs.ocfs2 /dev/dm-0|wc debugfs.ocfs2 1.6.4 5239722 26198604 246266859 ^ those numbers (5239722, 6074335) are the problem. What they are telling is the orphan directory is filled with flood of

Re: [Ocfs2-users] ocfs2 - Kernel panic on many write/read from both

2011-12-20 Thread Srinivas Eeda
days ago, like I said, I contacted Sunil Mushran and he told me to remove these patches, and I did, but it didn't help me - take a look: https://wizja2.tktelekom.pl/ocfs2/ Thanks -Original Message- From: Srinivas Eeda Sent: Tuesday, December 20, 2011 7:58 PM To: Marek Królikowski

Re: [Ocfs2-users] OCFS2 problems when connectivity lost

2011-12-20 Thread Srinivas Eeda
Ivan Savčić | Epix wrote: *Both* OCFS2 nodes (ie. the one with the network down and the one which is still up in the partition with quorum) hang with dmesg reporting that events, ocfs2rec and ocfs2_wq on *both* nodes are blocked for more than 120 seconds. What are the timeout values?

Re: [Ocfs2-users] ocfs2 - Kernel panic on many write/read from both servers

2011-12-16 Thread srinivas eeda
I think you are running into a known issue. Are there lot of orphan files in orphan directory? I am not sure if the problem is still there, if not please run the same test and once you see the same symptoms, please run the following and provide me the output echo ls

Re: [Ocfs2-users] question about _datavolume

2011-12-08 Thread srinivas eeda
On 12/8/2011 7:52 AM, Shashank wrote: I want to know whether _datavolume can be used for non-database filesystems. Oracle support is recommending the use of _datavolume along with _netdev datavolume option was a way to tell older oracle database versions that the filesystem does support direct

Re: [Ocfs2-users] enabling discontig-bg

2011-10-14 Thread srinivas eeda
On 10/14/2011 12:23 PM, Florin Andrei wrote: We upgraded from 1.4 to 1.6 to avoid some FS fragmentation issues (the No space left on device error, but the FS is not full yet). But it looks like we really need to enable discontig-bg to take advantage of that. Our file systems have these

Re: [Ocfs2-users] ocfs2, redhat 2.6.18-238.x kernel and async i/o hang

2011-07-28 Thread srinivas . eeda
On 07/28/2011 09:20 AM, Matthew Huff wrote: I migrated an Oracle 11gR2 database from one HP RHEL5 box to another. The only difference I can tell is that the older box was running the 2.6.18-194 kernel, and the newer box is running the 2.6.18.238 kernel (I've tried 2.6.18.238.el5, 238.5.1.el5 and

Re: [Ocfs2-users] Fw: ocfs kernel for 2.6.18-238.12.1.el5

2011-06-02 Thread Srinivas Eeda
Devender Narula wrote: HI team i am not able to find ocfs kernel for 2.6.18-238.12.1.el5 .. can you please help me from where i can download this . its bit urgent for me . i need it for 64 bit linux . they will be available later today or tomorrow. Please always file a service request

Re: [Ocfs2-users] directory problem

2011-05-26 Thread Srinivas Eeda
of the file system, please upgrade and see if that helps. On 5/25/2011 11:41 PM, Alex Sobrino wrote: On 20/05/11 10:03, Srinivas Eeda wrote: That inode number looks suspicious. It appears the fs might have got corrupted; can you umount the filesystem and run fsck.ocfs2 -f to check for any

Re: [Ocfs2-users] directory problem

2011-05-20 Thread Srinivas Eeda
On 5/20/2011 12:47 AM, Alex Sobrino wrote: Hi, I have a two-node (virtual machines) OCFS2 cluster with a shared disk of 560GB. They are running as web servers, with Moodle as software and OCFS2 sharing the data. Right now I'm seeing strange behavior. A directory

Re: [Ocfs2-users] read/write performance across cluster

2011-05-04 Thread Srinivas Eeda
Florin Andrei wrote: Red Hat 5 kernel 64 bit, OCFS2 1.4.4. Two servers with lots of RAM, filesystem is on a SAN via Fiber Channel, the Ethernet between servers is pretty fast. There's a volume on the SAN used for logs. A log collector runs on one server, appending lines to a variety of

Re: [Ocfs2-users] read/write performance across cluster

2011-05-04 Thread Srinivas Eeda
Florin Andrei wrote: On 05/04/2011 09:44 AM, Srinivas Eeda wrote: Yes, there is locking involved. Extending a file needs an exclusive lock. Grepping a file needs read lock. If the same node(lets call it writer node) does extending and grepping, then grep already has a compatible lock

Re: [Ocfs2-users] Process hangs when accessing OCFS2 filesystem

2010-08-27 Thread Srinivas Eeda
Can you please run echo t > /proc/sysrq-trigger and email me the messages files? On 8/27/2010 5:05 PM, Welsh, Jeff D wrote: I have a 7 node ocfs2 cluster and access to the ocfs file system is hanging from one of the nodes. Whether it's an rm command, ls command, find command, etc, any process

Re: [Ocfs2-users] OCFS2 initiating reboot on production machine.

2010-05-28 Thread Srinivas Eeda
May 24 02:10:49 ewhpbc3bl7 kernel: (26,0):o2hb_write_timeout:172 ERROR: Heartbeat write timeout to device sda1 after 6 milliseconds May 24 02:10:49 ewhpbc3bl7 kernel: (26,0):o2hb_stop_all_regions:1967 ERROR: stopping heartbeat on all active regions. It means heartbeat took longer than

Re: [Ocfs2-users] 2 node OCFS2 clusters

2009-11-17 Thread Srinivas Eeda
my_node1 kernel: ocfs2: Mounting device (253,8) on (node 1, slot 1) with ordered data mode. Regards, Mark *From:* Srinivas Eeda [mailto:srinivas.e...@oracle.com] *Sent:* 16 November 2009 16:05 *To:* Thompson, Mark *Cc:* ocfs2-users@oss.oracle.com *Subject:* Re: [Ocfs2-users] 2

Re: [Ocfs2-users] 2 node OCFS2 clusters

2009-11-16 Thread Srinivas Eeda
In a cluster with more than 2 nodes, if the network on one node goes down, that node will evict itself but the other nodes will survive. But in a two node cluster, the node with the lowest node number will survive no matter on which node the network went down. thanks, --Srini Thompson, Mark wrote: Hi,
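
The survival rule described in this reply can be sketched as a small model. This is a simplified illustration only, not the actual o2quo code: the function name `survives` and the set-based inputs are assumptions made for the example, and the model covers just the majority and lowest-node tie-break behaviour stated above.

```python
def survives(node, reachable, configured):
    """Return True if `node` keeps running after a network split.

    `configured` is the set of all node numbers in the cluster.
    `reachable` is the set of nodes that `node` can still talk to,
    including itself. Both names are hypothetical, for illustration.
    """
    if node not in reachable:
        raise ValueError("a node always reaches itself")
    seen, total = len(reachable), len(configured)
    if 2 * seen > total:
        # Strict majority of the cluster is still visible.
        return True
    if 2 * seen == total:
        # Even split: the side holding the lowest node number wins.
        return min(configured) in reachable
    # Minority side evicts itself.
    return False


if __name__ == "__main__":
    # Two-node cluster, link between the nodes is down.
    print(survives(0, {0}, {0, 1}))
    print(survives(1, {1}, {0, 1}))
    # Three-node cluster, node 2 loses its network.
    print(survives(2, {2}, {0, 1, 2}))
    print(survives(0, {0, 1}, {0, 1, 2}))
```

In the two-node case each side sees exactly half of the cluster, so the tie-break decides the outcome regardless of which node actually lost its network.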

Re: [Ocfs2-users] 2 node OCFS2 clusters

2009-11-16 Thread Srinivas Eeda
*From:* Srinivas Eeda [mailto:srinivas.e...@oracle.com] *Sent:* 16 November 2009 14:57 *To:* Thompson, Mark *Cc:* ocfs2-users@oss.oracle.com *Subject:* Re: [Ocfs2-users] 2 node OCFS2 clusters In a cluster with more than 2 nodes, if a network on one node goes down, that node will evict itself

Re: [Ocfs2-users] The trouble with the ocfs2 partition continues ...

2009-10-29 Thread Srinivas Eeda
Do you run into ENOSPC creating new files or extending existing files? What is the cluster size? Don't think this may be the issue, but any files under orphan directory? run (echo "ls //orphan_dir:000X" | debugfs.ocfs2 device) to check if there are any. Juan Pablo Marco

Re: [Ocfs2-users] The trouble with the ocfs2 partition continues ...

2009-10-29 Thread Srinivas Eeda
) copied, 3,52704 seconds, 71,4 MB/s Srinivas Eeda wrote: I don't see any problem with your filesystem configuration. df reports 63G free space, so you should be able to create files of that size (approximately). But you are able to create one file of size 5gb and no more? Or you are able

Re: [Ocfs2-users] Fencing OCFS

2009-10-20 Thread Srinivas Eeda
If you are doing a shutdown/init 6, check your init scripts to make sure ocfs2 shuts down before the network. maike wrote: Hi people, I installed ocfs in my virtual machine, with CentOS 5.3 and Xen. But when I turn off machine1, ocfs starts fencing machine2. I read the doc in

Re: [Ocfs2-users] OCFS2 mount points not automatically mounting on server reboot

2009-08-06 Thread Srinivas Eeda
Reid, that error about stackglue module is harmless and can be ignored. Are you seeing any other errors associated with mounts failing in dmesg? Is the storage up by then? Can you list ls /etc/rc<runlevel>.d/ thanks, --Srini McKinley, Reid wrote: Our ocfs2 mount points will not mount on

Re: [Ocfs2-users] How to clean orphan metadata?

2009-07-27 Thread Srinivas Eeda
Goncalo, run lsof |grep deleted on all nodes. If that lists any, it means some process still has the file open. That file will not get cleaned till the process exits or closes the file If that command doesn't list any, there is a way(in ocfs2-1.4.2) to clean, but it needs an unmount/mount

Re: [Ocfs2-users] ocfs2 fencing with multipath and dual channel HBA

2009-06-05 Thread Srinivas Eeda
Florian, the problem here seems to be with network. The nodes are running into network heartbeat timeout and hence second node is getting fenced. Do you see o2net thread consuming 100% cpu on any node? if not then probably check your network thanks, --Srini florian.engelm...@bt.com wrote:

Re: [Ocfs2-users] [Ocfs2-devel] FW: Oracle 9204 installation on linux x86-64 on ocfs

2009-05-04 Thread Srinivas Eeda
Did you use the -o datavolume,nointr options for mounting? keyur patel wrote: Hello All, I have installed Oracle Cluster Manager on linux x86-64 nit. I am using ocfs file system for quorum file. But I am getting following error. Please see ocfs configuration below. I would appreciate, if

Re: [Ocfs2-users] Encountered disk I/O error 19502

2009-04-06 Thread Srinivas Eeda
do you notice any errors in messages files during the same time as the error reported in alert.log?. please forward the output of echo stat /arch/2_1917_656008464.dbf|debugfs.ocfs2 device Can you please file a tar with Oracle and forward me the tar#. thanks, --Srini Diane Petersen wrote:

Re: [Ocfs2-users] how do I remove ocfs2 orphaned files that are taking up diskspace ?

2009-03-10 Thread Srinivas Eeda
. - Original Message - From: Srinivas Eeda srinivas.e...@oracle.com To: Tim Hughes tim.hug...@tradefair.com Cc: ocfs2-users@oss.oracle.com Sent: Friday, 27 February, 2009 10:21:50 PM GMT +00:00 GMT Britain, Ireland, Portugal Subject: Re: [Ocfs2-users] how do I remove ocfs2 orphaned files

Re: [Ocfs2-users] how do I remove ocfs2 orphaned files that are taking up diskspace ?

2009-02-27 Thread Srinivas Eeda
please run fsck.ocfs2 -f to clean the orphans. If you can reproduce it at will, can you please enable tracing debugfs.ocfs2 -l JOURNAL, SUPER, INODE, NAMEI, DLM_GLUE ENTRY EXIT allow and send us the messages files from all nodes. thanks, --Srini Tim Hughes wrote: I am looking for a little

Re: [Ocfs2-users] ocfs2 processes that need to be up

2008-04-16 Thread Srinivas Eeda
this is a ocfs2 superblock id that gets incremented for each ocfs2 mount. It gets reset whenever ocfs2 modules are re-loaded(on restarting the node or /etc/init.d/o2cb restart). This is expected behavior and will not cause any problems until it grows beyond 64K. thanks, --Srini Madhu Velukur

Re: [Ocfs2-users] kernel panic - not syncing

2007-01-22 Thread Srinivas Eeda
The problem appears to be that IO is taking more time than the effective O2CB_HEARTBEAT_THRESHOLD. Your configured value 31 doesn't seem to be effective? Index 6: took 1995 ms to do msleep Index 17: took 1996 ms to do msleep Index 22: took 10001 ms to do waiting for read completion. Can you
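
For context on the threshold discussed here: the OCFS2 documentation gives the disk heartbeat timeout as (O2CB_HEARTBEAT_THRESHOLD - 1) * 2 seconds. A minimal sketch of that arithmetic follows; the helper names are hypothetical, and the 2-second interval is taken from that documented formula rather than from this thread.

```python
# Assumed disk heartbeat interval in seconds, per the documented formula
# timeout = (O2CB_HEARTBEAT_THRESHOLD - 1) * 2.
HEARTBEAT_INTERVAL_SECS = 2


def heartbeat_timeout_secs(threshold):
    """Disk heartbeat timeout implied by an O2CB_HEARTBEAT_THRESHOLD value."""
    if threshold < 2:
        raise ValueError("threshold must be at least 2")
    return (threshold - 1) * HEARTBEAT_INTERVAL_SECS


def threshold_for_timeout(timeout_secs):
    """Smallest threshold whose timeout is at least `timeout_secs`."""
    if timeout_secs <= 0:
        raise ValueError("timeout must be positive")
    # Round up to whole heartbeat intervals, then add one.
    intervals = -(-timeout_secs // HEARTBEAT_INTERVAL_SECS)
    return intervals + 1


if __name__ == "__main__":
    # The configured value mentioned in the message above.
    print(heartbeat_timeout_secs(31), "seconds")
```

If the configured value of 31 were in effect, the timeout would be 60 seconds, which is why the reply questions whether the setting was actually applied.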

Re: [Ocfs2-users] ocfs2 keeps fencing all my nodes

2007-01-18 Thread Srinivas Eeda
John, it's hard to tell without seeing the messages on the surviving node. Do you remember how many node slots you created when formatting the volume? Maybe you configured just 1? If so, use tunefs.ocfs2 to increase the number of slots. If that's not the problem, please copy paste the

Re: [Ocfs2-users] Missing over 100GB of disk space

2007-01-16 Thread Srinivas Eeda
Colin, space for files is allocated in cluster units. A full 64k cluster is allocated even if the file is only a few bytes. You might want to consider a lower cluster size if the files are small. thanks, --Srini [EMAIL PROTECTED] wrote: We have a 300GB ocfs2 filesystem shared by two unbreakable linux 2.0
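
The effect of cluster-unit allocation can be estimated with a little arithmetic. The sketch below is illustrative only: `allocated_bytes` is a hypothetical helper, it assumes every non-empty file is rounded up to whole clusters, and it ignores metadata and any other filesystem overhead.

```python
def allocated_bytes(file_size, cluster_size=64 * 1024):
    """Space consumed by one file when rounded up to whole clusters."""
    if file_size < 0 or cluster_size <= 0:
        raise ValueError("sizes must be non-negative and cluster size positive")
    if file_size == 0:
        return 0
    clusters = -(-file_size // cluster_size)  # ceiling division
    return clusters * cluster_size


def total_allocated(file_sizes, cluster_size=64 * 1024):
    """Sum of rounded-up allocations for a collection of file sizes."""
    return sum(allocated_bytes(size, cluster_size) for size in file_sizes)


if __name__ == "__main__":
    # Hypothetical workload: one million files of 100 bytes each.
    sizes = [100] * 1_000_000
    for cluster in (64 * 1024, 4 * 1024):
        used = total_allocated(sizes, cluster)
        print(f"cluster={cluster} bytes: {used / 2**30:.1f} GiB allocated")
```

With many small files, most of the allocated space is slack inside partly filled clusters, which is the reason a lower cluster size is suggested in the reply.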

Re: [Ocfs2-users] Missing over 100GB of disk space

2007-01-16 Thread Srinivas Eeda
Administrator E-Care Contact Center Services Phone:(204) 940-6244 Fax:(204) 940-7394 Srinivas Eeda [EMAIL PROTECTED