Which kernel?
On 10/21/2015 06:53 AM, tomaz kosir wrote:
Hi All,
cst had an ovm crash 2 weeks ago due to a network outage. This has been
somehow fixed now.
Now we are searching for some hints on how to prevent such issues in the
future.
What we are thinking about is to increase o2cb timeouts. If
On 09/24/2015 04:27 AM, gjprabu wrote:
Hi All,
Can someone tell me what kind of error and issue is this.
Regards
Prabu GJ
On Wed, 23 Sep 2015 18:26:13 +0530 gjprabu wrote
Hi All,
On 08/21/2015 03:10 AM, Martin Lund wrote:
Hello,
We have a 3 node OCFS2 cluster, using:
Kernel: 3.16.0-0.bpo.4-amd64
ii ocfs2-tools 1.6.4-1+deb7u1 amd64 tools for managing OCFS2 cluster filesystems
Today two of the nodes out of the 3 had
Do you get enospc error if you try to create/cp a file directly to the
mount ?
On 07/16/2015 02:03 PM, Sergey Bolbat wrote:
Hi.
I'm using ocfs2 device with two nodes of 500Gb and I have 350Gb free.
Everything works fine. However, I always receive an out of disk space
error in my web application
If 4 didn't help then go down to 2 and see if that helps.
Thanks,
--Srini
On 07/16/2015 02:46 PM, Sergey Bolbat wrote:
No, I don't. I can create files but write speed is very poor.
2015-07-17 3:15 GMT+06:00 Srinivas Eeda srinivas.e...@oracle.com:
Do you
On 10/09/2014 12:53 AM, Joseph Qi wrote:
On 2014/10/9 15:16, Guozhonghua wrote:
Hi Joseph and Srinivas,
We have merged and tested the two patches:
1.ocfs2: o2net: set tcp user timeout to max value
8e9801dfe37c9e68cdbfcd15988df2187191864e
2.ocfs2: o2net: don't shutdown
Hi Guozhonghua
Can you try pulling the following two fixes from mainline to fix your problem?
1.ocfs2: o2net: set tcp user timeout to max value
8e9801dfe37c9e68cdbfcd15988df2187191864e
2.ocfs2: o2net: don't shutdown connection when idle timeout
Make sure that cluster.conf has all the nodes that mount an ocfs2 fs.
After that you can mount multiple fs(LUNs) on multiple nodes (different
combinations)
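A minimal Python sketch of the check described above: it reads the text of a cluster.conf and reports which nodes that are expected to mount are not listed. The sample file contents, the node names, the second IP address, and the helper names parse_nodes and missing_nodes are all illustrative assumptions, and the parser only handles the simple layout of "node:" stanzas followed by "key = value" lines.

```python
def parse_nodes(conf_text):
    """Return the set of node names found in 'node:' stanzas."""
    names = set()
    in_node = False
    for raw in conf_text.splitlines():
        line = raw.strip()
        if not line:
            continue
        # A stanza header such as 'node:' or 'cluster:' has no '=' in it.
        if line.endswith(":") and "=" not in line:
            in_node = (line == "node:")
            continue
        if in_node and "=" in line:
            key, _, value = line.partition("=")
            if key.strip() == "name":
                names.add(value.strip())
    return names


def missing_nodes(conf_text, mounting_nodes):
    """Nodes that will mount the filesystem but are absent from the config."""
    return sorted(set(mounting_nodes) - parse_nodes(conf_text))


# Hypothetical two-node configuration used only for illustration.
SAMPLE_CONF = """\
node:
\tip_port = 7777
\tip_address = 192.168.0.7
\tnumber = 0
\tname = node1
\tcluster = ocfs2

node:
\tip_port = 7777
\tip_address = 192.168.0.8
\tnumber = 1
\tname = node2
\tcluster = ocfs2

cluster:
\tnode_count = 2
\tname = ocfs2
"""

print(missing_nodes(SAMPLE_CONF, ["node1", "node2", "node3"]))  # ['node3']
```

An empty result means every node in the list is present in the parsed configuration; it does not verify that the file is identical on all nodes.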
On 08/12/2014 08:27 PM, Jacob Godin wrote:
Hi all,
We're looking to have 2 separate iSCSI LUNs serve up two separate
OCFS2 filesystems
Can you please describe what you mean by fails ... did the node panic?
What are the timeout values ? What kernel version are you using ?
On 06/30/2014 10:54 AM, Sitansu Mohanty wrote:
Hi,
what is the answer to the below post, i am facing the same issue.
On 01/20/2014 11:05 PM, Thiruselvam Velayutham wrote:
Dear Experts,
Our DBA team is facing following problem.
We did high availability testing and when we
crash DB
can you please email a complete messages file with all process stacks
(sysrq-t output). I want to see how a process ended up from jbd2 layer
to o2net/ocfs2 layer. It might be due to memory pressure, but that
should be avoided.
Was your system running low on memory when it happened?
On
I am not entirely sure about significant slowdown and cluster outage.
But from your description and information you provided, you are seeing
fragmentation related issues. What is the ocfs2/kernel version and what
is the cluster size/block size of these volumes?
On 07/15/2013 01:33 PM, Gavin
https://oss.oracle.com/git/?p=ocfs2-tools.git;a=summary
You can find source rpms at https://oss.oracle.com/el5/SRPMS-updates/
Global heartbeat was added in 2.6.37 so you need a kernel higher than that
On 06/04/2013 11:04 AM, Vineeth Thampi wrote:
Hi Sunil,
Thanks for the update, could you pass
The reason nodes are fenced during network failures is because we need
to guarantee that no i/o's are going to happen from this fenced node. If
you just change the fs to read-only we still cannot guarantee that there
are no inflight-io's from this node from previous writes.
On 05/31/2013
On 05/09/2013 11:59 PM, Guozhonghua wrote:
Thank you, but I have some questions about it.
The IP address of the network used by o2net is different from that used by
o2hb; for example, o2net uses 192.168.0.7, but the storage network is
192.168.10.7.
So the tcp of o2net disconnected while the o2hb
This is due to a race in lock mastery/purge. I have recently fixed this
problem but haven't yet submitted the patch to mainline. Please file a
Service request with Oracle to get a one-off fix.
On 02/25/2013 10:07 PM, Ramappa, Ravi (NSN - IN/Bangalore) wrote:
Hi,
In a 13 node cluster, the
Robinson, Jayson wrote:
Any work arounds that manually can be done?
No, you need at least the patch
https://oss.oracle.com/pipermail/ocfs2-tools-devel/2013-January/004097.html
You may not need
https://oss.oracle.com/pipermail/ocfs2-tools-devel/2013-January/004096.html
and if you need you can
If you run ocfs2 file system in cluster mode, then all nodes have to
heartbeat to each other on network and storage within a timeout value.
You can increase the timeout values to tolerate huge delays.
On 12/5/2012 1:47 AM, Sébastien RICCIO wrote:
Hi OCFS2 list :)
We are currently using XCP
I believe the problem could be due to fragmentation.
1) Can you run the following script and email me the output
https://oss.oracle.com/~seeda/misc/stat_sysdir.sh
run it as stat_sysdir.sh -d dev
2) can you also do the following and provide me the fs state
mount -t debugfs debugfs
crash looks similar to what patch
https://oss.oracle.com/pipermail/ocfs2-devel/2012-January/008469.html
is trying to address. The fix is not yet accepted because, as explained in
the patch description we need to fix the master node to skip sending
BAST after receiving unlock message.
regarding
All,
We are pleased to announce the release of OCFS2 1.4.10-1 and OCFS2 tools
1.6.3-2 for Oracle Linux 5 Update 7 and higher and Redhat Enterprise
Linux 5 Update 7 and higher.
Oracle's Unbreakable Linux Network users who are subscribing to the
OCFS2 1.4 packages for Enterprise Linux 5 channel
In ovm, ocfs2 kernel module is part of the kernel rpm itself and hence
you don't need a separate ocfs2 package for kernel module. The errors
suggest that your cluster configuration might be a problem. The
corruption, and another node heartbeating in my slot, could be because of
a misconfiguration
It should be, but please contact database support for certification
questions.
On 1/27/2012 5:46 AM, Thiruselvam Velayutham wrote:
Hello,
·We are currently on 11gR1 RAC(11.1.0.7) on RHEL 5, using ocfs2
(1.2.9) for datafiles, redo ,archive , ocr and voting disk.
·We plan to upgrade to
and password.
I did sysrq t,s,b and this is what I get:
https://wizja2.tktelekom.pl/ocfs2/2012.01.03-3.1.6/
Anything else you need?
Cheers!
-Original Message- From: srinivas eeda
Sent: Friday, December 23, 2011 10:52 PM
To: Marek Królikowski
Cc: ocfs2-users@oss.oracle.com
Subject
... but
there is no OOPS or anything like that on screen.
I have not restarted either server; tell me what to do now.
Thanks
-Original Message- From: srinivas eeda
Sent: Thursday, December 22, 2011 9:12 PM
To: Marek Królikowski
Cc: ocfs2-users@oss.oracle.com
Subject: Re: [Ocfs2-users] ocfs2 - Kernel
But not working
-Original Message- From: Srinivas Eeda
Sent: Wednesday, December 21, 2011 8:43 PM
To: Marek Królikowski
Cc: ocfs2-users@oss.oracle.com
Subject: Re: [Ocfs2-users] ocfs2 - Kernel panic on many write/read
from both
Those numbers look good. Basically with the fixes
-trigger
2. debugfs.ocfs2 -l ENTRY EXIT DLM_GLUE QUOTA INODE DISK_ALLOC
EXTENT_MAP allow
3. wait for 10 minutes
4. debugfs.ocfs2 -l ENTRY EXIT DLM_GLUE QUOTA INODE DISK_ALLOC
EXTENT_MAP off
5. echo t > /proc/sysrq-trigger
Thanks,
--Srini
Thanks
-Original Message- From: Srinivas Eeda
Marek Królikowski wrote:
Sorry, I didn't copy everything:
TEST-MAIL1# echo ls //orphan_dir:|debugfs.ocfs2 /dev/dm-0|wc
debugfs.ocfs2 1.6.4
5239722 26198604 246266859
^ Those numbers (5239722, 6074335) are the problem. What they tell us
is that the orphan directory is filled with a flood of
days ago, like I said, I contacted Sunil Mushran and he told
me to remove these patches, and I did, but it didn't help. Take a look:
https://wizja2.tktelekom.pl/ocfs2/
Thanks
-Original Message- From: Srinivas Eeda
Sent: Tuesday, December 20, 2011 7:58 PM
To: Marek Królikowski
Ivan Savčić | Epix wrote:
*Both* OCFS2 nodes (ie. the one with the network down and the one which
is still up in the partition with quorum) hang with dmesg reporting that
events, ocfs2rec and ocfs2_wq on *both* nodes are blocked for more than
120 seconds.
What are the timeout values?
I think you are running into a known issue. Are there a lot of orphan
files in the orphan directory? I am not sure if the problem is still there;
if not please run the same test and once you see the same symptoms,
please run the following and provide me the output
echo ls
On 12/8/2011 7:52 AM, Shashank wrote:
I want to know whether _datavolume can be used for non-database
filesystems. Oracle support is recommending the use of _datavolume
along with _netdev
datavolume option was a way to tell older oracle database versions that
the filesystem does support direct
On 10/14/2011 12:23 PM, Florin Andrei wrote:
We upgraded from 1.4 to 1.6 to avoid some FS fragmentation issues (the
No space left on device error, but the FS is not full yet). But it
looks like we really need to enable discontig-bg to take advantage of
that. Our file systems have these
On 07/28/2011 09:20 AM, Matthew Huff wrote:
I migrated an Oracle 11gR2 database from one HP RHEL5 box to another. The only
difference I can tell is that the older box was running 2.6.18-194 kernel,
and the newer box is running 2.6.18.238 kernel (I've tried 2.6.18.238.el5,
238.5.1.el5 and
Devender Narula wrote:
Hi team,
I am not able to find the ocfs kernel for 2.6.18-238.12.1.el5. Can you
please help me with where I can download it? It's a bit urgent for me.
I need it for 64-bit Linux.
They will be available later today or tomorrow. Please always file a
service request
of the file system, please upgrade and see if that helps.
On 5/25/2011 11:41 PM, Alex Sobrino wrote:
On 20/05/11 10:03, Srinivas Eeda wrote:
That inode number looks suspicious. Appears fs might have got
corrupted, can you umount the filesystem and run fsck.ocfs2 -f to
check for any
On 5/20/2011 12:47 AM, Alex Sobrino wrote:
Hi,
I have a two-node (virtual machines) OCFS2 cluster with a shared disk of
560GB. They are running as web servers, with Moodle as software and
OCFS2 sharing the data.
Right now I'm seeing strange behavior. A directory
Florin Andrei wrote:
Red Hat 5 kernel 64 bit, OCFS2 1.4.4.
Two servers with lots of RAM, filesystem is on a SAN via Fiber Channel,
the Ethernet between servers is pretty fast.
There's a volume on the SAN used for logs. A log collector runs on one
server, appending lines to a variety of
Florin Andrei wrote:
On 05/04/2011 09:44 AM, Srinivas Eeda wrote:
Yes, there is locking involved. Extending a file needs an exclusive
lock. Grepping a file needs a read lock. If the same node (let's call it
writer node) does extending and grepping, then grep already has a
compatible lock
Can you please run echo t > /proc/sysrq-trigger and email me the
messages files?
On 8/27/2010 5:05 PM, Welsh, Jeff D wrote:
I have a 7 node ocfs2 cluster and access to the ocfs file system is
hanging from one of the nodes. Whether it's an rm command, ls
command, find command, etc, any process
May 24 02:10:49 ewhpbc3bl7 kernel: (26,0):o2hb_write_timeout:172 ERROR:
Heartbeat write timeout to device sda1 after 6 milliseconds
May 24 02:10:49 ewhpbc3bl7 kernel: (26,0):o2hb_stop_all_regions:1967
ERROR: stopping heartbeat on all active regions.
It means heartbeat took longer than
my_node1 kernel: ocfs2: Mounting device (253,8) on
(node 1, slot 1) with ordered data mode.
Regards,
Mark
*From:* Srinivas Eeda [mailto:srinivas.e...@oracle.com]
*Sent:* 16 November 2009 16:05
*To:* Thompson, Mark
*Cc:* ocfs2-users@oss.oracle.com
*Subject:* Re: [Ocfs2-users] 2
In a cluster with more than 2 nodes, if a network on one node goes down,
that node will evict itself but other nodes will survive. But in a two
node cluster, the node with the lowest node number will survive no matter on
which node the network went down.
thanks,
--Srini
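The rule stated above can be sketched in a few lines of Python. This is only an illustration of the behaviour as described in the message for a single node losing its network; the function name surviving_nodes and the node numbers are hypothetical, and real eviction also depends on disk heartbeat and timeout settings that are not modelled here.

```python
def surviving_nodes(nodes, disconnected):
    """Return the node numbers expected to survive a single network failure.

    nodes        -- node numbers of all cluster members
    disconnected -- node number of the member whose network went down
    """
    if len(nodes) < 2:
        raise ValueError("need at least two nodes")
    if disconnected not in nodes:
        raise ValueError("disconnected node is not a cluster member")
    if len(nodes) == 2:
        # Two-node cluster: the lowest node number survives,
        # regardless of which side lost its network.
        return [min(nodes)]
    # Larger cluster: the disconnected node evicts itself.
    return sorted(n for n in nodes if n != disconnected)


print(surviving_nodes([0, 1], disconnected=0))     # [0]
print(surviving_nodes([0, 1], disconnected=1))     # [0]
print(surviving_nodes([0, 1, 2], disconnected=0))  # [1, 2]
```

The first call shows the case that surprises people: in a two-node cluster, node 0 survives even when node 0 is the one whose network went down.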
Thompson, Mark wrote:
Hi,
*From:* Srinivas Eeda [mailto:srinivas.e...@oracle.com]
*Sent:* 16 November 2009 14:57
*To:* Thompson, Mark
*Cc:* ocfs2-users@oss.oracle.com
*Subject:* Re: [Ocfs2-users] 2 node OCFS2 clusters
In a cluster with more than 2 nodes, if a network on one node goes
down, that node will evict itself
Do you run into ENOSPC creating new files or extending existing files?
What is the cluster size? Don't think this may be the issue, but any
files under orphan directory? run (echo "ls //orphan_dir:000X" |
debugfs.ocfs2 device) to check if there are any.
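As a small aid for the check above, this Python sketch only builds the command strings, one per node slot, and does not run anything. It assumes that the X in 000X stands for the slot number zero-padded to four digits; the device path and slot count are placeholders, and the helper name orphan_dir_commands is hypothetical.

```python
def orphan_dir_commands(device, slot_count):
    """Build one 'ls //orphan_dir:NNNN' command line per node slot."""
    return [
        f'echo "ls //orphan_dir:{slot:04d}" | debugfs.ocfs2 {device}'
        for slot in range(slot_count)
    ]


# Placeholder device and slot count, for illustration only.
for command in orphan_dir_commands("/dev/dm-0", 2):
    print(command)
```

Each printed line can then be reviewed and run by hand on the node that has the device available.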
Juan Pablo Marco
) copied, 3,52704 seconds, 71,4 MB/s
Srinivas Eeda wrote:
I don't see any problem with your filesystem configuration. df
reports 63G free space, so you should be able to create files of that
size(approximately). But you are able to create one file of size 5gb
and no more? Or you are able
If you are doing a shutdown/init 6 check your init scripts to make sure
ocfs2 shuts down before network.
maike wrote:
Hi people, I installed ocfs in my virtual machine, with CentOS 5.3
and Xen. But when I turn off machine1, ocfs starts fencing
machine2. I read the doc in
Reid,
that error about stackglue module is harmless and can be ignored.
Are you seeing any other errors associated with mounts failing in dmesg?
Is the storage up by then? Can you list ls /etc/rc<runlevel>.d/
thanks,
--Srini
McKinley, Reid wrote:
Our ocfs2 mount points will not mount on
Goncalo,
run lsof |grep deleted on all nodes. If that lists any, it means some
process still has the file open. That file will not get cleaned till the
process exits or closes the file
If that command doesn't list any, there is a way(in ocfs2-1.4.2) to
clean, but it needs an unmount/mount
Florian,
The problem here seems to be with the network. The nodes are running into
a network heartbeat timeout and hence the second node is getting fenced. Do
you see the o2net thread consuming 100% cpu on any node? If not, then
probably check your network
thanks,
--Srini
florian.engelm...@bt.com wrote:
did you use -o datavolume, nointr options for mounting?
keyur patel wrote:
Hello All,
I have installed Oracle Cluster Manager on linux x86-64 bit. I am
using the ocfs file system for the quorum file. But I am getting the following
error. Please see the ocfs configuration below. I would appreciate, if
do you notice any errors in messages files during the same time as the
error reported in alert.log?.
please forward the output of echo stat
/arch/2_1917_656008464.dbf|debugfs.ocfs2 device
Can you please file a tar with Oracle and forward me the tar#.
thanks,
--Srini
Diane Petersen wrote:
.
- Original Message -
From: Srinivas Eeda srinivas.e...@oracle.com
To: Tim Hughes tim.hug...@tradefair.com
Cc: ocfs2-users@oss.oracle.com
Sent: Friday, 27 February, 2009 10:21:50 PM GMT +00:00 GMT Britain,
Ireland, Portugal
Subject: Re: [Ocfs2-users] how do I remove ocfs2 orphaned files
please run fsck.ocfs2 -f to clean the orphans.
If you can reproduce it at will, can you please enable tracing
debugfs.ocfs2 -l JOURNAL, SUPER, INODE, NAMEI, DLM_GLUE ENTRY EXIT allow
and send us the messages files from all nodes.
thanks,
--Srini
Tim Hughes wrote:
I am looking for a little
This is an ocfs2 superblock id that gets incremented for each ocfs2
mount. It gets reset whenever ocfs2 modules are re-loaded(on restarting
the node or /etc/init.d/o2cb restart). This is expected behavior and
will not cause any problems until it grows beyond 64K.
thanks,
--Srini
Madhu Velukur
The problem appears to be that IO is taking more time than the effective
O2CB_HEARTBEAT_THRESHOLD. Your configured value 31 doesn't seem to be
effective?
Index 6: took 1995 ms to do msleep
Index 17: took 1996 ms to do msleep
Index 22: took 10001 ms to do waiting for read completion.
Can you
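To put the numbers above side by side, here is a minimal Python sketch. It assumes the commonly documented relation that the heartbeat timeout is (O2CB_HEARTBEAT_THRESHOLD - 1) * 2 seconds, which is not stated in the message itself, and it pulls the slowest step out of the "Index N: took X ms" lines. The function names are hypothetical.

```python
import re


def heartbeat_timeout_ms(threshold, interval_ms=2000):
    """Timeout implied by a threshold, assuming (threshold - 1) * 2 seconds."""
    return (threshold - 1) * interval_ms


def slowest_step(log_text):
    """Return (index, milliseconds) of the slowest 'Index N: took X ms' entry."""
    steps = [
        (int(index), int(ms))
        for index, ms in re.findall(r"Index (\d+): took (\d+) ms", log_text)
    ]
    return max(steps, key=lambda step: step[1])


SAMPLE = """\
Index 6: took 1995 ms to do msleep
Index 17: took 1996 ms to do msleep
Index 22: took 10001 ms to do waiting for read completion.
"""

print(heartbeat_timeout_ms(31))  # 60000
print(slowest_step(SAMPLE))      # (22, 10001)
```

Under that assumption a configured value of 31 would allow 60 seconds, far more than the 10-second read seen here, which fits the suspicion above that the configured value is not the one in effect.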
John,
it's hard to tell without seeing the messages on the surviving node. Do
you remember how many node slots you created when formatting the
volume? Maybe you configured just 1? If so, use tunefs.ocfs2 to
increase the number of slots
If that's not the problem, please copy paste the
Colin,
space for files is allocated in cluster units. A 64k block is allocated
even if the file is only a few bytes. You might want to consider a lower cluster
size if the files are small.
thanks,
--Srini
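The effect of the cluster size on small files can be illustrated with a short Python calculation. This is a simplified sketch: it rounds each file up to whole clusters and ignores metadata and any other on-disk overhead, and the file and cluster sizes are example values only.

```python
def allocated_bytes(file_size, cluster_size):
    """Space consumed when a file is rounded up to whole clusters."""
    if file_size == 0:
        return 0
    clusters = -(-file_size // cluster_size)  # ceiling division
    return clusters * cluster_size


# A 100-byte file on a 64k cluster size versus a 4k cluster size.
print(allocated_bytes(100, 64 * 1024))  # 65536
print(allocated_bytes(100, 4 * 1024))   # 4096

# One million such files: total space consumed in GiB for each cluster size.
for cluster_size in (64 * 1024, 4 * 1024):
    total = 1_000_000 * allocated_bytes(100, cluster_size)
    print(cluster_size, round(total / 2**30, 1))
```

With many small files, the gap between the data size and the allocated size grows in proportion to the cluster size, which is why a smaller cluster size is suggested above.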
[EMAIL PROTECTED] wrote:
We have a 300GB ocfs2 filesystem shared by two unbreakable linux 2.0
Administrator
E-Care Contact Center Services
Phone:(204) 940-6244
Fax:(204) 940-7394
Srinivas Eeda
[EMAIL PROTECTED