We will be releasing one by tomorrow.
Christophe JOBARD (GHH) wrote:
Hi,
Where can I get the RPMs of the OCFS2 software for the new Red Hat
Enterprise 2.6.9-22.0.2 kernel (RH4 Update 2)?
Many Thanks,
Christophe JOBARD
oops... it'll be fixed today.
Mathieu Avila wrote:
Norbert Tretkowski wrote:
* Mathieu Avila wrote:
I must have missed something obvious, but I can't see what. Any
ideas?
You forgot indentation in the configuration file.
Norbert
Thank
the ocfs perspective?
Best regards,
Llorenç Vanaclocha
-Original Message-
From: Sunil Mushran [mailto:[EMAIL PROTECTED]
Sent: Saturday, March 11, 2006 12:29 AM
To: Vanaclocha Llorens, Jose Lorenzo
Cc: ocfs2-users@oss.oracle.com
Subject: Re: [Ocfs2-users] Add a new node
Is this a shared disk?
Do:
# echo stats | debugfs.ocfs2 -n /dev/sdX | grep UUID
on all nodes
Is the UUID the same?
Oneill wrote:
Hi!
I am working on an Oracle cluster but I cannot get further because the ocfs2
nodes don't synchronize.
I can create an ocfs2 filesystem on both machines if I want but they
Message-
From: Sunil Mushran [mailto:[EMAIL PROTECTED]
Sent: Thursday, March 30, 2006 5:34 PM
To: Vaidya, Sachin
Cc: ocfs2-users@oss.oracle.com
Subject: Re: [Ocfs2-users] Getting error when
mounting shared OCFS2 device.
Remove vip and mount
In a 2 node setup, if node 0 or 1 crashes, the other node should survive.
The one issue encountered by many users was that while shutting down node 0,
node 1 would fence itself. This was because of the sequencing of
service shutdowns. We added the ocfs2 init script to handle shutdown
sequencing.
You may want to upgrade to 1.2.1. We have done fixes in this area.
Jim Erb wrote:
Can anyone tell me what might be happening here. I have a 3 node
cluster running under RH AS 4 (2.6.9-22.0.1.ELsmp) with ocfs2 v.
1.2.0-1. I've recently implemented elevator=deadline in grub.conf to
fix some
ocfs2-tools includes two init scripts, o2cb and ocfs2. Ensure the
scripts are active and running in the correct sequence. As in,
the startup sequence should be network, o2cb and then ocfs2. The shutdown
sequence is the reverse of that.
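The ordering described above can be checked mechanically. What follows is only a sketch: it assumes SysV-style start links whose names look like S<priority><service>, and the function name check_start_order is made up for illustration. Real priorities differ between distributions.

```shell
#!/bin/sh
# check_start_order: read start-link names (e.g. S10network) on stdin, one per
# line, and succeed only if network starts before o2cb and o2cb before ocfs2.
# The S<priority><service> naming is an assumption about the init layout.
check_start_order() {
    awk '
        # Split "S<priority><service>" into numeric priority and service name.
        match($0, /^S[0-9]+/) {
            prio = substr($0, 2, RLENGTH - 1) + 0
            name = substr($0, RLENGTH + 1)
            p[name] = prio
            seen[name] = 1
        }
        END {
            # Exit 2 when one of the three services has no start link at all.
            if (!seen["network"] || !seen["o2cb"] || !seen["ocfs2"]) exit 2
            exit !(p["network"] < p["o2cb"] && p["o2cb"] < p["ocfs2"])
        }
    '
}
```

On a system that keeps its start links in a directory such as /etc/rc3.d, something like "ls /etc/rc3.d | check_start_order" would exercise it. The shutdown order can be checked the same way against the kill links, with the comparison reversed.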
[EMAIL PROTECTED] wrote:
Anyone experience OCFS2 hanging the system on
OCFS2 does not have a disk-based dlm. Net connectivity is a must.
Leonardo de Assis wrote:
Hi,
I have two machines that do not have a network connection. If my disk
can be shared between them, is there a way to use a disk-based dlm or
any other method that does not rely on network access?
The hb failure is just the effect of the ios not completing within 12 secs.
The full oops trace gives the last 24 ops and their timings.
One solution is to double up the hb timeout. Set,
O2CB_HEARTBEAT_THRESHOLD = 14
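For picking a threshold value, here is a small sketch of the arithmetic. It assumes the timeout in seconds equals (threshold - 1) * 2, which agrees with the 60 secs / 31 pairing quoted elsewhere on this list and would put the 12 sec default at a threshold of 7. The function names are made up for illustration.

```shell
#!/bin/sh
# hb_threshold SECONDS
# Convert a desired disk heartbeat timeout into an O2CB_HEARTBEAT_THRESHOLD
# value, assuming timeout = (threshold - 1) * 2 seconds.
hb_threshold() {
    echo $(( $1 / 2 + 1 ))
}

# hb_timeout THRESHOLD
# The inverse: seconds of timeout implied by a given threshold.
hb_timeout() {
    echo $(( ($1 - 1) * 2 ))
}
```

Under that assumption "hb_timeout 14" gives 26, a little over double the 12 sec default, and "hb_threshold 60" gives 31.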
Brian Long wrote:
Hello,
I have two nodes running the 2.6.9-22.0.2.ELsmp
Yes, we are missing that bit. File a bug on http://oss.oracle.com/bugzilla
component ocfs2-tools.
[EMAIL PROTECTED] wrote:
hi,
maybe this is not the place to file a bug, but
I think there is one in /etc/init.d/o2cb.
the script should be used to create the config file
Straße 29
01189 Dresden
Telefon: +49 (0) 351/4021 655
Telefax: +49 (0) 351/4021 696
Mailto: [EMAIL PROTECTED]
Web: www.robotron.de
-Original Message-
From: Sunil Mushran [mailto:[EMAIL PROTECTED]
Sent: Tuesday, June 13, 2006 6:14 PM
To: Marco Friebe
Cc: ocfs2-users
It's not a sysctl entry. It won't work that way.
Set the required value in /etc/sysconfig/o2cb
and restart the cluster. Do it on all nodes.
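A minimal sketch of making that edit the same way on every node. It assumes the file holds shell-style KEY=VALUE lines with no spaces around the equals sign; set_o2cb_value is a made-up helper name, not part of ocfs2-tools.

```shell
#!/bin/sh
# set_o2cb_value FILE KEY VALUE
# Replace an existing KEY=... line in FILE, or append one if none exists.
# Assumes FILE contains shell-style KEY=VALUE assignments.
set_o2cb_value() {
    file=$1; key=$2; value=$3
    tmp="$file.tmp.$$"
    if grep -q "^$key=" "$file"; then
        # Rewrite through a temp file, then copy back so FILE keeps its
        # ownership and permissions.
        sed "s|^$key=.*|$key=$value|" "$file" > "$tmp" &&
            cat "$tmp" > "$file" &&
            rm -f "$tmp"
    else
        echo "$key=$value" >> "$file"
    fi
}

# Example (as root, on every node, followed by a cluster restart):
#   set_o2cb_value /etc/sysconfig/o2cb O2CB_HEARTBEAT_THRESHOLD 31
```

The helper only edits the file. The new value takes effect once the cluster stack is restarted on that node.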
[EMAIL PROTECTED] wrote:
hi,
I'm just thinking about changing the heartbeat threshold of our cluster
online by issuing
# echo 31
Check out http://oss.oracle.com/bugzilla/show_bug.cgi?id=723
Peter McMahon wrote:
All
still working on the use of OCFS2
Yesterday, when we were running autoconfig for an Apps
DB node in a RAC cluster the other node crashed
extract from /var/log/messages...is below...
If anyone
Is it always the mount using node slot 1 that fails? If so, the jbd
superblock may be corrupted for that slot.
Grow the journal by, say, 1MB. It will reinitialize the JBD superblock
for all the slots. Either that or just reformat the device.
To see the size of the existing journal, do:
# echo
I would like the entire /proc/meminfo and /proc/slabinfo.
Dump it to a file every 1 min or so.
What version of the kernel/ocfs2?
Paul Jimenez wrote:
On Jun 29, 2006, at 8:22 AM, Brian Long wrote:
On Wed, 2006-06-28 at 17:03 -0500, Paul Jimenez wrote:
I'm getting out of memory
://www.rgmadvisors.com/~pj/memslabinfo.
Kernel is 2.6.16.7 vanilla, and the version of ocfs2 it came with.
--pj
On Jun 29, 2006, at 2:10 PM, Sunil Mushran wrote:
I would like the entire /proc/meminfo and /proc/slabinfo.
Dump it to a file every 1 min or so.
What version of the kernel
of lowmem? will turning on HIGHPTE
be enough to fix this?
--pj
On Jun 29, 2006, at 5:02 PM, Sunil Mushran wrote:
HighFree: 11877028 kB
LowFree: 391020 kB
HighFree: 11761892 kB
LowFree: 342380 kB
HighFree: 11654316 kB
LowFree: 315860 kB
HighFree
Before you can mount, you have to ensure all the nodes
in the cluster access the same device.
# echo stats | debugfs.ocfs2 -n /dev/sdX | grep UUID
should return the same uuid from all nodes.
Once all nodes can see the same device, then you can mount
it on all nodes. There are no passive node(s).
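Once the UUID has been collected from each node, the comparison itself is easy to script. This is a sketch only. The exact layout of the UUID line is not shown in this thread, so the function takes already-extracted "node uuid" pairs, and the name same_uuid is made up.

```shell
#!/bin/sh
# same_uuid: read "node uuid" pairs on stdin, one per line, and succeed only
# if at least one pair was given and every node reported the same UUID.
same_uuid() {
    awk '
        NF >= 2 { count[$2]++; lines++ }
        END {
            n = 0
            for (u in count) n++      # number of distinct UUIDs seen
            exit !(lines > 0 && n == 1)
        }
    '
}
```

If the check fails, the nodes are not looking at the same device and the mount should not be attempted yet.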
ocfs2-tools 1.2.2 will have the offline-extend feature.
Still in testing.
Karen Penman wrote:
Hi All,
Can anyone tell me if OCFS2 filesystems can be dynamically extended? If not,
is this something that is likely to be available in the future?
Thanks,
Karen
Check dmesg on both nodes.
The error indicates that the connect failed. Ensure the ip addresses
of all nodes in /etc/ocfs2/cluster.conf are correct. Also ensure that
the conf file is the same on all nodes.
Try pinging the other node on the configured interface:
# ping -I ethX node1
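To see at a glance which address each node is configured with, the conf file can be summarised. This sketch assumes the usual stanza layout: an unindented "node:" header followed by indented "key = value" lines. list_node_ips is a made-up name.

```shell
#!/bin/sh
# list_node_ips: read a cluster.conf-style file on stdin and print
# "name ip_address" for every node: stanza.
# Assumes unindented stanza headers and indented "key = value" lines.
list_node_ips() {
    awk '
        function emit() {
            if (name != "" || ip != "") print name, ip
            name = ""; ip = ""
        }
        /^node:/   { emit(); in_node = 1; next }   # start of a node stanza
        /^[^ \t]/  { emit(); in_node = 0; next }   # any other stanza header
        in_node && $1 == "ip_address" { ip = $3 }
        in_node && $1 == "name"       { name = $3 }
        END { emit() }
    '
}

# Usage: list_node_ips < /etc/ocfs2/cluster.conf
```

Running it on each node and comparing the output is a quick way to spot a conf file that differs between nodes.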
Akin Seigmund
ocfs2-tools 1.2.2 :)
Brian Long wrote:
On Fri, 2006-06-30 at 16:10 -0700, Sunil Mushran wrote:
All,
We are pleased to announce the release of OCFS2 1.2.2.
This release includes some recent fixes, including bugzilla#723
http://oss.oracle.com/bugzilla/show_bug.cgi?id=723.
(Users
OCFS2 relies on the uniqueness of the uuid to distinguish between
different volumes. One cannot mount two volumes having the same uuid
on the same node. In fact, one should not do that across the cluster either,
i.e., mount two different physical volumes having the same uuid.
If you
|| upd_vsize || upd_uuid) {
block_signals(SIG_BLOCK);
ret = ocfs2_write_super(fs);
if (ret) {
Sunil Mushran wrote:
Please could you send it to me again in the diff -u -p format.
Andre Brinkmann wrote:
Sorry,
here is the patch as text:
For the Makefile:
39c39
$(LINK
Check the support guide on cluster start/stop in the doc section on
http://oss.oracle.com/projects/ocfs2.
Vicki Luo wrote:
I installed OCFS2 on RHEL4 with ocfs2-2.6.9-22.ELsmp-1.2.2-1.i686.rpm. When
I start ocfs2console and click on Cluster, and then Configure Nodes, it
returns a dialog
-JPH
Sunil Mushran wrote:
The 12 sec default is low. Bump it up to 30 secs or even higher. FAQ
has the details.
The higher you set it to, the longer the brown-out time.
Jeffery P. Humes wrote:
I have an OCFS2 filesystem on a coraid AOE device.
It mounts fine, but with heavy I/O the server
What version of ocfs2 is on the nodes? Do modinfo ocfs2 on all nodes.
The version of OCFS2 shipped with SLES9 SP3 varies with kernel.
Are you using the modules shipped by suse or building them yourself?
Vladan Gunjic wrote:
I've got a strange issue with the following configuration:
Using
Just create a one node cluster.
However, if you were to mount two mirrored volumes on the same node,
you will have problems as detailed in this thread:
http://oss.oracle.com/pipermail/ocfs2-users/2006-July/000630.html
Thanks to Andre, the next drop of ocfs2-tools will have a fix for this
When you added the new node using ocfs2console, did it show up in:
# ls /config/cluster/clustername/node/
I am assuming that it was added in /etc/ocfs2/cluster.conf.
Yes, the docs do not cover this as of now. I will update the
FAQ/user's guide with the info.
Peter Santos wrote:
-BEGIN
restarted the cluster on node1. (transport
endpoint errors..)
We will definitely try again on a 3rd node, I'm just not clear on what the
sequence of events should be.
thanks
peter
Sunil Mushran wrote:
When you added the new node using ocfs2console, did it show up in:
# ls /config/cluster
on ocfs2 version 1.2.1 ?
Although they were not directly involved in the corruption, maybe indirectly?
Thanks,
Vladan
-Original Message-
From: Sunil Mushran [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 1, 2006 4:29 AM
To: Vladan Gunjic
Cc: ocfs2-users@oss.oracle.com
Subject: Re
1. o2net talks tcp. It should be able to handle this.
2. If the cluster is active and the nodes are communicating,
the keepalive packet is rarely sent. It only sends the packet
if it does not hear from the other node for 5 secs.
3. Try the same with 1.2.3. (We made 2 important 1 line fixes.)
4.
ocfs2 requires a shared disk. As in, all nodes must be able to concurrently
read/write to the device.
sorapak Last wrote:
Yes, my disk is an IDE disk. Would it cause the problems?
Thanks
Sorapak
Alexei_Roudnev wrote:
In my case, after spending a few days, I found that my HugeTLB setting (in
Oracle) caused a long kernel loop and it forced OCFSv2 to reboot because of
the lost connection.
I am keen to hear more about this. Please could you elaborate.
Do make rpm instead.
Change Copyright to License in the spec file and do make rpm.
I built the following for fc5/x86.
http://oss.oracle.com/~smushran/.fc5-rpms/
Eric Adair wrote:
building on fedora core 5, kernel 2.6.16.-1.2133.FC5smp
Everything builds fine, but I can't find a means to make
# cd /tmp
# wget http://oss.oracle.com/~smushran/.debug/stat_sysdir.sh
# ./stat_sysdir.sh -d sdX > sys.out
Email me the output.
amit pansare wrote:
I’ve an issue related to Oracle 10g RAC.
I’ve 2 node cluster each being Dell 2850 Server with RHEL 4.0
I’ve EMC CX300 SAN storage with following
U4 has the fix.
We've tested U2 (and U3) + fix internally already. So we don't feel the
need to rerun the same test.
Brian Long wrote:
Has anyone at Oracle tested the RHEL 4.4 beta or GA kernel to verify the
cfq scheduler is fixed wrt. OCFS2? Or will that testing only begin now
where that IP may exist.
- -peter
Sunil Mushran wrote:
http://oss.oracle.com/projects/ocfs2/dist/documentation/ocfs2_faq.html#CONFIGURE
Peter Santos wrote:
Folks,
I have a simple 2 node 10gR2 RAC cluster. Each node has a
public/private and virtual IP.
We moved the network
As far as ocfs2 is concerned, bio_add_page() is failing. The one thing that
springs to mind is that o2hb sets bio->bi_sector to 512 bytes and not
the block size.
Kilian CAVALOTTI wrote:
Hi all,
I'm new to OCFS2, but not so new to DRBD. I'd like to use the new
primary/primary feature of DRBDv8
Well, mounted.ocfs2 is dumb... as in, it just scans /proc/partitions.
We have to teach it new tricks. :)
Fabio Corazza wrote:
Hi there,
I've just set up an EVMS cluster with Heartbeat 2.0.7 and OCFS2.
Everything seems to be working fine except this:
[EMAIL PROTECTED] photos]# mounted.ocfs2 -d
appreciated.
Fabio
).
- Original Message -
From: Holger Brueckner [EMAIL PROTECTED]
To: Sunil Mushran [EMAIL PROTECTED]
Cc: ocfs2-users@oss.oracle.com
Sent: Friday, September 15, 2006 1:20 AM
Subject: Re: [Ocfs2-users] self fencing and system panic problem after forced
reboot
I guess I found the solution. while dumping
-- 0 503 500 0
12-Aug-2006 10:40 0041a286
-Original Message-
From: Sunil Mushran [mailto:[EMAIL PROTECTED]
Sent: Wednesday, September 20, 2006 12:32 PM
To: Matthew Flusche
Cc: Ocfs2-users@oss.oracle.com
Subject: Re: [Ocfs2-users] ocfs2 - disk usage
Yes.
Bill Wells wrote:
All,
Can someone comment on whether it is recommended to use the OCFS2
file system for the admin directories of a RAC database.
Specifically, for bdump, udump, cdump, etc.
This is being considered on RHEL4-U4 with 10gR2 on a 3 node cluster.
Thanks much,
Bill Wells
File a bug on bugzilla (oss.oracle.com/bugzilla) with the full oops trace
and any other information that seems relevant.
Galan Merchan, Martin wrote:
Hello,
I’m working with OCFS2 on Red Hat Advanced Server 4 Patch 3 and I had
kernel panics too. I use OCFS2 only for RAC archive logs and RMAN
Yes, the last patch to add this feature is in review. We will release
this as part of ocfs2-tools 1.2.2.
Kerr-Sheppard, Stephen wrote:
Has anyone had to resize a mountpoint in ocfs2. In ocfs version 1 it
was a case of unmounting and using the resizeocfs command. Is this
still the same for
tcpdump -i eth1 -C 10 -W 15 -s 1 -Sw /tmp/`hostname -s`_tcpdump.log
-ttt 'port '
Do this on both nodes before mounting on the second node. Ping me with
the path to the logs.
[EMAIL PROTECTED] wrote:
Hello All,
I have a NAS that I would like to use ocfs2 on. Currently there are
Martin J. Evans wrote:
fine but on selecting cluster/configure nodes I still get a dialogue
saying "Could not query the state of the cluster stack. This must be
resolved before any OCFS2 filesystem can be mounted."
Could be because the script is installed as o2cb and not o2cb.init.
Fedora
Still in testing. It is a larger patch than normal and thus requires
more time/effort. Once we are comfortable with it, we will look into
releasing the patch for others to test before releasing 1.2.4.
Jonah H. Harris wrote:
What's the status on this? I've researched Bugzilla, SVN, and the
Thanks for all the replies in the previous usage poll.
One of the chief concerns expressed was the (very) low default disk
heartbeat timeout setting. Well, we want to bump it up but to what?
Here are some questions, the answers to which will help us determine that value.
1. What is your disk
Ping Novell. They issue interim PTF SLES kernels with the required fix(es)
to help users tide over until the formal release.
Needless to add, you need to have Novell Support.
Andy Kipp wrote:
Hello all,
I am running SLES9 with the latest kernel patches (2.6.5-7.282-bigsmp)
and ocfs2 version
# ./configure --with-kernel=/usr/src/kernels/2.6.9-42.X.EL-smp-i686/
# make rhel4_2.6.9-42.X.EL_rpm
The rpms will be in the rpmdir as specified in ~/.rpmmacros.
~$ cat .rpmmacros
%_topdir /rpmbuild/user
%_tmppath /rpmbuild/user/tmp
%_sourcedir /rpmbuild/user/SOURCES
%_specdir
As the ocfs2 home page suggests, when building 1.2.x against mainline
2.6.14 and above, specify GENERIC_DELETE_INODE_NOT_TRUNCATES=1.
Peter Larsen wrote:
I'm running 1.2.2 here - compiled from source, and while I can read
files, trying to delete a file on my OCFS2 volume produces the following:
Fabio Corazza wrote:
Last but not least.. a question for Sunil if he's gonna read this.. when
OCFS2 will support data-on-inode would we need to reformat the file
systems or will the new module be compatible with the 1.4 on-disk data?
I am envisioning a compat flag to be added on existing
Oct 11 05:15:28 vhaispora01 kernel: cciss0: unsolicited abort f7000250
Oct 11 05:15:28 vhaispora01 kernel: cciss0: retrying f7000250
That's where the problem begins. The cciss driver is unable to
complete the ios, maybe due to a bus reset. Ping HP or whoever your
contact is for the MSA500.
Please file a bugzilla with the details provided. It is easier to manage
bugs that way.
Thanks
Christian Schlittchen wrote:
Thanks to synchronous writes on the log-files I finally managed to get
a log of the regular panics we experience.
The setup is as follows: Three blades (IBM HS20)
The first issue could be because you don't have ocfs2-tools 1.2.2. The
earlier version was missing a line in the ocfs2 init script.
Rafal Maliszewski wrote:
Hi guys
I installed ocfs2 on 4 nodes (redhat 4u3) on shared FC devices (EMC
storage).
So I've noticed several problems:
1. When I
Which version of OCFS2?
Did you run fsck.ocfs2 -f on that device?
Do:
# echo stat 6518860 | debugfs.ocfs2 -n /dev/sdX > /tmp/ext.out
Email ext.out.
Andy Kipp wrote:
Anybody have any idea what this error involves? Or how to resolve it?
Oct 30 05:11:24 groupwise-1-mht kernel:
Are you using NFS by any chance? I am looking into bug#790
that also encounters the same error (ESTALE).
Matthew Flusche wrote:
I received the following error messages in the system logs. Is this
anything to be concerned with?
kernel: (4074,0):ocfs2_populate_inode:234 ERROR: Invalid
Replace sdX with the device on which the ocfs2 fs exists. You can use
mount | grep ocfs2 to find that volume.
If the inode on disk is good, one explanation for the issue could be the
lvb bug which was fixed in 1.2.2. Ping Novell to get a PTF kernel with
ocfs2 1.2.3.
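For finding the device to substitute for sdX, a slightly stricter variant of the mount | grep ocfs2 suggestion is sketched below. It assumes the usual "DEVICE on MOUNTPOINT type FSTYPE (OPTIONS)" layout of mount output, and ocfs2_devices is a made-up name.

```shell
#!/bin/sh
# ocfs2_devices: read `mount` output on stdin and print the device of every
# line whose filesystem type is exactly ocfs2.
# Assumes the "DEVICE on MOUNTPOINT type FSTYPE (OPTIONS)" layout.
ocfs2_devices() {
    awk '$2 == "on" && $4 == "type" && $5 == "ocfs2" { print $1 }'
}

# Usage: mount | ocfs2_devices
```

Matching the type field exactly, rather than grepping the whole line, skips entries that merely contain the string, such as an ocfs2_dlmfs mount or a mount point with ocfs2 in its path.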
Andy Kipp wrote:
Which
So it is bug#790. It just may be a case of unnecessary error messages
for you. I am still investigating it.
Matthew Flusche wrote:
Yes, one of the clustered file systems is shared with nfs.
-Original Message-
From: Sunil Mushran [mailto:[EMAIL PROTECTED]
Sent: Tuesday, October 31
To monitor ocfs2 memory usage, do:
# cat /proc/slabinfo | egrep 'ocfs|dlm|size-256 |size-32 '
ocfs2_lock16226 16 2261 : tunables 120 60
0 : slabdata 1 1 0
ocfs2_inode_cache 22 24 115231 : tunables 24 12
0 : slabdata 8
You are probably looking for a distributed file system. Check
out afs and/or v9fs.
Thad Beier wrote:
Dear Sirs and Madams,
I run a small visual effects production company, Hammerhead Productions.
We'd like to have an easily extensible, inexpensive, relatively
high-performance storage network
I would imagine you are using RHEL4. If so, upgrade the ocfs2-tools
to 1.2.2. The previous version of the ocfs2 init script did not always
umount ocfs2 volumes on clean shutdowns leading to this problem.
[EMAIL PROTECTED] wrote:
Hi to all:
In 2 nodes environment I've 'suffered' the 'reboot 1st
of 3,400 IO/sec while the same benchmark with the
same data will max out at 7K+ IO/sec on RAW.
I'll grab the iostat data which we've kept over time and try to make
some sense of it before posting anything additional.
Thanks.
/Brian/
On Thu, 2006-11-09 at 10:20 -0800, Sunil Mushran wrote:
Why
None of these locks are busy. So they should not be the cause of the
problem.
Start with the version of ocfs2. Also, which kernel?
What does top say? Is some process spinning?
Also, what does this stresstest entail?
Stephan Hendl wrote:
Hi,
I use a cluster of 4 nodes with ocfs2 as a
It will be easier if you file a bug on oss.oracle.com/bugzilla with all
the details. Like messages files from all nodes, etc.
Why are you using 1.2.1? 1.2.3 has been out for a few months now.
Randy Ramsdell wrote:
Hi,
Maybe someone could elaborate on these recurring ocfs2 errors that
always
The quick detect just looks for the superblock which is in the third
block of the device. The full detect looks up the superblock and then
the system directory. In your case it fails to locate the latter.
This is one of the quirks when using an unpartitioned disk and later
partitioning it. The
input is greatly appreciated.
Thanks,
Colin Farley
Network Administrator
E-Care Contact Center Services
Phone:(204) 940-6244
Fax:(204) 940-7394
Sunil Mushran
system).
Well-known problem with OCFSv2. One solution is to add a 3rd node and use
interface bonding (be sure that the interface convergence time is less than
the o2cb timeout).
- Original Message -
From: [EMAIL PROTECTED]
To: Sunil Mushran [EMAIL PROTECTED]
Cc: ocfs2-users@oss.oracle.com
Sent
and everything will change.
- Original Message -
From: Sunil Mushran [EMAIL PROTECTED]
To: Alexei_Roudnev [EMAIL PROTECTED]
Cc: [EMAIL PROTECTED]; ocfs2-users@oss.oracle.com
Sent: Wednesday, November 15, 2006 11:03 AM
Subject: Re: [Ocfs2-users] ESX and Unbreakable 2.0 OCFS2 problem
You
is this message getting
10 seconds from?
Also this message is displayed because dbo2 was not able to check into the
heartbeat filesystem, right?
- -peter
Sunil Mushran wrote:
On nodes db01 and db03 hb timed-out at 17:12:49. However, the nodes
did not fully panic. As in, the network
Refer to CDSL (Context Dependent Symbolic Links) in the OCFS2 user's guide.
Marcel Savelkoul wrote:
Hi,
I'm setting up a 2-node Oracle 9i RAC on OCFS2.
But I have some problems with understanding how the shared Oracle_Home
is being used.
For instance there is the
You are on a very old release of OCFS2. The OCFS2 homepage and FAQ both
list a SLES9 kernel version newer than the one you are using.
But that may not be the reason for the error. My bet is that bdb is
attempting to create a shared writeable mmap that ocfs2 1.2 does not support.
[EMAIL
strace apache. That may provide us with some clues.
[EMAIL PROTECTED] wrote:
Hello all,
Has anyone installed Oracle Application Server 10.1.2.0.2
Infrastructure tier including the preseeded 10.1.0.4 database (High
Availability option otherwise known as a cold failover cluster) on
OCFS2
ocfs2 supports private mmap r/w and shared mmap readonly.
Shared mmap writeable is the only piece missing. We should have that by 1.4.
Alexei_Roudnev wrote:
There was a clear answer as to WHY it did not work on OCFSv2:
- BerkeleyDB and LDAP use mmap on the files;
- OCFSv2 doesn't implement it
, they
will never be part of that domain.
Sunil Mushran wrote:
Currently it supports only one cluster.
Peter Santos wrote:
Folks,
When I installed ocfs2 the first time and setup oracle to work
with it, the clustername defaulted to
ocfs2. We
depmod -a ?
Lin Shen (lshen) wrote:
Switched the kernel to 2.6.9-42.ELsmp, still got the same error.
[EMAIL PROTECTED] Desktop]# uname -a
Linux cfs2 2.6.9-42.ELsmp #1 SMP Wed Jul 12 23:27:17 EDT 2006 i686 i686
i386 GNU/Linux
-Original Message-
From: Sunil Mushran [mailto:[EMAIL
code is pretty well contained and isolated. while we have
discussed tipc,
not sure if we ever gave it a serious look.
lin
-Original Message-
From: Sunil Mushran [mailto:[EMAIL PROTECTED]
Sent: Thursday, January 04, 2007 1:21 PM
To: Lin Shen (lshen)
Cc: ocfs2-users
theoretically yes... but for practical usage go with at least iscsi
Lin Shen (lshen) wrote:
So w/o a shared disk, is it possible to make OCFS2 work by utilizing
GNBD or similar?
lin
-Original Message-
From: Sunil Mushran [mailto:[EMAIL PROTECTED]
Sent: Thursday, January 04, 2007 2
That and also we've seen similar issues with Broadcom TG3 drivers. We use
Intel E1000 mostly and thus did not experience the same issue.
As far as the configurable net timeouts go, the patch was added into
mainline on Dec 4th. So it will be available with ocfs2 1.4. We are still
seeing if we
Lot of ink has been spilled on this subject. ;)
Check out the heartbeat section in the FAQ. One easy solution is to
increase the hb timeout to 60 secs...
O2CB_HEARTBEAT_THRESHOLD = 31
We are leaning towards making that number the default in the 1.4 release.
George Liu wrote:
Both systems
You are using two different versions of ocfs2 on the two nodes.
Different enough that they are not network compatible.
It is working as designed.
Consulente3 wrote:
Hi,
I'm new to ocfs2, and in my test environment, I have
2 nodes, becks and vaix.
becks can mount ocfs2 fs, but vaix can't.
Looks to be running out of lowmem.
# date
# cat /proc/meminfo
# cat /proc/slabinfo
Run a script that dumps the above every 1 to 5 mins. That should
help explain the cause.
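A minimal sketch of such a script. The snapshot function name, the output path and the 60 second interval are arbitrary choices for illustration; reading /proc/slabinfo normally needs root.

```shell
#!/bin/sh
# snapshot OUTFILE FILE...
# Append a timestamp followed by the contents of each FILE to OUTFILE.
snapshot() {
    out=$1
    shift
    {
        date
        for f in "$@"; do
            echo "== $f"
            cat "$f"
        done
    } >> "$out"
}

# Example loop (as root; interval and output path are arbitrary):
#   while true; do
#       snapshot /tmp/memslab.out /proc/meminfo /proc/slabinfo
#       sleep 60
#   done
```

Keeping everything in one appended file, with a date line before each sample, makes it easy to see how LowFree and the slab counts move over time.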
Brian Sieler wrote:
Using 2-node clustered file system on DELL/EMC SAN/RHEL
2.6.9-34.0.2.ELsmp x86_64.
Config:
All,
http://oss.oracle.com/~smushran/.ocfs2-1.2.4-0.2/
The final 1.2.4 should look very close to this drop. We still have one
slippery issue open that we are working on. But, other than that, this
drop is looking good.
The list of patches added post 1.2.4-0.1 is as follows:
r2948: fs - Allow
1. In SLES10, the /config has been moved to /sys/kernel/config. That's
how it is on mainline.
2. To monitor heartbeat do:
# watch -d -n2 debugfs.ocfs2 -R hb /dev/sdX
This command will work if you have ocfs2-tools 1.2.2. (Not sure whether
sles10 ships with 1.2.2 or 1.2.1.) If 1.2.1, do:
# watch
#define EACCES 13 /* Permission denied */
The messages are harmless. Patch to silence them has already been checked
into the 1.2 repo and mainline git.
Matthew Flusche wrote:
I’m seeing the following errors in my two node cluster. Is this
anything to be concerned with?
Host information:
o2net timeout cannot cause the o2hb panic. The two are totally
different. From the outputs, I would guess o2hb is timing out but
I cannot say for sure until I see the full logs.
Andy Phillips wrote:
It's worth pointing out that the o2net idle timer is triggering on the
network heartbeat,
This was the lvb issue that was fixed long ago. In the 1.2 tree, it was
fixed in 1.2.2.
2.6.18 should definitely have the fix for this.
davide rossetti wrote:
OS: Fedora Core release 4 (Stentz)
KERNEL: Linux rack1.ape 2.6.17-1.2142_FC4smp #1 SMP Tue Jul 11
22:57:02 EDT 2006 i686 i686 i386
wrote:
On 1/23/07, Sunil Mushran wrote:
This was the lvb issue that was fixed long ago. In the 1.2 tree,
it was fixed in 1.2.2.
2.6.18 should definitely have the fix for this.
it seems it's even more recent:
/var/log/messages.4:Dec
The o2cb script fix is in ocfs2-tools 1.2.2 released Oct 2006.
Ping SUSE for the update.
[EMAIL PROTECTED] wrote:
Using SuSE SP2 Linux running V1.0.8 of OCFS2 and the tools/console
that comes with SP2 distribution.
I am unable to set the O2CB_HEARTBEAT_THRESHOLD parameter in the
This is not a fs issue. As in the file must be alright. This is a dlm issue.
The fs is asking the dlm to free the lock and the dlm is stuck. How many
nodes do you have? We've fixed a bunch of dlm bugs since what you appear
to be running.
davide rossetti wrote:
I rebooted the two faulty nodes.
All,
We are pleased to announce the release of OCFS2 1.2.4-2.
This release addresses the lowmem consumption issue that has plagued
many users.
It also addresses a few races in the dlm relating to the lockres migration.
The complete list of changes post 1.2.3 is available here:
It could be that the device name is not the same across the two nodes.
Do:
# mounted.ocfs2 -d
on both nodes. Match the device using the uuid. As in, you
should see a device with the same uuid on both nodes. If not,
then the device is not shared.
If you do see the device on both nodes but with
The device needs to be shared. As in, both nodes need to be able
to see the same device concurrently.
Refer to iscsi, fiber channel, aoe, etc.
aibolit 66 wrote:
-Original Message-
From: Sunil Mushran [EMAIL PROTECTED]
To: aibolit 66 [EMAIL PROTECTED]
Date: Mon, 05 Feb 2007 12:46:26
That's the source.
Randy Ramsdell wrote:
Mark Fasheh wrote:
On Tue, Feb 06, 2007 at 10:18:51AM -0500, Randy Ramsdell wrote:
Is source available?
http://oss.oracle.com/projects/ocfs2/dist/files/source/v1.2/ocfs2-1.2.4.tar.gz
--Mark
--
Mark Fasheh
Senior
The following patch will address this issue. The fix will be provided
with the next tools release.
Index: libocfs2/include/ocfs2.h
===================================================================
--- libocfs2/include/ocfs2.h	(revision 1269)
+++ libocfs2/include/ocfs2.h	(revision 1270)
The datavolume code is not in mainline. But you should
be able to get Oracle RDBMS to work with it. Ensure the
init.ora parameter filesystemio_options is set to direct_io.
Ivo Maya wrote:
Hi,
I need to mount ocfs2 with datavolume option on
openSUSE 10.2 machines.
ocfs2 is 1.3.3 version and
What does dmesg say?
Randy Ramsdell wrote:
Hi,
Everything compiled correctly for the ocfs2 package, but so far the
modules will not load with the well known module symbol error.
FATAL: Error inserting ocfs2
(/lib/modules/2.6.16.27-0.6-smp/kernel/fs/ocfs2/ocfs2.ko): Unknown
symbol in module,