One point I missed out earlier is that ocfs2 should not be viewed as
an Oracle only product. When the fs was pushed into the mainline kernel,
it was because we wanted it to become a community project. Like Linux
itself.
Until some time ago, we would credit non-oracle people who contributed
patches
Unsure what the confusion is. The answer was correct. fscat can read
certain unmounted fs. The list is mentioned on the home page. ext2, ext3,
ocfs and ocfs2. It does not write to these file systems when they are
unmounted. It can write to any mounted file system.
Mehmet Can ÖNAL wrote:
> Hi ever
Node 0 does not know about this node. Run the o2cb_ctl command
on node 0 to add a node to a live cluster as listed in 1.4 user's guide.
Andrew Deagman wrote:
> I am receiving the following error when I attempt to add a node to the
> cluster:
>
> o2net: connected to node bapp02 (num 5) at 10.10.16
Ask Novell whether you should upgrade or reinstall the OS. It is
afterall a large jump. From sles9 sp3 to sles10 sp2.
But when you do the change, install the latest sles10 sp2 kernel.
The file system can be used as is. There is no need to enable new features
immediately. Most of them are not usef
OTN has a forum for ASM. Please post this qs on that forum.
On Feb 7, 2009, at 6:51 AM, Karim Alkhayer wrote:
> Hello All,
>
>
>
> Are there any benchmarks with respect to performance with respect to
> ASM over OCFS2 vs. standard locally managed tablespaces?
>
> In our environment, data files
Bruce MacDonald wrote:
> I'm playing around with OCFS2 as a local filesystem, as an alternative
> to using XFS for a multi-terabyte database. Using OEL 5.2 from the
> demo DVD, I download the latest OCFS2 1.4 packages and format the
> filesystem, but when I try to mount it, I get a "file not fo
at 13:41, Sunil Mushran wrote:
>
>> Bruce MacDonald wrote:
>>> I'm playing around with OCFS2 as a local filesystem, as an
>>> alternative to using XFS for a multi-terabyte database. Using OEL
>>> 5.2 from the demo DVD, I download the latest OCFS2 1.4 packa
No, this is not configurable. We have to fence else the processes will hang.
From your description it appears it is rebooting because the hb ios are not
completing within the timeout. What is your current setting?
O2CB_HEARTBEAT_THRESHOLD in /etc/sysconfig/o2cb.
Mehmet Can ÖNAL wrote:
>
> *Hi ev
bug#6802646
The same fix is available in 1.2.9-1.
http://oss.oracle.com/projects/ocfs2/news/article_18.html
The bugzilla#1023 describes the problem. The above fix helps
but does not fix the core issue entirely. We are working on
further improving inode allocations. But the reality is, as
long as w
T deny SUPER off;
> [r...@oeltest ~]# tail -3 /var/log/messages
> Feb 11 23:37:13 OELTEST kernel: sdb: Write Protect is off
> Feb 11 23:37:13 OELTEST kernel: SCSI device sdb: drive cache: write back
> Feb 11 23:37:13 OELTEST kernel: sdb: sdb1
>
> Any ideas? As I said before, th
eason i asked this
> question. Is there any tip or tricks that you would give?
>
>
>
>
>
>
>
>
>
>
>
>
>
>
> -Original Message-
> From: Sunil Mushran [mailto:sunil.mush...@oracle.com]
> Sent: Wednesday, February 11, 2009 7:36 PM
> To: Mehmet Can ÖNAL
> Cc: ocfs2-users@oss.
$ tunefs.ocfs2 -N /dev/sdc1
Check the man pages for mkfs.ocfs2 and tunefs.ocfs2. You can also
refer to the user's guide and the faq.
Carl J. Benson wrote:
> Hello.
>
> I'm trying to add a fifth openSUSE 11.1 node to my existing
> cluster of four nodes. My software is:
>
> ocfs2-tools-1.4.1-6.9
t; Thanks again for your help. Still nothing in dmesg.
>
> I did get 2 trace files, which I'm sending you separately.
>
> Cheers,
>Bruce
>
> Sunil Mushran wrote:
>> Do:
>>
>> $ dmesg -n8
>> $ debugfs.ocfs2 -l ENTRY EXIT SUPER all
Andreas Dilger wrote:
> On Feb 20, 2009 20:23 +0300, Kirill Kuvaldin wrote:
>> I'm evaluating different cluster file systems that can work with large
>> clustered environment, e.g. hundreds of nodes connected to a SAN over
>> FC.
>>
>> So far I looked at OCFS2 and GFS2, they both worked nearly th
Kirill Kuvaldin wrote:
> What are the practical/theoretical limits for number of nodes for
> shared disk file systems like ocfs2/gfs2?
Theoretical limit is around 254 or so. Practical limit depends on
the hardware. Meaning, you cannot just add nodes. You have to
ensure the interconnect and the sto
On Mon, Mar 09, 2009 at 11:11:13AM -0600, Bret Palsson wrote:
> Can I have more than 255 nodes specified in the cluster.conf?
No. 255 is the limit.
>
> I need to have 255 nodes on one volume which I have already setup, but
> I have 3 more volumes that won't be connected by any of those 255
>
Known issue. We have a potential fix for it. It is in testing.
How often do you hit this?
On Mon, Mar 09, 2009 at 06:52:57PM -0400, Ward Fenton wrote:
>We have been experiencing unplanned outages on a subset of the
>clustered systems we have deployed to support SAP. The following
>cap
Impossible to determine the cause with what you have provided. File a
bugzilla and attach messages from all nodes. No exceptions. If you have
netconsole setup (you should) attach those logs. That way we'll know if
the nodes oopsed and if so what the stack was.
Sunil
On Fri, Mar 13, 2009 at 03:01:
Setup a netconsole server to catch the oops trace. Have you set
/sys/kernel/panic_on_oops to 1?
On Tue, Mar 24, 2009 at 06:09:22PM +0200, Cristian Gae wrote:
> Hello
>
> We have a 9 nodes ocfs2 cluster used for http serving.
>
> Sometimes when we want to reboot one of the nodes, it happens to ke
Setup netconsole to catch the oops log.
The message you have provided shows the node death detection
only. Not the cause of the the node death. The netconsole log of the
oopsed node will tell us as to why it oopsed.
Christoph Ackermann wrote:
> Hello.
>
> We used a ten host cluster for a vmware-s
umount is supposed to stop the heartbeat. In bz1053, ocfs2_hb_ctl was
segfaulting.
Are you seeing any segfaults or any other errors during umount?
Also, run the following before and after umount:
$ ocfs2_hb_ctl -I -d /dev/sdX o2cb
Email me the output.
Nikola Ciprich wrote:
> Hello Tao,
> and th
_ctl -I -d /dev/vgshared/lvs
> 2A5D351D0A934061BBC6B5392A30187E: 1 refs
> [r...@vbox4 ~]# umount /home/LVS
> [r...@vbox4 ~]# ocfs2_hb_ctl -I -d /dev/vgshared/lvs
> 2A5D351D0A934061BBC6B5392A30187E: 1 refs
>
> nik
>
> On Fri, Apr 03, 2009 at 10:21:33AM -0700, Sunil Mushran wrote:
>
>> umount i
t - the binary is there...
> n.
>
> On Fri, Apr 03, 2009 at 02:27:34PM -0700, Sunil Mushran wrote:
>> Do:
>> $ cat /proc/sys/fs/ocfs2/nm/hb_ctl_path
>>
>>
>> Nikola Ciprich wrote:
>>> Hi Sunil,
>>> thanks for reply..
>>> I don't o
; Anyways Sunil thanks a lot for Your help!
>
> On Sun, Apr 05, 2009 at 07:31:52AM -0700, Sunil Mushran wrote:
>
>> Email me the ouput of:
>> $ mounted.ocfs2 -d
>>
>> Also, does hb stop using uuid work?
>> $ ocfs2_hb_ctl -K -u o2cb
>>
>> Lastly,
Yes. This issue has nothing to do with taf or asm. You are seeing
transient EIOs during archiver writes. Follow Srini's suggestion
of filing a SR and then pinging him.
The other issue about instances altering the service names is
unrelated. Atleast we should consider it unrelated as they have
noth
s version 95, which doesn't
> create them for me.
> Version 127 I used for tests already creates them correctly..
> BR
> nik
>
> On Mon, Apr 06, 2009 at 12:08:34PM -0700, Sunil Mushran wrote:
>
>> AFAIK, this is not an issue on (rh)el4/sles9. It could be that
File a bugzilla (oss.oracle.com/bugzilla) for this issue. Attach
the stack trace.
Also attach the output of the following.
$ find /lib/modules/`uname -r`/kernel/fs/ocfs2 -name \*.ko -exec objdump
-DSl {} >/tmp/ocfs2.syms \;
Sunil
Konstantin Tikhonov wrote:
> Нi,
> I have a cluster with 5 nodes
Please file a bugzilla. Add this stack trace to it.
http://oss.oracle.com/bugzilla
Also add any detail about your environment that you feel
could be relevant. Size of cluster, number of mounts, etc.
Thanks
Sunil
James Masson wrote:
> Hi list,
>
> We've just had a kernel panic/reboot on one of ou
While a node can only be in one cluster, it can be in many
different dlm domains (or lock name spaces), each of which
can have a different collection of nodes. But in the end,
the sum total of all nodes in all domains will still part
of one cluster.
What are you trying to do?
Sunil
Søren Kröger
Søren Kröger wrote:
> I'm trying to split up our big OCFS2 filesystem into 3 separate LUN's,
> since there are only a limited amount of nodes which need access to
> the different parts of the OCFS2 filesystem.
> One "Master" server with RW access should still be able to mount all 3
> OCFS2 LUN's
ave mounted the particular filesystem?
>
> Søren
>
> On 28/04/2009, at 22.21, Sunil Mushran wrote:
>
>> Søren Kröger wrote:
>>> I'm trying to split up our big OCFS2 filesystem into 3 separate
>>> LUN's, since there are only a limited amount of nodes which
McKinley, Reid wrote:
>
> We have installed OCFS2 1.4.1 and for some reason we can only get the
> mount point mounted on 1 of 2 nodes. The 2^nd node shows that the
> heartbeat is not active.
>
>
>
> [r...@nyclx2 ~]# service o2cb status
>
> Driver for "configfs": Loaded
>
> Filesystem "configf
What does "mounted.ocfs2 -d" say on both nodes?
Not /var/log/dmesg. It is /var/log/messages. You could instead
run "dmesg". This is important as it will tell you why the
mount failed.
Sunil
McKinley, Reid wrote:
> Thank you!
>
> Everything appears to be fine then, except that we cannot mount an
pr 29 12:01:17 nyclx2 kernel: ocfs2: Unmounting device (8,0) on (node
> 255)
>
> Thanks again,
> Reid
>
> -Original Message-
> From: Sunil Mushran [mailto:sunil.mush...@oracle.com]
> Sent: Wednesday, April 29, 2009 4:32 PM
> To: McKinley, Reid
> Cc: ocfs2-users@oss.
Andrew (Anything) wrote:
> Ive been testing using bonnie++ -n 50:1024:0:10 -s 0. Is this a bad way to
> test?
> Some raw results follow later in case you want them.
>
> Obviously ocfs2 should be slower than ext3.
> But I guess I expected a single node ocfs node to be only doing internal
> stuff wit
http://oss.oracle.com/~smushran/.debug/scripts/stat_sysdir.sh
Please file a bugzilla and attach the output of the above script.
http://oss.oracle.com/bugzilla
I think I know the issue. No, it is not related to blocksize/clustersize.
Sunil
Nigel Bishop wrote:
>
> Afternoon,
>
> We have an ocfs2
Gavin Hamill wrote:
> ocfs2 1.4 has a maximum of 32000 files in any single directory - we got
> bitten by this bug recently. If you're talking about 5 million files,
> then is there's a possibility you've encountered this limit?
Incorrect. The limit is 32000 sub-dirs in a directory. There is
no sp
Did you run fsck with the force flag?
$ fsck.ocfs2 -f /dev/sdX
By default, fsck only replays the journals.
Paul Taylor wrote:
> Hi
>
> errors like the one listed below have been coming through in our logs on
> a daily basis. We tried to run fsck.ocfs2 over the file system bet it
> thinks that i
Damon Miller wrote:
> We're running a 3-node OCFS2 1.2.9 cluster with a 5-TB iSCSI block device as
> the backing store. All machines are running CentOS, with the iSCSI target
> running CentOS 5.2 and the initiators running CentOS 4.7. The purpose of the
> cluster is to evaluate alternatives to
Damon Miller wrote:
> The two servers are actually connected to the same switch. We are using
> iptables for basic packet filtering on all of our hosts, but TCP/ is open
> on all machines participating in the cluster. iSCSI is also enabled on
> TCP/3260. Here are the relevant excerpts fro
Christian van Barneveld wrote:
> Our OCFS2 cluster has been stable for approx 8 months, but since this week it
> went wrong. First we had high load problems. We had this problem because a
> couple of directories got filled with files, 1 directory over 1,5 milion
> files (symlinks) and NFS (mount
McKinley, Reid wrote:
>
> We are trying to figure out why our OCFS2 mount points will not
> automatically mount on reboot.
>
> We are on OEL version 2.6.18-92.el5 and we are using multipathing.
>
> Here are the OCFS2 entries in /etc/fstab.
>
> /dev/dm-2 /oracw oracle_clusterware datavolume,nointr,
Christian van Barneveld wrote:
> No, I don't have the full output, but I still have the snapshots that I've
> made before teh FSCK. I've mounted it at a different server and ran a
> (readonly) FSCK. See attached output.
The output shows i/o errors. It is unable to read the blocks beyond
a certai
Brian Kroth wrote:
> Luis Freitas 2009-05-20 10:46:
>>I am not aware of any filesystem that can withstand a online fsck.
>>Sun ZFS can do online correction, but it doesnt have a fsck tool.
> I hear btrfs will support this. It may be a feature that's easier to
> accomplish with copy on wri
Brian Kroth wrote:
> That's what's always held me back from doing this as well. Will the
> common stack be the openais stack (ie: the so called user stack), the
> o2cb stack, or something completely different?
Currently the o2cb userspace clusterstack is directed towards supporting
the native clu
Use sles or (rh)el. It won't work on other distros.
On May 24, 2009, at 3:43 PM, sundar mahadevan wrote:
> To add some more details to this issue:
>
> /u01/oradata/orcl/orcl/cmquorumfile mentioned in the error message is
> accessible from the shell prompt. I even checked the file
> $ORACLE_HOME/
The ocfs2 kernel driver is missing. Read the user's guide or the FAQ
to learn how to install the driver.
On May 25, 2009, at 8:16 AM, Devender Narula
wrote:
>
> HI Team
>
> When ever i try to configure o2cb . i get below mention proble.. its
> a production installation and your quick help
>
> ocfs2console-1.4.1-1.el5
>
> ocfs2-tools-1.4.1-1.el5
>
> [r...@eregtest2 software]# uname -a
>
> Linux eregtest2.admin.abdn.ac.uk 2.6.18-92.el5 #1 SMP Tue Apr 29
> 13:16:15 EDT 2008 x86_64 x86_64 x86_64 GNU/Linux
>
>
> --- On Mon, 5/25/09, Sunil Mushran wr
There are no known issues with using ocfs2 on a jumbo frame enabled
private network.
On May 26, 2009, at 8:38 AM, Sridhar Avantsa wrote:
> When using OCFS2 in a Oracle RAC set, one would configure OCFS2 to
> use the private interconnect address ( in cluster.conf).
> Are there any known issue
Refer to the section on oracle rdbms in the 1.4 user's guide.
Specifically mount options.
On May 27, 2009, at 4:55 AM, Devender Narula
wrote:
>
> Hi team
>
> I had installed OCFS2 on RHEL5.0 . every thing looks fine but when I
> was installing CRS on the node I got error message OCFS2 is n
kernel version, ocfs2 version?
$ uname -a
$ modinfo ocfs2
$ rpm -qa | grep ocfs2
Kees Hoekzema wrote:
> Hello List,
>
> At the moment I'm running a 7-node ocfs2 cluster on a Dell MD3000i (iscsi)
> NAS. This cluster has run fine for well over a year now, but recently one of
> the older and more u
Kernel: 2.6.26-1-amd64 x86_64
>
> modinfo ocfs2:
> version:1.5.0
> description:OCFS2 1.5.0
> srcversion: B19D847BA86E871E41B7A64
> vermagic: 2.6.26-1-amd64 SMP mod_unload modversions
>
> ocfs2-tools:
> Version: 1.4.1-1
>
> Tia,
> Kees Hoekzema
You are using ocfs2 atop lvm - a non-cluster-aware volume manager.
A lot of things can go wrong in this combination. Quite a few have
been reported on this forum.
debugfs.ocfs2 has commands dump and rdump that allows users to
read the files directly off the disk. Use it to recover your data.
khai
jhonyl wrote:
> I am trying to figure out about locking under ocfs2.
>
> I read in the 1.4 ocfs2 pdf doc file that ocfs2 1.4 support flock but
> not fcntl locks, and in a message, I read that ocfs2 rely on vfs for
> fcntl, and I read something about being able to get fcntl locks but not
> with o
jhonyl wrote:
> If I will add pacemaker or cman package to my current OS, since
> cluster fcntl is probably a new feature, is there a minimum version
> number for fcntl to be supported?
The support for clustered fcntl in ocfs2 was added in 2.6.27.
___
jhonyl wrote:
> Thanks for your reply.
>
> Good to know... time to upgrade the kernel.
>
> Is there a minimum required pacemaker or cman version too?
> With pacemaker does it need heartbeat or openais for this? or either?
> And if it maters what is the minimum version required for openais or
> hea
khaije rock wrote:
> 90% of my way through the recovery and it turns out that each of the
> more volume-wide rdump attemps were chocking at the same specific
> point: a symlink pointing to a directory on a different filesystem
> that then descended back into the ocfs volume.
>
> Looking like thi
McKinley, Reid wrote:
>
> Ever since we have installed OCFS2, we have had extremely slow
> performance in the ocfs2console. It can take us over 30 minutes to do
> the simplest tasks.
>
>
>
> We do not have this type of performance with other xwindows
> applications on our server.
>
>
>
> D
McKinley, Reid wrote:
> No, "mounted.ocfs2 -d" comes back in less than 5 seconds.
>
>
You said it taking 30 mins to do any action. Can you expand
on that? As in, possibly walk us through your steps.
___
Ocfs2-users mailing list
Ocfs2-users@oss.oracle
The connect requests are not getting through. Do you
have any firewalls setup? Is iptables running? If so, either
shut it down or allow traffic on the o2cb port.
McKinley, Reid wrote:
>
> We are having trouble getting the 2^nd node in our 2 node RAC
> configuration to have an active O2CB heartbea
minues. Then, I can select "format".
>
> Let me know if you need further details.
> Thanks,
> Reid
>
> -Original Message-
> From: Sunil Mushran [mailto:sunil.mush...@oracle.com]
> Sent: Wednesday, June 03, 2009 12:48 PM
> To: McKinley, Reid
> Cc: o
Which cluster stack are you using? o2cb or pacemaker?
Shaffin Bhanji wrote:
> I have a 2 node cluster setup but am at a loss as to why I cannot
> mount a shared ocfs2 filesystem on both nodes being shared by iSCSI?
> Am I wrong in understanding that this can be achieved?
>
> I am using OpenAIS und
]# service iptables status
> Firewall is stopped.
>
> -Original Message-
> From: Sunil Mushran [mailto:sunil.mush...@oracle.com]
> Sent: Wednesday, June 03, 2009 12:57 PM
> To: McKinley, Reid
> Cc: ocfs2-users@oss.oracle.com
> Subject: Re: [Ocfs2-users] O2CB heart
ed
> Driver for "ocfs2_dlmfs": Loaded
> Filesystem "ocfs2_dlmfs": Mounted
> Checking O2CB cluster tiaa: Online
> Heartbeat dead threshold = 31
> Network idle timeout: 3
> Network keepalive delay: 2000
> Network reconnect delay: 2000
> Checking O2CB
> node:
> ip_port =
> ip_address = 192.168.0.217
> number = 1
> name = nyclx2
> cluster = tiaa
>
> cluster:
> node_count = 2
> name = tiaa
>
> -Original Message-
> From: Sunil Mushran [mailto:sunil.mush...@or
Actually, it is not complex.
o2cb timeouts: If not using multipathing/netbonding, leave the timeouts
as it. If using multipathing, double the disk hearbeat to 120 secs.
If using netbonding, double the network idle to 60 secs. Ensure your
private network has no loops to prevent spanning tree protoc
Are you sure you are mounting the same volume on both nodes?
Do on both nodes:
debugfs.ocfs2 -R "stats" /dev/sdX | grep UUID
It should be the same.
Ensure you don't have any local iscsi caching enabled.
Bengtsson Anders wrote:
>
> I’m trying to mount a ocfs2 volume (created on sles11) on my sle
florian.engelm...@bt.com wrote:
> We tried to use ocfs2 with Vserver clustered with Heartbeat. But
> Vservers need barrier=1. That did not work on our shared storage with
> ocfs2 but I guess this is no ocfs2 problem it is a device mapper problem
> because we need to use multipath and LVM2, isn't it
Sure. One can use ocfs2 to host almost anything. The one exception
is the crs_home. crs_home needs to be on a local volume.
OCFS 1.2/1.4 has two limits. Like ext3, the number of sub-directories in _a_
directory cannot exceed 32000. (There is no limit to the number of subdirs
in a volume.) The othe
ensure iptables is either off or has rules for the interconnect traffic.
use tcpdump to see if packets are coming thru. The connect request
is initiated between two nodes is initiated when both of them first
mount a common volume. Also, the connect request is always from
the higher node to the low
There is another scheme (less elegant but probably quicker to deploy) that
was described in this list by a user.
As root run blkid. Then edit /etc/blkid.tab and remove all sd devices that
correspond to the emcpp devices. Ensure pp is enabled. Rerun blkid. This
time you should see the pp devices in
iranha which invokes
> iptables. What should I add to iptables to enable interconnect
> traffic?
>
> TIA
>
> John
>
> On Wed, 2009-06-10 at 10:18 -0700, Sunil Mushran wrote:
>> ensure iptables is either off or has rules for the interconnect
>> traffic.
>>
ree working nodes in order to run /etc/init.d/o2cb offline on each one
> followed by /etc/init.d/o2cb start?
>
> Best Regards
>
> John
>
> On Thu, 2009-06-11 at 07:33 -0700, Sunil Mushran wrote:
>
>> Add a rule to allow traffic on port (or whatever it is
Please file a bugzilla in oss.oracle.com/bugzilla.
Saul Gabay wrote:
>
> We have a 2 node OCFS2 cluster running Oracle 10g, both nodes crashed.
>
>
>
> Node 1 because it panic running IOSTAT, the second node crashed with
> this error message you can see below.
>
>
>
> I was hoping to see a ne
All,
We are pleased to announce the release of OCFS2 1.4.2-1 and OCFS2 Tools
1.4.2-1 for Oracle's and Red Hat's Enterprise Linux 5 Update 2 and higher.
Oracle's Unbreakable Linux Network users who are subscribing to the "OCFS2
1.4 packages for Enterprise Linux 5" channel can upgrade to this relea
Please file a bugzilla and _attach_ this oops trace. Also mention all
the version numbers.
On Jun 17, 2009, at 2:30 AM, "McDonald, Stuart" > wrote:
Hi
We have a two-node RAC cluster, which uses ASM for the database
storage, but is using OCFS2 to mount a couple of file systems for a)
th
Brian Kroth wrote:
> Sunil Mushran 2009-06-16 16:38:
>
>> LOOKING AHEAD
>>
>> We are aiming to release OCFS2 1.6 later this year. This release will
>> include the features that we have worked on over the past year. These are:
>>
>> 1. Extended Attrib
Senmiao Chen wrote:
> Whenever I tried to mount an OCFS2 volume I got the following message,
> "ocfs2_fill_super:1016 ERROR: User quotas were requested, but this
> filesystem does not have the feature enabled." Just wonder how to enable
> this feature. The ocfs2 wiki says "you'll need support in
ocfs2 is a shared disk cluster file system. All nodes need to have
access to the disk. Typical technologies involved are fiber channel
and iscsi. If you are in a virtual environment, then you could present
a local device as shared to multiple guests.
On Jun 24, 2009, at 2:51 AM, "sheri...@ho
Not sure why you think it is trying to connect to itself. o2net
connects to other nodes only.
Raheel Akhtar wrote:
>
> Hi,
>
> I have OCFS2 Cluster of 5 nodes running on RHEL 5.2 (kernel version
> 2.6.18-128.1.10.el5). I am getting error like
>
> Jun 24 09:26:54 alf2 kernel: (2095,0):o2net_connec
Do you have a separate network path for drbd traffic? If you do
not, then you are probably overloading the network. In this case,
I believe drbd is unable to replicate the ios fast enough and thus
is blocking the o2cb disk heartbeat. One workaround is to increase
the O2CB_HEARTBEAT_THRESHOLD to mor
The nodes are not frozen. The processes that are attempting to talk
to the "disconnected" node are waiting for that node to reply, failing
which, to die. The default timeout for the disk heartbeat is 60 secs.
If that node simply died, the other nodes would have deemed the node
dead after 60 secs,
logs.
Kris Buytaert wrote:
> On Wed, 2009-06-24 at 12:02 -0700, Sunil Mushran wrote:
>
>> Do you have a separate network path for drbd traffic? If you do
>> not, then you are probably overloading the network. In this case,
>> I believe drbd is unable to replicate the
s:
> Jun 24 11:43:01 node5 kernel: [855031.567140]
> (20663,7):dlm_send_remote_unlock_request:359 ERROR: status = -107
> Jun 24 11:43:01 node5 kernel: [855031.567140]
> (20663,7):dlm_send_remote_unlock_request:359 ERROR: status = -107
>
> The problem is that I couldn't acce
Explain "repository built by OCFS2 1.4.x"? What is this repository?
OCFS2, the file system, is architecture neutral. Meaning, it works
across 32-bit, 64-bit, little endian and big endian boxes. One can
mount an ocfs2 volume concurrently on x86, x86_64, ia64 and ppc64
nodes. They all have to be Lin
to their repository. Having
said that, a 64-bit system does not necessarily imply 64-bit apps.
The apps itself could be 32-bit.
Sunil Mushran wrote:
> Yes.
>
> But do read the notes section in the ocfs2 user's guide. It talks
> about having nodes with different compute power in a cl
ng documents on Repository which
> is mounted with OCFS2.
>
> That mean I can add 64bit Red Hat node in current 32 bit OCFS2 cluster?
>
> Thanks
>
>
>
> -Original Message-
> From: Sunil Mushran [mailto:sunil.mush...@oracle.com]
> Sent: Thursday, J
Setup netconsole to trap the logs. Once you have it then file a
bugzilla and attach the logs of all 5 nodes.
On Jul 4, 2009, at 7:46 PM, Raheel Akhtar wrote:
> Hi,
>
>
>
> I am using OCFS2 1.4.2-1 for RedHat Linux 5.2 64 bit
> (2.6.18-128.1.6.el5) for 5 nodes. I notices sometime system just
Google it.
On Jul 4, 2009, at 7:58 PM, Raheel Akhtar wrote:
> Hi Sunil,
>
> Do you mind to send me link how to setup netconsole on RedHat I
> never did
> before.
> Highly appreciated.
>
> Raheel
>
>
> -----Original Message-
> From: Sunil Mushran [mai
Fixed. Details in http://oss.oracle.com/bugzilla/show_bug.cgi?id=914
syla...@aim.com wrote:
>
> Hi,
>
> On kernel 2.6.30 (and I have upgraded drbd there too to 8.3.2) I have
> nothing in the logs, and the umount hangs, and after a few minutes the
> whole computer hangs, and I have to hard r
The fix was for the oops you saw.
The hang is a different issue. We have no info on that.
For that, if you would like to diagnose the problem, read up the dlm notes
in the 1.4 user's guide. It explains a debugging process vis-a-vis hangs.
If the issue is dlm related, then we would like to have t
a noob, so I don't
> even know if it is a bug, or a misconfiguration, or a misunderstanding.
>
> PS. Is nodiratime option supported for mounts? I used it, but I don't
> see it in the user-guide.
>
> -Original Message-
> From: Sunil Mushran
> To: sylarrr
Please do remember to file a bugzilla. Once it is fixed, add the git
commit details to it.
On Jul 15, 2009, at 4:15 AM, Tiger Yang wrote:
> Hi, Marco,
>
> Thanks a lot, it is really a bug in ocfs2 acl. I can reproduce it now
> and find the cause. I will send a patch to fix it after done some
This patch should be in ocfs2-1.4. It is not in ocfs2-1.2. Ignore the
kernel version. Both 1.2 and 1.4 work on el5.
On Jul 15, 2009, at 3:41 AM, Kevin Clark
wrote:
> I've run into a problem mounting an OCFS2 filesystem on a DRBD
> device. I think it's the same one discussed at
> http://l
Please file a bugzilla @ oss.oracle.com/bugzilla
Attach this to it.
On Jul 15, 2009, at 6:37 PM, Marco Huang
wrote:
> -BEGIN PGP SIGNED MESSAGE-
> Hash: SHA1
>
> Hi Tiger,
>
> I am also exporting the ocfs2 file system via nfs (with acl) to other
> servers. I am getting the following k
Please file a bugzilla and attach the netconsole logs of all six nodes.
The messages provided indicate that that node saw the two nodes
become unresponsive. As to why they became unresponsive will be
known only after we see the netconsole logs of the two nodes.
Raheel Akhtar wrote:
>
> Hi,
>
>
ocfs2_stackglue not found error message is harmless.
We use the same init script for all versions of the fs stackglue
is present in the current mainline and will be in ocfs2 1.6.
Raheel Akhtar wrote:
>
> Hi,
>
> When system booting getting error message “modprobe: FATAL: Module
> ocfs2_stackg
11:09:45 alf1 kernel: ocfs2_dlm: Nodes in domain
> ("7BE7E9E2026A40F8801B56257D805C88"): 0 1 2 3 4 5
> --
>
>
>
>
> -Original Message-
> From: Sunil Mushran [mailto:sunil.mush...@oracle.com]
> Sent: Wednesday, July 29, 2009 1:25 PM
"device busy" could be because you have a shell having that as the cwd.
Check:
ls -l /proc/[0-9]*/cwd
Georg Höllrigl wrote:
> Hello,
>
> I've several LUNs mounted in a 7 node cluster - one LUN which is only used on
> 4 of the nodes.
>
> It's impossible to umount this LUN - I'm always getting devi
Peter W. Morreale wrote:
> Hi all,
>
> I'm trying to determine the performance implications of various
> configurations for ocfs2. (I'm new to ocfs2, but have read through all
> the docs for both 1.2 and 1.4, so please be gentle :) This would be a
> 1.4 installation.
>
> I searched through www.
801 - 900 of 1424 matches
Mail list logo