Re: [Ocfs2-users] Did anything substantial change between 1.2.4 and 1.3.9?

2008-04-21 Thread Tao Ma
Hi Mike, Are you sure it is caused by the update of ocfs2-tools? AFAIK, the ocfs2-tools only include tools like mkfs, fsck and tunefs etc. So if you don't make any change to the disk(by using this new tools), it shouldn't cause the problem of kernel panic since they are all user space

Re: [Ocfs2-users] Did anything substantial change between 1.2.4 and 1.3.9?

2008-04-21 Thread Tao Ma
a network connection is considered dead. O2CB_IDLE_TIMEOUT_MS=1 # O2CB_KEEPALIVE_DELAY_MS: Max time in ms before a keepalive packet is sent O2CB_KEEPALIVE_DELAY_MS=5000 # O2CB_RECONNECT_DELAY_MS: Min time in ms between connection attempts O2CB_RECONNECT_DELAY_MS=2000 On 4/21/08, Tao Ma

Re: [Ocfs2-users] Unable to start cluster with one node

2008-05-12 Thread Tao Ma
Hi David, David Coulson wrote: This is probably a stupid question, but here we go. I have two boxes running RHEL4U6 with DRBD mirroring disk between them. DRBD is setup in active/active mode, and seems to be working nicely. I have OCFS2 filesystems build on the DRBD devices, and normally

Re: [Ocfs2-users] Unable to start cluster with one node

2008-05-12 Thread Tao Ma
Hi David, David Coulson wrote: Hi Tao, nt a file system without any change in the configuration. So you may try to mount it. If there is any problem, please paste the error message here. Thanks. I tried to create a filesystem on a unused DRBD block device... mkfs.ocfs2 seemed to work

Re: [Ocfs2-users] Problems building ocfs2 rpm on Fedora 9

2008-06-29 Thread Tao Ma
Hi Tina, datavolume is only used for ocfs2-1.2.* and ocfs2-1.4.* in the future if I am correct. It is oracle specific, so the main kernel doesn't have this mount option. Regards, Tao Tina Soles wrote: Thanks! I installed the tools rpm and the console as well. I've successfully

Re: [Ocfs2-users] ocfs2 kernel BUG

2008-08-01 Thread Tao Ma
Hi, Please provide the detail info of ocfs2 version which may be helpful for diagnose. Peter Selzner wrote: Hi, we had this entries in /var/log/messeges a few days ago: Jul 28 23:30:47 xxx kernel: (12268,2):ocfs2_extend_file:790 ERROR: bug expression: i_size_read(inode) !=

Re: [Ocfs2-users] ocfs2 node reboot method

2008-08-05 Thread Tao Ma
Hi, Masanari Iida wrote: Hello list, There is a 14 node OCFS2 cluster. When I reboot all 14 nodes at once, some node failed to mount the ocfs2 filesystem while rebooting. The mount is supposed to be done by /etc/fstab. The symptom is happened on randam node. I would like to know if

Re: [Ocfs2-users] ocfs2 node reboot method

2008-08-05 Thread Tao Ma
Masanari Iida wrote: On Tue, Aug 5, 2008 at 5:43 PM, Tao Ma [EMAIL PROTECTED] wrote: An error message I saw was mount.ocfs2: Transport endpoint is not connected while mounting /dev/EXTDISK/OCFS2 Interesting. Have you update ocfs2 in some nodes? Normally it happens when

Re: [Ocfs2-users] Enable mlog() messages

2008-08-05 Thread Tao Ma
Masanari Iida wrote: Hello again, I looked into the source and found the error message Transport endpoint is not connected could be came from ENOTCONN in tcp.c. There are multiple ENOTCONN, so I would like to know which one produce my message. I want to enable mlog(). My understanding

Re: [Ocfs2-users] ocfs2 node reboot method

2008-08-06 Thread Tao Ma
Hi, Masanari Iida wrote: Hello Tao and Sunil, ] My case, the symptom (ocfs2 failed to mount a volume using /etc/fstab) happend when I reboot the system. Even if it failed to mount (by /etc/fstab), I can mount it later after I login the system. So it could be some kind of timing issue.

Re: [Ocfs2-users] New node..new problems

2008-10-09 Thread Tao Ma
Hi, Dante Garro wrote: Sunil, now I fall in count of messages are related to node 0, but the new is node 1 and does not care about the value I've setup allways says 14000 ms. Do this change your diagnostic? Node1 start connection with node0, so you see the messages related to node0 on node1.

Re: [Ocfs2-users] New node..new problems

2008-10-10 Thread Tao Ma
Dante Garro wrote: Thanks Tao, I've setup the same on both nodes and the cluster becomes online. Now, when I try to mount the following errors appears on node 1 (new CentOS): (2512,1):o2net_connect_expired:1585 ERROR: no connection established with node 0 after 30.0 seconds, giving up and

Re: [Ocfs2-users] OCFS2: ERROR (device sdh1): ocfs2_direct_IO_get_blocks

2009-03-01 Thread Tao Ma
Hi Daniel, Daniel Keisling wrote: Patch was here: http://oss.oracle.com/pipermail/ocfs2-devel/2008-September/002787.html yes, that patch has been merged into ocfs2-1.4 and should be ready for the next release. Also as Joel said, If you have the appropriate support, you should call support and

Re: [Ocfs2-users] OCFS2 fencing

2009-03-12 Thread Tao Ma
Hi ramya, ramya tn wrote: Hi All, One of our system fenced by itself few days back and this has been happening very frequently from many days. But unfortunately, we aree not able to stop the system fencing as we are not sure what is causing this. The error i found out in log file

Re: [Ocfs2-users] problem stopping o2cb service on one of nodes

2009-04-02 Thread Tao Ma
Hi Nikola, Nikola Ciprich wrote: Hi, I'm trying ocfs2 RHEL5 distro, 2.6.29 kernel, ocfstools-1.4.1. I'm using DRBD in primary/primary mode as shared storage... I've configured the service according to quickstart document, and everything works, but when I umount fs on both nodes,

Re: [Ocfs2-users] ocfs2 vs ext3?

2009-04-29 Thread Tao Ma
Andrew (Anything) wrote: Hi Andrew, I just checked max-features, it doesn't include local which means that you still need to create dlm lock in your local node which will cost some delay. You can check whether your volume enable local by command echo 'stats'|debugfs.ocfs2

Re: [Ocfs2-users] another node is heartbeating in our slot

2009-05-05 Thread Tao Ma
Hi sundar, sundar mahadevan wrote: Hi members, Newbie. Help pls. My setup: system 1: opensuse 11.1 with iscsitarget (secondary hard drive with logical volume) + ocfs2 system 2: opensuse 11.1 with open-iscsi (detects the logical volume on system 1) + ocfs2 1) mount -t ocfs /dev/sdb

Re: [Ocfs2-users] ocfs2 fencing with multipath and dual channel HBA

2009-06-08 Thread Tao Ma
Hi Florian, florian.engelm...@bt.com wrote: Hi Tao, Hi florian, florian.engelm...@bt.com wrote: Florian, the problem here seems to be with network. The nodes are running into network heartbeat timeout and hence second node is getting fenced. Do you see o2net thread consuming 100% cpu

Re: [Ocfs2-users] ocfs2 fencing with multipath and dual channel HBA

2009-06-08 Thread Tao Ma
Hi Florian, florian.engelm...@bt.com wrote: Hi Tao, Hi Florian, florian.engelm...@bt.com wrote: Hi Tao, Hi florian, florian.engelm...@bt.com wrote: Florian, the problem here seems to be with network. The nodes are running into network heartbeat timeout and hence second node is

Re: [Ocfs2-users] enable acl option for ocfs2

2009-06-18 Thread Tao Ma
Hi Marco, Marco Huang wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Hi, I am setting up two nodes ocfs2 on debian lenny (2.6.26-1-amd64), but ocfs2 doesn't come with posix acl by default. Doesn't any one can provide patch for enable acl? acl is added in ocfs2 in 2.6.29. So could

Re: [Ocfs2-users] umount hang + high CPU

2009-07-05 Thread Tao Ma
Hi, Is there something in your system log? I would guess there should be some info there. Regards, Tao syla...@aim.com wrote: Hi, I had a problem where I got a kernel bug in the logs in ocfs2. That happened when I unmounted the volume after a day or two that it was

Re: [Ocfs2-users] ocfs2 acl issue

2009-07-15 Thread Tao Ma
Hi Marco, From the stack it looks that it isn't acl related. So could you please file a bug in http://oss.oracle.com/bugzilla/(just for this, acl is another issue) with all these informations? Thanks. And could you please also do: objdump -DSl /lib/modules/`uname

Re: [Ocfs2-users] git checkout on an ocfs2 filesystem

2009-08-31 Thread Tao Ma
Hi Joel, Joel Becker wrote: On Mon, Aug 31, 2009 at 12:16:36PM -0700, Joel Becker wrote: On Sun, Aug 30, 2009 at 08:19:08PM -0500, Nathaniel Griswold wrote: Has anyone here had problems with git checkouts on ocfs2? Oh, boy, this is wacky. No, it's extra wacky: 5441

Re: [Ocfs2-users] kernel panic - bug in dlmglue.c ?

2009-09-11 Thread Tao Ma
Hi John, John McNulty wrote: Hi, I had a system crash last night. Netconsole caught the following trace dump. Has this one been seen before? This bug is fixed in mainline and should show up in next ocfs2 release. See http://oss.oracle.com/bugzilla/show_bug.cgi?id=1162 Regards, Tao

Re: [Ocfs2-users] core dump

2010-02-24 Thread Tao Ma
Hi Charlie, Charlie Sharkey wrote: Hi, We got this core dump while running the dd command. I haven’t matched The time of the dump with the /var/log/messages file, but I believe it was In response to a cable pull. you are right. I don't have an ocfs2 version for sles, but I guess it

Re: [Ocfs2-users] renaming a OCFS2 cluster

2010-02-24 Thread Tao Ma
Hi Werner, Werner Flamme wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Hi everyone, another problem I did not find a solution for... I ran o2cb configure and configured a cluster named ocfs2. Lazy me, I did not invent a new name. Now this cluster must be renamed to avoid

Re: [Ocfs2-users] No Space left on the device.

2010-03-04 Thread Tao Ma
Hi Aravind, Aravind Divakaran wrote: Hi My ocfs filesystem has 270gb free space. FilesystemSize Used Avail Use% Mounted on /dev/mapper/store 501G 232G 270G 47% /data INode details for ocfs filesystem is FilesystemInodes IUsed IFree IUse% Mounted

Re: [Ocfs2-users] No Space left on the device.

2010-03-04 Thread Tao Ma
Hi Aravind, Aravind Divakaran wrote: Hi, I am facing problem due to free space fragmentation. http://oss.oracle.com/bugzilla/show_bug.cgi?id=1189. In the above link it is mentioned that reducing the slots can solve the issue. Right now i have 4slots. As my ocfs device is configured for

Re: [Ocfs2-users] No Space left on the device.

2010-03-04 Thread Tao Ma
Hi Brad, Brad Plant wrote: Hi Tao, On Fri, 05 Mar 2010 14:33:36 +0800 Tao Ma tao...@oracle.com wrote: Another way is that you can cp the file to another volume, remove it and then cp back. It should be contiguous enough. Assuming we *can* still write to the FS (i.e. as more

Re: [Ocfs2-users] No Space left on the device.

2010-03-04 Thread Tao Ma
Brad Plant wrote: Hi Tao, On Fri, 05 Mar 2010 15:03:50 +0800 Tao Ma tao...@oracle.com wrote: Assuming we *can* still write to the FS (i.e. as more of a preventative action), would the following do the same? cp -a a b mv b a Can the above work as a *hack* online defrag

Re: [Ocfs2-users] No space left on the device

2010-03-17 Thread Tao Ma
Hi Aravind, Aravind Divakaran wrote: Hi All, I have already sent one mail regarding the space issue i am facing with my ocfs filesystem. As mentioned in the below link it is an issue related to free space fragmentation. http://oss.oracle.com/bugzilla/show_bug.cgi?id=1189 I have seen a

Re: [Ocfs2-users] No space left on the device

2010-03-17 Thread Tao Ma
Hi Aravind, Aravind Divakaran wrote: Hi Tao, Hi Aravind, Aravind Divakaran wrote: Hi All, I have already sent one mail regarding the space issue i am facing with my ocfs filesystem. As mentioned in the below link it is an issue related to free space fragmentation.

Re: [Ocfs2-users] No space left on the device

2010-03-18 Thread Tao Ma
Hi Aravind, Aravind Divakaran wrote: Hi Tao, Hi Aravind, Aravind Divakaran wrote: Hi Tao, Hi Aravind, Aravind Divakaran wrote: Hi All, snip After running the tunefs.ocfs2 command i am getting the following error on my console node01#tunefs.ocfs2 -N 2 /dev/mapper/store

Re: [Ocfs2-users] No space left on the device

2010-03-18 Thread Tao Ma
Hi Aravind, Aravind Divakaran wrote: Hi Aravind, Aravind Divakaran wrote: Hi Tao, Hi Aravind, Aravind Divakaran wrote: Hi All, I have already sent one mail regarding the space issue i am facing with my ocfs filesystem. As mentioned in the below link it is an issue related to free

Re: [Ocfs2-users] No space left on the device

2010-03-18 Thread Tao Ma
Hi Aravind, Aravind Divakaran wrote: Hi Tao, Hi Aravind, Aravind Divakaran wrote: Hi Aravind, Aravind Divakaran wrote: Hi Tao, Hi Aravind, Aravind Divakaran wrote: Hi All, I have already sent one mail regarding the space issue i am facing with my ocfs filesystem. As mentioned in

Re: [Ocfs2-users] No space left on the device

2010-03-18 Thread Tao Ma
Hi Aravind, Aravind Divakaran wrote: Hi Tao, Hi Aravind, Aravind Divakaran wrote: Hi Aravind, Aravind Divakaran wrote: Hi Tao, Hi Aravind, Aravind Divakaran wrote: Hi All, I have already sent one mail regarding the space issue i am facing with my ocfs filesystem. As mentioned in

Re: [Ocfs2-users] Compile error on RedHat EL5

2010-05-05 Thread Tao Ma
Hi Kristiansen, On 05/05/2010 04:34 PM, Kristiansen Morten wrote: Hi, I'm trying to compile ocfs2-tools 1.2.7 on a RedHat EL5 kernel 2.6.18-194.el5, but it fails during the make command: make[1]: Entering directory

Re: [Ocfs2-users] Compile error on RedHat EL5

2010-05-05 Thread Tao Ma
: ocfs2-users-boun...@oss.oracle.com [mailto:ocfs2-users-boun...@oss.oracle.com] På vegne av Tao Ma Sendt: 5. mai 2010 11:18 Til: Kristiansen Morten Kopi: ocfs2-users@oss.oracle.com Emne: Re: [Ocfs2-users] Compile error on RedHat EL5 Hi Kristiansen, On 05/05/2010 04:34 PM, Kristiansen

Re: [Ocfs2-users] List of issues resolved by ocfs2 patch

2010-05-14 Thread Tao Ma
Hi Hogas, Hogas Ciprian wrote: Hello guys Where can I see a list of issues resolved by a patch on ocfs2? For example I want to see what problems solve patch OCFS2 version 1.4.1-1. Thanks a lot. You can check the release note. http://oss.oracle.com/projects/ocfs2/news/ It has the

Re: [Ocfs2-users] ocfs2 debug tools

2010-05-30 Thread Tao Ma
Hi Nicola, On 05/30/2010 10:23 PM, Mailing List SVR wrote: Hi Sunil, even with the latest ocfs2 release (1.4.7 on rhel5) I'm having several issues, my systems hang pratically every two days when a lot of small files are deleted (about 200.000 files, 50-120KB each), can you please describe

Re: [Ocfs2-users] O2CB_HEARTBEAT_THRESHOLD won't take changes

2010-05-31 Thread Tao Ma
Hi Elliott, Elliott Perrin wrote: Hello All, I have multiple OCFS2 clusters on SLES10 SP2 running Xen. We needed to increase the O2CB_HEARTBEAT_THRESHOLD from 31 up to 61 and did so successfully on 2 of our 3 clusters. However on one of the three clusters we are not able to

Re: [Ocfs2-users] O2CB_HEARTBEAT_THRESHOLD won't take changes

2010-06-01 Thread Tao Ma
Elliott Perrin wrote: Hello Tao, Hi Elliott, Elliott Perrin wrote: Hello All, I have multiple OCFS2 clusters on SLES10 SP2 running Xen. We needed to increase the O2CB_HEARTBEAT_THRESHOLD from 31 up to 61 and did so successfully on 2 of our 3 clusters.

Re: [Ocfs2-users] OCFS2 performance - disk random access time problem

2010-06-02 Thread Tao Ma
Hi Proskurin, On 06/02/2010 05:23 PM, Proskurin Kirill wrote: On 01/06/10 22:34, Sunil Mushran wrote: The kernel is old. We fixed this issue in 2.6.30. We have also backported it to the 1.4 production tree. The problem was that the inodes being created did not have locality leading to a

Re: [Ocfs2-users] OCFS2 performance - disk random access time problem

2010-06-02 Thread Tao Ma
Proskurin Kirill wrote: On 02/06/10 13:26, Tao Ma wrote: Thank you for reply! It is enough to update kernel or tools need to be updated too? If you only want to use the old formatted volume, updating kernel is enough. But if you want to use some new features we added, better

Re: [Ocfs2-users] OCFS2 performance - disk random access time problem

2010-06-02 Thread Tao Ma
Add Mark Fasheh mfas...@suse.com and Coly Li coly...@suse.de to cc since they know what ocfs2 kernel version SUSE uses. Angelo McComis wrote: On 01/06/10 22:34, Sunil Mushran wrote: The kernel is old. We fixed this issue in 2.6.30. We have also backported it to the 1.4

Re: [Ocfs2-users] 'No space left on device' error with plenty of space.

2010-06-09 Thread Tao Ma
Hi Jason, On 06/09/2010 11:34 PM, Jason Price wrote: And now it's starting to fail again. How about the situation? I checked your stat_sysfs output, it looks that you have spaces for inode, extent alloc and local alloc(but maybe the kernel haven't flushed the metadata to the disk while the

Re: [Ocfs2-users] OCFS2 error.

2010-06-22 Thread Tao Ma
Hi Veeraa, On 06/23/2010 10:46 AM, veeraa bose wrote: Hi Team, Hi Team, we are getting below error in shared disk on VMwares guest operating system. Jun 23 01:46:12 SCRBXLPDEFRM635 kernel: sd 1:0:3:0: reservation conflict Jun 23 01:46:12 SCRBXLPDEFRM635 kernel: sd 1:0:3:0: SCSI error:

Re: [Ocfs2-users] df showing wrong size

2010-06-28 Thread Tao Ma
Hi Garcia, On 06/28/2010 02:17 PM, Garcia, Raymundo wrote: Hello… it was put under my attention that a partition we have in one of our production system was displaying wrong size with df command…. 123 GB… but in fact the size of all the files is a mere 15 GB…. What is going on? Shall we use

Re: [Ocfs2-users] Too much journaling or not ?

2010-07-30 Thread Tao Ma
Hi Somsak, On 07/30/2010 12:54 AM, Somsak Sriprayoonsakul wrote: Hi, (I am in the same team with Mr. Wanchat) Just want to note that we already format OCFS2 with -T mail option. As note below, data=writeback,noatime, and commit interval has been increase already. The weird thing about

Re: [Ocfs2-users] No space left on device

2010-09-08 Thread Tao Ma
Hi all On 09/08/2010 04:11 PM, Alexander Barton wrote: Hi Sunil! Are there special steps one has to follow to recover such a filesystem that has been used with a buggy kernel? We had this problem with a Debian 2.6.27 kernel and updated to a recent „mainline“ kernel 2.6.33.x – but are

Re: [Ocfs2-users] No space left on device

2010-09-08 Thread Tao Ma
Hi, Alexander Barton wrote: Hi Tao! Am 08.09.2010 um 10:53 schrieb Tao Ma: Hi all On 09/08/2010 04:11 PM, Alexander Barton wrote: Hi Sunil! Are there special steps one has to follow to recover such a filesystem that has been used with a buggy kernel? We had this problem

Re: [Ocfs2-users] No space left on device

2010-09-08 Thread Tao Ma
Alexander Barton wrote: Hi Tao! Am 08.09.2010 um 16:22 schrieb Tao Ma: Hi, Alexander Barton wrote: Hi Tao! Am 08.09.2010 um 10:53 schrieb Tao Ma: Hi all On 09/08/2010 04:11 PM, Alexander Barton wrote: Hi Sunil! Are there special steps one has

Re: [Ocfs2-users] No space left on device

2010-09-21 Thread Tao Ma
On 09/21/2010 04:52 PM, Alexander Barton wrote: Hi Tao! Am 09.09.2010 um 02:29 schrieb Tao Ma: btw, I may commit the ocfs2-tools patches recently, and you can try it with 2.6.35. Ok, now we are seeing the problem again and want to try a new kernel and the new OCFS2 tools

Re: [Ocfs2-users] No space left on device

2010-09-29 Thread Tao Ma
On 09/29/2010 05:13 PM, Alexander Barton wrote: Hello again! Am 21.09.2010 um 11:04 schrieb Tao Ma: On 09/21/2010 04:52 PM, Alexander Barton wrote: So kernel 2.6.35.4 would be ok? It should work. And OCFS2 tools from the GIT master branch? Or a special tag? There is no archive

Re: [Ocfs2-users] Journal replay after crash, kernel BUG at fs/ocfs2/journal.c:1700!, 2.6.36

2010-10-29 Thread Tao Ma
Hi Ronald, On 10/29/2010 05:12 PM, Ronald Moesbergen wrote: Hello, I was testing kernel 2.6.36 (vanilla mainline) and encountered the following BUG(): [157756.266000] o2net: no longer connected to node app01 (num 0) at 10.2.25.13: [157756.266077]

Re: [Ocfs2-users] Journal replay after crash, kernel BUG at fs/ocfs2/journal.c:1700!, 2.6.36

2010-10-29 Thread Tao Ma
Ronald Moesbergen wrote: 2010/10/29 Ronald Moesbergen intercom...@gmail.com: 2010/10/29 Tao Ma tao...@oracle.com: Hi Ronald, Hi Tao, Thanks for looking into this. On 10/29/2010 05:12 PM, Ronald Moesbergen wrote: Hello, I was testing kernel 2.6.36 (vanilla

Re: [Ocfs2-users] Journal replay after crash, kernel BUG at fs/ocfs2/journal.c:1700!, 2.6.36

2010-11-01 Thread Tao Ma
Hi Ronald, On 10/29/2010 06:03 PM, Ronald Moesbergen wrote: 2010/10/29 Ronald Moesbergenintercom...@gmail.com: 2010/10/29 Tao Matao...@oracle.com: Hi Ronald, Hi Tao, Thanks for looking into this. On 10/29/2010 05:12 PM, Ronald Moesbergen wrote: Hello, I was testing kernel 2.6.36

Re: [Ocfs2-users] Pb with ocfs2 dlm on Fedora 13

2010-11-08 Thread Tao Ma
Hi Alain, On 11/08/2010 11:08 PM, Alain.Moulle wrote: Hi, I have a problem on Fedora13 with releases : ocfs2 1.4.3-5.fc13.x86_64 dlm_tool 3.0.17 With a 3 nodes ocfs2 cluster, I can't mount FS on the three nodes at the same time but only on two nodes among the 3 nodes , whatever

Re: [Ocfs2-users] Pb with ocfs2 dlm on Fedora 13

2010-11-09 Thread Tao Ma
nodes on IP addr given in cluster.conf. Alain Tao Ma a écrit : Hi Alain, On 11/08/2010 11:08 PM, Alain.Moulle wrote: Hi, I have a problem on Fedora13 with releases : ocfs2 1.4.3-5.fc13.x86_64 dlm_tool 3.0.17 With a 3 nodes ocfs2 cluster, I can't mount FS on the three nodes