Re: [Gluster-devel] Metrics: and how to get them out from gluster

2017-09-01 Thread Xavier Hernandez
Hi Amar, I don't have time to review the changes in experimental branch yet, but here are some comments about these ideas... On 01/09/17 07:27, Amar Tumballi wrote: Disclaimer: This email is long, and did take significant time to write. Do take time and read, review and give feedback, so we c

Re: [Gluster-devel] GlusterFS v3.12 - Nearing deadline for branch out

2017-07-18 Thread Xavier Hernandez
Hi, On 17/07/17 17:30, Pranith Kumar Karampuri wrote: hi, Status of the following features targeted for 3.12: 1) Need a way to resolve split-brain (#135) : Mostly will be merged in a day. 2) Halo Hybrid mode (#217): Unfortunately didn't get time to follow up on this, so will not make it t

Re: [Gluster-devel] geo-rep regression because of node-uuid change

2017-07-07 Thread Xavier Hernandez
On 07/07/17 11:25, Pranith Kumar Karampuri wrote: On Fri, Jul 7, 2017 at 2:46 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: On 07/07/17 10:12, Pranith Kumar Karampuri wrote: On Fri, Jul 7, 2017 at 1:13 PM, Xavier Hernandez mailto:xhernan...@data

Re: [Gluster-devel] geo-rep regression because of node-uuid change

2017-07-07 Thread Xavier Hernandez
On 07/07/17 10:12, Pranith Kumar Karampuri wrote: On Fri, Jul 7, 2017 at 1:13 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: Hi Pranith, On 05/07/17 12:28, Pranith Kumar Karampuri wrote: On Tue, Jul 4, 2017 at 2:26 PM, Xavier Hernandez mailto:x

Re: [Gluster-devel] geo-rep regression because of node-uuid change

2017-07-07 Thread Xavier Hernandez
Hi Pranith, On 05/07/17 12:28, Pranith Kumar Karampuri wrote: On Tue, Jul 4, 2017 at 2:26 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: Hi Pranith, On 03/07/17 08:33, Pranith Kumar Karampuri wrote: Xavi, Now that the change has been revert

Re: [Gluster-devel] geo-rep regression because of node-uuid change

2017-07-04 Thread Xavier Hernandez
2:08 PM, Karthik Subrahmanya mailto:ksubr...@redhat.com>> wrote: On Wed, Jun 21, 2017 at 1:56 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: That's ok. I'm currently unable to write a patch for this on ec. Sunil is working on this patch.

Re: [Gluster-devel] Disperse volume : Sequential Writes

2017-07-04 Thread Xavier Hernandez
o:aspan...@redhat.com>> wrote: I think it should be done as we have agreement on basic design. *From: *"Pranith Kumar Karampuri" mailto:pkara...@redhat.com>> *To: *"Xavier Hernandez&

Re: [Gluster-devel] geo-rep regression because of node-uuid change

2017-06-21 Thread Xavier Hernandez
That's ok. I'm currently unable to write a patch for this on ec. If no one can do it, I can try to do it in 6 - 7 hours... Xavi On Wednesday, June 21, 2017 09:48 CEST, Pranith Kumar Karampuri wrote:    On Wed, Jun 21, 2017 at 1:00 PM, Xavier Hernandez wrote:I'm ok with reve

Re: [Gluster-devel] geo-rep regression because of node-uuid change

2017-06-21 Thread Xavier Hernandez
d similar changes in ec as well. If we are not in agreement, then we will let the discussion progress :-)   Regards,Nithya-- Aravinda  Thanks to all of you guys for the discussions! On Tue, Jun 20, 2017 at 5:05 PM, Xavier Hernandez wrote:Hi Aravinda, On 20/06/17 12:42, Aravinda wrote:I think

Re: [Gluster-devel] geo-rep regression because of node-uuid change

2017-06-20 Thread Xavier Hernandez
. regards Aravinda VK On 06/20/2017 03:06 PM, Aravinda wrote: Hi Xavi, On 06/20/2017 02:51 PM, Xavier Hernandez wrote: Hi Aravinda, On 20/06/17 11:05, Pranith Kumar Karampuri wrote: Adding more people to get a consensus about this. On Tue, Jun 20, 2017 at 1:49 PM, Aravinda mailto:av

Re: [Gluster-devel] geo-rep regression because of node-uuid change

2017-06-20 Thread Xavier Hernandez
Hi Aravinda, On 20/06/17 11:05, Pranith Kumar Karampuri wrote: Adding more people to get a consensus about this. On Tue, Jun 20, 2017 at 1:49 PM, Aravinda mailto:avish...@redhat.com>> wrote: regards Aravinda VK On 06/20/2017 01:26 PM, Xavier Hernandez wrote: Hi P

Re: [Gluster-devel] geo-rep regression because of node-uuid change

2017-06-20 Thread Xavier Hernandez
Hi Pranith, adding gluster-devel, Kotresh and Aravinda, On 20/06/17 09:45, Pranith Kumar Karampuri wrote: On Tue, Jun 20, 2017 at 1:12 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: On 20/06/17 09:31, Pranith Kumar Karampuri wrote: The way geo-replication wo

Re: [Gluster-devel] Self-heal on read-only volumes

2017-06-20 Thread Xavier Hernandez
I remember either Kotresh/Karthik recently sent patches to do something similar. Adding them to check if the know something about this On Fri, Jun 16, 2017 at 1:25 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: Hi, currently it seems that a read-on

Re: [Gluster-devel] Disperse volume : Sequential Writes

2017-06-16 Thread Xavier Hernandez
On 16/06/17 10:51, Pranith Kumar Karampuri wrote: On Fri, Jun 16, 2017 at 12:02 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: On 15/06/17 11:50, Pranith Kumar Karampuri wrote: On Thu, Jun 15, 2017 at 11:51 AM, Ashish Pandey mailto:aspan...@redh

[Gluster-devel] Self-heal on read-only volumes

2017-06-16 Thread Xavier Hernandez
Hi, currently it seems that a read-only replica 2 volume cannot be healed because all attempts to make changes by the self-heal daemon on the damaged brick will fail with EROFS. It's true that no regular writes are allowed, so there's no possibility to cause damage by partial writes or simil

Re: [Gluster-devel] Disperse volume : Sequential Writes

2017-06-15 Thread Xavier Hernandez
On 15/06/17 11:50, Pranith Kumar Karampuri wrote: On Thu, Jun 15, 2017 at 11:51 AM, Ashish Pandey mailto:aspan...@redhat.com>> wrote: Hi All, We have been facing some issues in disperse (EC) volume. We know that currently EC is not good for random IO as it requires READ-MODIFY

Re: [Gluster-devel] Performance experiments with io-stats translator

2017-06-06 Thread Xavier Hernandez
Hi Krutika, On 06/06/17 13:35, Krutika Dhananjay wrote: Hi, As part of identifying performance bottlenecks within gluster stack for VM image store use-case, I loaded io-stats at multiple points on the client and brick stack and ran randrd test using fio from within the hosted vms in parallel.

Re: [Gluster-devel] GFID2 - Proposal to add extra byte to existing GFID

2017-05-15 Thread Xavier Hernandez
Hi Amar, On May 15, 2017 2:15 PM, Amar Tumballi wrote: > > > > On Tue, Apr 11, 2017 at 2:59 PM, Amar Tumballi wrote: >> >> Comments inline. >> >> On Mon, Dec 19, 2016 at 1:47 PM, Xavier Hernandez wrote: >>> >>> On 12/19/2016 07:57 AM

Re: [Gluster-devel] [DHT] The myth of two hops for linkto file resolution

2017-05-04 Thread Xavier Hernandez
Hi, On 30/04/17 06:03, Raghavendra Gowdappa wrote: All, Its a common perception that the resolution of a file having linkto file on the hashed-subvol requires two hops: 1. client to hashed-subvol. 2. client to the subvol where file actually resides. While it is true that a fresh lookup behav

Re: [Gluster-devel] [Gluster-users] Disperse mkdir fails

2017-03-15 Thread Xavier Hernandez
riginal Message----- From: Xavier Hernandez [mailto:xhernan...@datalab.es] Sent: Tuesday, March 14, 2017 5:28 AM To: Ankireddypalle Reddy; Gluster Devel (gluster-devel@gluster.org); gluster-us...@gluster.org Subject: Re: [Gluster-users] Disperse mkdir fails Hi Ram, On 13/03/17 15:02, Ankireddypa

Re: [Gluster-devel] [Gluster-users] Disperse mkdir fails

2017-03-14 Thread Xavier Hernandez
do you things, from how many clients, ... Xavi Thanks and Regards, Ram -Original Message- From: Xavier Hernandez [mailto:xhernan...@datalab.es] Sent: Monday, March 13, 2017 9:56 AM To: Ankireddypalle Reddy; Gluster Devel (gluster-devel@gluster.org); gluster-us...@gluster.org Subject

Re: [Gluster-devel] [Gluster-users] Disperse mkdir fails

2017-03-13 Thread Xavier Hernandez
m can be avoided. 2) How do we fix the current state of the cluster. Thanks and Regards, Ram -Original Message----- From: Xavier Hernandez [mailto:xhernan...@datalab.es] Sent: Friday, March 10, 2017 3:34 AM To: Ankireddypalle Reddy; Gluster Devel (gluster-devel@gluster.org); gluster-u

Re: [Gluster-devel] Issue in locks xlators

2017-03-10 Thread Xavier Hernandez
Hi Nigel, On 10/03/17 10:11, Nigel Babu wrote: We don't currently save the logs for aborted jos, but I can set that up for you. What files do you want logged? I would need the mount point and brick logs. Thanks, Xavi On Fri, Mar 10, 2017 at 1:15 PM, Xavier Hernandez mailto:xh

[Gluster-devel] Reserve Locks

2017-03-10 Thread Xavier Hernandez
Hi, I'm looking at the locks xlator and I see that it has something called reserve locks. I remember long time ago that someone said this was defined for some purpose, but I don't remember and currently I'm unable to identify any place in the code where these locks are really used. I think i

Re: [Gluster-devel] [Gluster-users] Disperse mkdir fails

2017-03-10 Thread Xavier Hernandez
nformation pausing all activity to that directory. Xavi Thanks and Regards, Ram -Original Message----- From: Xavier Hernandez [mailto:xhernan...@datalab.es] Sent: Thursday, March 09, 2017 11:15 AM To: Ankireddypalle Reddy; Gluster Devel (gluster-devel@gluster.org); gluster-us...@g

[Gluster-devel] Issue in locks xlators

2017-03-09 Thread Xavier Hernandez
Hi, I've posted a patch [1] to fix a memory leak in locks xlator. The fix seems quite straightforward, however I've seen a deadlock in the centos regression twice [2] [3] on the locks_revocation.t test, causing the test to timeout and be aborted. At first sight I haven't seen other failures

Re: [Gluster-devel] [Gluster-users] Disperse mkdir fails

2017-03-09 Thread Xavier Hernandez
Hi Ram, On 09/03/17 16:52, Ankireddypalle Reddy wrote: Attachment (1): 1 info.txt

Re: [Gluster-devel] Pluggable interface for erasure coding?

2017-03-03 Thread Xavier Hernandez
Wednesday(8th of March) or Thursday(9th of March) next week work for you guys? Best, Per Simonsen CEO MemoScale On Thu, Mar 2, 2017 at 12:00 AM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: Hi Niels,

Re: [Gluster-devel] Pluggable interface for erasure coding?

2017-03-02 Thread Xavier Hernandez
Hi Niels, On 02/03/17 07:58, Niels de Vos wrote: Hi guys, I think this is a topic/question that has come up before, but I can not find any references or feature requests related to it. Because there are different libraries for Erasure Coding, it would be interesting to be able to select alterna

Re: [Gluster-devel] release-3.10: Final call for release notes updates

2017-02-20 Thread Xavier Hernandez
Hi Shyam, I've added some comments [1] for the issue between disperse's dynamic code generator and SELinux. It assumes that [2] will be backported to 3.10. Xavi [1] https://review.gluster.org/16685 [2] https://review.gluster.org/16614 On 20/02/17 04:04, Shyam wrote: Hi, Please find the lat

Re: [Gluster-devel] https://review.gluster.org/#/c/16643/

2017-02-20 Thread Xavier Hernandez
Hi Nithya, I've merged it. However Vijay said in another email [1] that backports to 3.9 are not needed anymore. Xavi [1] http://lists.gluster.org/pipermail/gluster-devel/2017-February/052107.html On 20/02/17 09:19, Nithya Balachandran wrote: Hi, Can this be merged ? This is holding up m

[Gluster-devel] Reviews needed

2017-02-16 Thread Xavier Hernandez
Hi everyone, I would need some reviews if you have some time: A memory leak fix in fuse: * Patch already merged in master and 3.10 * Backport to 3.9: https://review.gluster.org/16402 * Backport to 3.8: https://review.gluster.org/16403 A safe fallback for dynamic code generation in E

Re: [Gluster-devel] Release 3.10: Request fix status for RC1 tagging

2017-02-16 Thread Xavier Hernandez
Hi Shyam, On 16/02/17 02:47, Shyam wrote: Hi, The 3.10 release tracker [1], shows 6 bugs needing a fix in 3.10. We need to get RC1 out so that we can start tracking the same for a potential release. Request folks on these bugs to provide a date by when we can expect a fix for these issues. Re

Re: [Gluster-devel] patch for "limited performance for disperse volumes"

2017-02-10 Thread Xavier Hernandez
Hi Raghavendra, On 10/02/17 04:51, Raghavendra Gowdappa wrote: +gluster-devel - Original Message - From: "Milind Changire" To: "Raghavendra Gowdappa" Cc: "rhs-zteam" Sent: Thursday, February 9, 2017 11:00:18 PM Subject: patch for "limited performance for disperse volumes" My first

Re: [Gluster-devel] Release 3.10: Backports (reminder and action needed)

2017-02-07 Thread Xavier Hernandez
Hi Shyam, On 06/02/17 19:12, Shyam wrote: Hi, A recheck after one more week, and our status is healthy, backports that appear in 3.8 or 3.9 are appearing against 3.10 as well. Thank you. However the exception raised last week is not handled yet. Pranith/Xavi/Ashish, The following commits wer

Re: [Gluster-devel] Release 3.10: Backports (reminder and action needed)

2017-02-06 Thread Xavier Hernandez
Hi Shyam, I'll backport both patches to 3.10. On 06/02/17 19:12, Shyam wrote: Hi, A recheck after one more week, and our status is healthy, backports that appear in 3.8 or 3.9 are appearing against 3.10 as well. Thank you. However the exception raised last week is not handled yet. Pranith/Xa

Re: [Gluster-devel] Creating new options for multiple gluster versions

2017-01-30 Thread Xavier Hernandez
Hi Atin, On 31/01/17 05:45, Atin Mukherjee wrote: On Mon, Jan 30, 2017 at 9:02 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: Hi Atin, On 30/01/17 15:25, Atin Mukherjee wrote: On Mon, Jan 30, 2017 at 7:30 PM, Xavier Hernandez mailto:x

Re: [Gluster-devel] Creating new options for multiple gluster versions

2017-01-30 Thread Xavier Hernandez
Hi Atin, On 30/01/17 15:25, Atin Mukherjee wrote: On Mon, Jan 30, 2017 at 7:30 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: Hi, I'm wondering how a new option needs to be created to be available to different versions of gluster. When a new option

[Gluster-devel] Creating new options for multiple gluster versions

2017-01-30 Thread Xavier Hernandez
Hi, I'm wondering how a new option needs to be created to be available to different versions of gluster. When a new option is created for 3.7 for example, it needs to have a GD_OP_VERSION referencing the next 3.7 release. This ensures that there won't be any problem with previous versions.

Re: [Gluster-devel] Spurious regression failure? tests/basic/ec/ec-background-heals.t

2017-01-26 Thread Xavier Hernandez
Hi Atin, I don't clearly see what's the problem. Even if the truncate causes a dirty flag to be set, eventually it should be removed before the $HEAL_TIMEOUT value. For now I've marked the test as bad. Patch is: https://review.gluster.org/16470 Xavi On 25/01/17 17:24, Atin Mukherjee wrote:

Re: [Gluster-devel] [Gluster-users] Lot of EIO errors in disperse volume

2017-01-23 Thread Xavier Hernandez
ards, Ram Sent from my iPhone On Jan 23, 2017, at 3:11 AM, Xavier Hernandez wrote: Hi Ram, On 20/01/17 21:06, Ankireddypalle Reddy wrote: Attachments (2): 1 glustershd.log <https://imap.commvault.com/webconsole/embedded.do?url=https://imap.commvault.com/webconsole/api/drive/publi

Re: [Gluster-devel] [Gluster-users] Lot of EIO errors in disperse volume

2017-01-23 Thread Xavier Hernandez
noise to the real problem. Please find attached the trace logs and heal info output. I'll examine the logs to see if there's something, but the previous patch will help a lot. Xavi Thanks and Regards, Ram -Original Message- From: Xavier Hernandez [m

Re: [Gluster-devel] [Gluster-users] Lot of EIO errors in disperse volume

2017-01-20 Thread Xavier Hernandez
rrors logged by ec_check_status() are not real problems. See patch http://review.gluster.org/16435/ for more info. Xavi Thanks and Regards, Ram -Original Message- From: Xavier Hernandez [mailto:xhernan...@datalab.es] Sent: Friday, January 20, 2017 2:41 AM To: Ankireddypalle Reddy; Ashish P

Re: [Gluster-devel] [Gluster-users] Lot of EIO errors in disperse volume

2017-01-19 Thread Xavier Hernandez
sterfs4sds,glusterfs5sds,glusterfs6sds diagnostics.client-log-level: INFO [root@glusterfs4 glusterfs]# Thanks and Regards, Ram *From:*Ashish Pandey [mailto:aspan...@redhat.com] *Sent:* Thursday, January 19, 2017 10:36 PM *To:* Ankireddypalle Reddy *Cc:* Xavier Hernandez; gluster-us...@gluster.or

Re: [Gluster-devel] [Gluster-users] Lot of EIO errors in disperse volume

2017-01-16 Thread Xavier Hernandez
ectly readable and writable. It's true that there's some problem here and it could derive in EIO if one of the healthy bricks degrades, but at least this file shouldn't be giving EIO errors for now. Xavi Sent on from my iPhone On Jan 16, 2017, at 6:23 AM, Xavier Hernandez wr

Re: [Gluster-devel] [Gluster-users] Lot of EIO errors in disperse volume

2017-01-16 Thread Xavier Hernandez
the trace log ? is there any way I could download it ? Xavi Thanks and Regards, Ram -Original Message- From: gluster-devel-boun...@gluster.org [mailto:gluster-devel-boun...@gluster.org] On Behalf Of Ankireddypalle Reddy Sent: Friday, January 13, 2017 4:17 AM To: Xavier Hernandez

Re: [Gluster-devel] Question about EC locking

2017-01-13 Thread Xavier Hernandez
ilto:jayakrishnan...@gmail.com>> wrote: Thanks Xavier, for making it clear. Regards JK On Dec 13, 2016 3:52 PM, "Xavier Hernandez" mailto:xhernan...@datalab.es>> wrote: Hi JK, On 12/13/2016 08:34 AM, jayakrishnan mm wrote: Dear

Re: [Gluster-devel] [Gluster-users] Lot of EIO errors in disperse volume

2017-01-13 Thread Xavier Hernandez
fy the cause. Again, the TRACE log will be really useful. Xavi Thanks and Regards, Ram -Original Message- From: Xavier Hernandez [mailto:xhernan...@datalab.es] Sent: Thursday, January 12, 2017 6:40 AM To: Ankireddypalle Reddy Cc: Gluster Devel (gluster-devel@gluster.org); gluster-us..

Re: [Gluster-devel] [Gluster-users] Lot of EIO errors in disperse volume

2017-01-12 Thread Xavier Hernandez
self-heal. Xavi Thanks and Regards, Ram Sent from my iPhone On Jan 12, 2017, at 2:25 AM, Xavier Hernandez wrote: Hi Ram, On 12/01/17 02:36, Ankireddypalle Reddy wrote: Xavi, I added some more logging information. The trusted.ec.size field values are in fact different

Re: [Gluster-devel] [Gluster-users] Lot of EIO errors in disperse volume

2017-01-11 Thread Xavier Hernandez
.209753] W [MSGID: 122002] [ec-common.c:71:ec_heal_report] 0-glusterfsProd-disperse-4: Heal failed [Invalid argument] Thanks and Regards, Ram -Original Message- From: Ankireddypalle Reddy Sent: Wednesday, January 11, 2017 9:29 AM To: Ankireddypalle Reddy; Xavier Hernandez; Gluster Devel (

Re: [Gluster-devel] Lot of EIO errors in disperse volume

2017-01-10 Thread Xavier Hernandez
ributes of the file itself, not the parent directories. Xavi Thanks and Regards, Ram -Original Message- From: Xavier Hernandez [mailto:xhernan...@datalab.es] Sent: Tuesday, January 10, 2017 7:53 AM To: Ankireddypalle Reddy; Gluster Devel (gluster-devel@gluster.org); gluster-us...@gluster.org Subj

Re: [Gluster-devel] Lot of EIO errors in disperse volume

2017-01-10 Thread Xavier Hernandez
ticed that some of these operations would succeed if retried. Do you know of any communicated related errors that are being reported/triaged. Thanks and Regards, Ram -Original Message----- From: Xavier Hernandez [mailto:xhernan...@datalab.es] Sent: Tuesday, January 10, 2017 7:23 AM To: Anki

Re: [Gluster-devel] Lot of EIO errors in disperse volume

2017-01-10 Thread Xavier Hernandez
ibutes from the exact file that triggers the EIO. The attached attributes seem consistent and that directory shouldn't cause any problem. Does an 'ls' on that directory fail or does it show the contents ? Xavi Thanks and Regards, Ram -----Original Message- From: Xavier Her

Re: [Gluster-devel] Lot of EIO errors in disperse volume

2017-01-10 Thread Xavier Hernandez
servers at a time. The volume was brought down during upgrade. Thanks and Regards, Ram -Original Message- From: Xavier Hernandez [mailto:xhernan...@datalab.es] Sent: Tuesday, January 10, 2017 6:35 AM To: Ankireddypalle Reddy; Gluster Devel (gluster-devel@gluster.org); gluster-us

Re: [Gluster-devel] Lot of EIO errors in disperse volume

2017-01-10 Thread Xavier Hernandez
Hi Ram, how did you upgrade gluster ? from which version ? Did you upgrade one server at a time and waited until self-heal finished before upgrading the next server ? Xavi On 10/01/17 11:39, Ankireddypalle Reddy wrote: Hi, We upgraded to GlusterFS 3.7.18 yesterday. We see lot of fai

Re: [Gluster-devel] GFID2 - Proposal to add extra byte to existing GFID

2016-12-19 Thread Xavier Hernandez
On 12/19/2016 07:57 AM, Aravinda wrote: regards Aravinda On 12/16/2016 05:47 PM, Xavier Hernandez wrote: On 12/16/2016 08:31 AM, Aravinda wrote: Proposal to add one more byte to GFID to store "Type" information. Extra byte will represent type(directory: 00, file: 01, Symlink: 02

Re: [Gluster-devel] GFID2 - Proposal to add extra byte to existing GFID

2016-12-16 Thread Xavier Hernandez
On 12/16/2016 08:31 AM, Aravinda wrote: Proposal to add one more byte to GFID to store "Type" information. Extra byte will represent type(directory: 00, file: 01, Symlink: 02 etc) For example, if a directory GFID is f4f18c02-0360-4cdc-8c00-0164e49a7afd then, GFID2 will be 00f4f18c02-0360-4cdc-8c

Re: [Gluster-devel] 1402538 : Assertion failure during rebalance of symbolic links

2016-12-15 Thread Xavier Hernandez
On 12/15/2016 01:41 PM, Nithya Balachandran wrote: On 15 December 2016 at 18:07, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: On 12/15/2016 12:48 PM, Raghavendra Gowdappa wrote: I need to step back a little to understand the RCA correctly. If I understa

Re: [Gluster-devel] 1402538 : Assertion failure during rebalance of symbolic links

2016-12-15 Thread Xavier Hernandez
On 12/15/2016 12:48 PM, Raghavendra Gowdappa wrote: I need to step back a little to understand the RCA correctly. If I understand the code correctly, the callstack which resulted in failed setattr is (in rebalance process): dht_lookup -> dht_lookup_cbk -> dht_lookup_everwhere -> dht_lookup_eve

Re: [Gluster-devel] 1402538 : Assertion failure during rebalance of symbolic links

2016-12-14 Thread Xavier Hernandez
On 12/14/2016 10:28 AM, Pranith Kumar Karampuri wrote: On Wed, Dec 14, 2016 at 2:54 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: On 12/14/2016 10:17 AM, Pranith Kumar Karampuri wrote: On Wed, Dec 14, 2016 at 1:48 PM, Xavier Hernandez mailto:x

Re: [Gluster-devel] 1402538 : Assertion failure during rebalance of symbolic links

2016-12-14 Thread Xavier Hernandez
On 12/14/2016 10:17 AM, Pranith Kumar Karampuri wrote: On Wed, Dec 14, 2016 at 1:48 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: There's another issue with the patch that Ashish sent. The original problem is that a setattr on a symbolic link gets trans

Re: [Gluster-devel] 1402538 : Assertion failure during rebalance of symbolic links

2016-12-14 Thread Xavier Hernandez
original file. This way most of these problems should be solved. Xavi On 12/14/2016 09:02 AM, Xavier Hernandez wrote: On 12/14/2016 06:10 AM, Raghavendra Gowdappa wrote: - Original Message - From: "Pranith Kumar Karampuri" To: "Ashish Pandey" Cc: "Gluster D

Re: [Gluster-devel] 1402538 : Assertion failure during rebalance of symbolic links

2016-12-14 Thread Xavier Hernandez
On 12/14/2016 06:10 AM, Raghavendra Gowdappa wrote: - Original Message - From: "Pranith Kumar Karampuri" To: "Ashish Pandey" Cc: "Gluster Devel" , "Shyam Ranganathan" , "Nithya Balachandran" , "Xavier Hernandez" , "Ra

Re: [Gluster-devel] Question about EC locking

2016-12-12 Thread Xavier Hernandez
n mm mailto:jayakrishnan...@gmail.com>> wrote: Hi Xavier, Thank you very much for your explanation. This helped me to understand more about locking in EC. Best Regards JK On Mon, Nov 28, 2016 at 4:17 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> w

Re: [Gluster-devel] Question about EC locking

2016-11-28 Thread Xavier Hernandez
Hi, On 11/28/2016 02:59 AM, jayakrishnan mm wrote: Hi Xavier, Notice that EC xlator uses blocking locks. Any specific reason for this? In a distributed filesystem like gluster a synchronization mechanism is a must to avoid data corruption. Do you think this will affect the performance

Re: [Gluster-devel] Why vandermonde matrix is used in EC?

2016-11-27 Thread Xavier Hernandez
ad from that brick. We need to verify in some way that the other bricks do not contain updated data. Best regards, Xavi Best regards, Han 2016-11-24 17:26 GMT+09:00 Xavier Hernandez mailto:xhernan...@datalab.es>>: Hi Han, On 11/24/2016 04:25 AM, 한우형 wrote: Hi,

Re: [Gluster-devel] Why vandermonde matrix is used in EC?

2016-11-24 Thread Xavier Hernandez
Hi Han, On 11/24/2016 04:25 AM, 한우형 wrote: Hi, I'm working on dispersed volume(ec) and I found ec encode/decode algorithm is using non-systematic vandermonde matrix. My question is this: why non-systematic algorithm is used? Non-systematic encoding/decoding doesn't alter performance when one

Re: [Gluster-devel] Possible problem introduced by http://review.gluster.org/15573

2016-10-24 Thread Xavier Hernandez
Hi Soumya, On 21/10/16 16:15, Soumya Koduri wrote: On 10/21/2016 06:35 PM, Soumya Koduri wrote: Hi Xavi, On 10/21/2016 12:57 PM, Xavier Hernandez wrote: Looking at the code, I think that the added fd_unref() should only be called if the fop preparation fails. Otherwise the callback already

Re: [Gluster-devel] Possible problem introduced by http://review.gluster.org/15573

2016-10-23 Thread Xavier Hernandez
On 21/10/16 15:05, Soumya Koduri wrote: Hi Xavi, On 10/21/2016 12:57 PM, Xavier Hernandez wrote: Looking at the code, I think that the added fd_unref() should only be called if the fop preparation fails. Otherwise the callback already unreferences the fd. Code flow

Re: [Gluster-devel] Possible problem introduced by http://review.gluster.org/15573

2016-10-21 Thread Xavier Hernandez
Hi Niels, On 21/10/16 10:03, Niels de Vos wrote: On Fri, Oct 21, 2016 at 09:03:30AM +0200, Xavier Hernandez wrote: Hi, I've just tried Gluster 3.8.5 with Proxmox using gfapi and I consistently see a crash each time an attempt to connect to the volume is made. Thanks, that likely is the

Re: [Gluster-devel] Possible problem introduced by http://review.gluster.org/15573

2016-10-21 Thread Xavier Hernandez
. * When glfs_io_async_cbk() is called another ref is released. Note that if fop preparation fails, a single fd_unref() is called, but on success two fd_unref() are called. Xavi On 21/10/16 09:03, Xavier Hernandez wrote: Hi, I've just tried Gluster 3.8.5 with Proxmox using gfapi

[Gluster-devel] Possible problem introduced by http://review.gluster.org/15573

2016-10-21 Thread Xavier Hernandez
Hi, I've just tried Gluster 3.8.5 with Proxmox using gfapi and I consistently see a crash each time an attempt to connect to the volume is made. The backtrace of the crash shows this: #0 pthread_spin_lock () at ../nptl/sysdeps/x86_64/pthread_spin_lock.S:24 #1 0x7fe5345776a5 in fd_unref

Re: [Gluster-devel] Multiplexing - good news, bad news, and a plea for help

2016-09-20 Thread Xavier Hernandez
On 19/09/16 15:26, Jeff Darcy wrote: I have brick multiplexing[1] functional to the point that it passes all basic AFR, EC, and quota tests. There are still some issues with tiering, and I wouldn't consider snapshots functional at all, but it seemed like a good point to see how well it work

Re: [Gluster-devel] Review request for 3.9 patches

2016-09-18 Thread Xavier Hernandez
Hi Poornima, On 19/09/16 07:01, Poornima Gurusiddaiah wrote: Hi All, There are 3 more patches that we need for enabling md-cache invalidation in 3.9. Request your help with the reviews: http://review.gluster.org/#/c/15378/ - afr: Implement IPC fop http://review.gluster.org/#/c/15387/ - ec:

Re: [Gluster-devel] Query regards to heal xattr heal in dht

2016-09-15 Thread Xavier Hernandez
On 15/09/16 11:31, Raghavendra G wrote: On Thu, Sep 15, 2016 at 12:02 PM, Nithya Balachandran mailto:nbala...@redhat.com>> wrote: On 8 September 2016 at 12:02, Mohit Agrawal mailto:moagr...@redhat.com>> wrote: Hi All, I have one another solution to heal user xattr

Re: [Gluster-devel] Need help with https://bugzilla.redhat.com/show_bug.cgi?id=1224180

2016-09-13 Thread Xavier Hernandez
On 13/09/16 21:00, Pranith Kumar Karampuri wrote: On Tue, Sep 13, 2016 at 1:39 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: Hi Sanoj, On 13/09/16 09:41, Sanoj Unnikrishnan wrote: Hi Xavi, That explains a lot, I see a couple of other sc

Re: [Gluster-devel] Need help with https://bugzilla.redhat.com/show_bug.cgi?id=1224180

2016-09-13 Thread Xavier Hernandez
in fdl instead of trying to find a custom solution to this. Xavi Thanks and Regards, Sanoj - Original Message - From: "Xavier Hernandez" To: "Raghavendra Gowdappa" , "Sanoj Unnikrishnan" Cc: "Pranith Kumar Karampuri" , "Ashish Pan

Re: [Gluster-devel] Need help with https://bugzilla.redhat.com/show_bug.cgi?id=1224180

2016-09-12 Thread Xavier Hernandez
Hi Sanoj, I'm unable to see bug 1224180. Access is restricted. Not sure what is the problem exactly, but I see that quota is involved. Currently disperse doesn't play well with quota when the limit is near. The reason is that not all bricks fail at the same time with EDQUOT due to small diff

Re: [Gluster-devel] Spurious termination of fuse invalidation notifier thread

2016-09-05 Thread Xavier Hernandez
Hi Raghavendra, On 06/09/16 06:11, Raghavendra Gowdappa wrote: - Original Message - From: "Xavier Hernandez" To: "Raghavendra Gowdappa" , "Kaleb Keithley" , "Pranith Kumar Karampuri" Cc: "Csaba Henk" , "Gluster Devel&qu

Re: [Gluster-devel] Spurious termination of fuse invalidation notifier thread

2016-09-05 Thread Xavier Hernandez
Hi Raghavendra, On 03/09/16 05:42, Raghavendra Gowdappa wrote: Hi Xavi/Kaleb/Pranith, During few of our older conversations (like [1], but not only one), some of you had reported that the thread which writes invalidation notifications (of inodes, entries) to /dev/fuse terminates spuriously. C

Re: [Gluster-devel] Notifications (was Re: GF_PARENT_DOWN on SIGKILL)

2016-07-24 Thread Xavier Hernandez
Hi Jeff, On 22/07/16 16:14, Jeff Darcy wrote: I don't think we need any list traversal because notify sends it down the graph. Good point. I think we need to change that, BTW. Relying on translators to propagate notifications has proven very fragile, as many of those events are overloaded to

Re: [Gluster-devel] GF_PARENT_DOWN on SIGKILL

2016-07-24 Thread Xavier Hernandez
Hi Jeff, On 22/07/16 15:37, Jeff Darcy wrote: Gah! sorry sorry, I meant to send the mail as SIGTERM. Not SIGKILL. So xavi and I were wondering why cleanup_and_exit() is not sending GF_PARENT_DOWN event. OK, then that grinding sound you hear is my brain shifting gears. ;) It seems that cleanu

Re: [Gluster-devel] performance issues Manoj found in EC testing

2016-06-28 Thread Xavier Hernandez
anith Kumar Karampuri" mailto:pkara...@redhat.com>> *To: *"Xavier Hernandez" mailto:xhernan...@datalab.es>> *Cc: *"Gluster Devel" mailto:gluster-devel@gluster.org>> *Sent: *Monday, June 27, 2016 5:48:24 PM *Subject: *Re: [Glust

Re: [Gluster-devel] performance issues Manoj found in EC testing

2016-06-26 Thread Xavier Hernandez
From: "Pranith Kumar Karampuri" To: "Xavier Hernandez" Cc: "Manoj Pillai" , "Gluster Devel" Sent: Thursday, June 23, 2016 8:50:44 PM Subject: performance issues Manoj found in EC testing hi Xavi, Meet Manoj from performance team Redhat. He has been t

Re: [Gluster-devel] Wrong assumptions about disperse

2016-06-20 Thread Xavier Hernandez
Hi Shyam, On 17/06/16 15:59, Shyam wrote: On 06/17/2016 04:59 AM, Xavier Hernandez wrote: Firstly, thanks for the overall post, was informative and helps clarify some aspects of EC. AFAIK the real problem of EC is the communications layer. It adds a lot of latency and having to communicate

[Gluster-devel] Wrong assumptions about disperse

2016-06-17 Thread Xavier Hernandez
Hi all, I've seen in many places the belief that disperse, or erasure coding in general, is slow because of the complex or costly math involved. It's true that there's an overhead compared to a simple copy like replica does, but this overhead is way more smaller than many people think. The m

Re: [Gluster-devel] Failure to release unusable file open fd_count on glusterfs v3.7.11

2016-06-09 Thread Xavier Hernandez
Hi, thanks for testing it. I've identified an fd leak in the disperse xlator. I've filed a bug [1] for this. Xavi [1] https://bugzilla.redhat.com/show_bug.cgi?id=1344396 On 08.06.2016 05:00, 彭繼霆 wrote: > Hi, I have a volume created with 3 bricks.After delete file which was created by "

Re: [Gluster-devel] dht mkdir preop check, afr and (non-)readable afr subvols

2016-06-06 Thread Xavier Hernandez
Hi Raghavendra, On 06/06/16 10:54, Raghavendra G wrote: On Wed, Jun 1, 2016 at 12:50 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: Hi, On 01/06/16 08:53, Raghavendra Gowdappa wrote: - Original Message - From: "Xavie

Re: [Gluster-devel] dht mkdir preop check, afr and (non-)readable afr subvols

2016-06-01 Thread Xavier Hernandez
Hi, On 01/06/16 08:53, Raghavendra Gowdappa wrote: - Original Message - From: "Xavier Hernandez" To: "Pranith Kumar Karampuri" , "Raghavendra G" Cc: "Gluster Devel" Sent: Wednesday, June 1, 2016 11:57:12 AM Subject: Re: [Gluster-de

Re: [Gluster-devel] dht mkdir preop check, afr and (non-)readable afr subvols

2016-05-31 Thread Xavier Hernandez
>> wrote: On Tue, May 31, 2016 at 12:37 PM, Xavier Hernandez mailto:xhernan...@datalab.es>> wrote: Hi, On 31/05/16 07:05, Raghavendra Gowdappa wrote: +gluster-devel, +Xavi Hi all, The context is [1], where bricks do p

Re: [Gluster-devel] dht mkdir preop check, afr and (non-)readable afr subvols

2016-05-31 Thread Xavier Hernandez
Hi, On 31/05/16 07:05, Raghavendra Gowdappa wrote: +gluster-devel, +Xavi Hi all, The context is [1], where bricks do pre-operation checks before doing a fop and proceed with fop only if pre-op check is successful. @Xavi, We need your inputs on behavior of EC subvolumes as well. If I under

Re: [Gluster-devel] Possible bug in the communications layer ?

2016-05-09 Thread Xavier Hernandez
I've uploaded a patch for this problem: http://review.gluster.org/14270 Any review will be very appreciated :) Thanks, Xavi On 09/05/16 12:35, Raghavendra Gowdappa wrote: - Original Message - From: "Xavier Hernandez" To: "Raghavendra Gowdappa" Cc: &quo

Re: [Gluster-devel] Possible bug in the communications layer ?

2016-05-09 Thread Xavier Hernandez
. This causes the decode to leave read_rsp.xdata.xdata_len set to 0. 6. The program interprets that xdata_len being 0 means that there's no xdata, so it continues reading the remaining of the RPC packet into the payload buffer. If you want, I can send a patch for this. Xavi On 05/

Re: [Gluster-devel] Bugs with incorrect status

2016-05-06 Thread Xavier Hernandez
I think there's a problem with the script that generates this report. The changes I2fac59 and Ie1934f are bound to bug 1332054, not 1236065. Xavi On 06/05/16 10:41, Niels de Vos wrote: 1236065 (mainline) MODIFIED: Disperse volume: FUSE I/O error after self healing the failed disk files [mas

Re: [Gluster-devel] [Gluster-users] Fwd: dht_is_subvol_filled messages on client

2016-05-05 Thread Xavier Hernandez
On 05/05/16 13:59, Kaushal M wrote: On Thu, May 5, 2016 at 4:37 PM, Xavier Hernandez wrote: On 05/05/16 11:31, Kaushal M wrote: On Thu, May 5, 2016 at 2:36 PM, David Gossage wrote: On Thu, May 5, 2016 at 3:28 AM, Serkan Çoban wrote: Hi, You can find the output below link: https

Re: [Gluster-devel] [Gluster-users] Fwd: dht_is_subvol_filled messages on client

2016-05-05 Thread Xavier Hernandez
, but free will be ~6.300.000. This gives a ~0.8% available, i.e. almost 100% full. Given the circumstances I think it's the correct thing to do. Xavi BTW, how large is the volume you have? Those are a lot of bricks! ~kaushal On Thu, May 5, 2016 at 9:33 AM, Xavier Hernandez wrote

Re: [Gluster-devel] Possible bug in the communications layer ?

2016-05-05 Thread Xavier Hernandez
I've undone all changes and now I'm unable to reproduce the problem, so the modification I did is probably incorrect and not the root cause, as you described. I'll continue investigating... Xavi On 04/05/16 15:01, Xavier Hernandez wrote: On 04/05/16 14:47, Raghavendra

Re: [Gluster-devel] [Gluster-users] Fwd: dht_is_subvol_filled messages on client

2016-05-04 Thread Xavier Hernandez
Can you post the result of 'gluster volume status v0 detail' ? On 05/05/16 06:49, Serkan Çoban wrote: Hi, Can anyone suggest something for this issue? df, du has no issue for the bricks yet one subvolume not being used by gluster.. On Wed, May 4, 2016 at 4:40 PM, Serkan Çoban wrote: Hi, I ch

Re: [Gluster-devel] Possible bug in the communications layer ?

2016-05-04 Thread Xavier Hernandez
On 04/05/16 14:47, Raghavendra Gowdappa wrote: - Original Message - From: "Xavier Hernandez" To: "Raghavendra Gowdappa" Cc: "Gluster Devel" Sent: Wednesday, May 4, 2016 5:37:56 PM Subject: Re: [Gluster-devel] Possible bug in the communications l

Re: [Gluster-devel] Possible bug in the communications layer ?

2016-05-04 Thread Xavier Hernandez
oto_read() and it seemed to work. Could anyone with more knowledge about the communications layer verify this and explain what would be the best solution ? Xavi On 29/04/16 14:52, Xavier Hernandez wrote: With your patch applied, it seems that the bug is not hit. I guess it's a timing iss

  1   2   3   >