Re: [Lustre-discuss] clients gets EINTR from time to time

2011-03-04 Thread Francois Chassaing
- From: John Hammond jhamm...@tacc.utexas.edu To: Andreas Dilger adil...@whamcloud.com Cc: lustre-discuss@lists.lustre.org Sent: Friday, 25 February 2011 21:16:36 GMT +01:00 Amsterdam / Berlin / Bern / Rome / Stockholm / Vienna Subject: Re: [Lustre-discuss] clients gets EINTR from time to time On 02/25

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-25 Thread Francois Chassaing
: DEGREMONT Aurelien aurelien.degrem...@cea.fr Cc: Francois Chassaing f...@weborama.com, lustre-discuss@lists.lustre.org Sent: Thursday, 24 February 2011 18:43:25 GMT +01:00 Amsterdam / Berlin / Bern / Rome / Stockholm / Vienna Subject: Re: [Lustre-discuss] clients gets EINTR from time to time

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-25 Thread Brian J. Murrell
On 11-02-25 06:18 AM, Francois Chassaing wrote: Thanks, but anyway, the logs on the MDS/MGS do not show any evicted client of any kind. Also, the log output by lctl debug_kernel on the clients does not show much; I can only see in there the last administrative actions I've taken (such as setting

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-25 Thread Francois Chassaing
François Chassaing Technical Director - CTO - Original Message - From: Brian J. Murrell br...@whamcloud.com To: lustre-discuss@lists.lustre.org Sent: Friday, 25 February 2011 14:28:02 GMT +01:00 Amsterdam / Berlin / Bern / Rome / Stockholm / Vienna Subject: Re: [Lustre-discuss] clients gets

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-25 Thread Ken Hornstein
I don't understand why you don't just fix your application to handle a perfectly valid and expected condition (that it's currently not handling) instead of wasting time trying to find the cause of the expected condition. Even if you find it, it's likely not a bug and not something that can/will

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-25 Thread Cory Spitz
Hi. I think it would help if you knew what the signal was. Do you have that yet? I have a report from a user who is getting EINTR when a SIGALRM goes off on his write(). It isn't unexpected to get SIGALRM because he called the alarm, but he also has SA_RESTART set. I can't remember
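
For readers unfamiliar with the flag mentioned above, here is a minimal sketch of how a SIGALRM handler is typically installed with SA_RESTART through sigaction(). The names on_alarm and install_restarting_alarm_handler are hypothetical and do not come from the thread. The sketch only shows how the flag is requested; whether a write() interrupted on a Lustre mount is actually restarted is exactly the question being discussed here.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <signal.h>
#include <string.h>

/* Hypothetical handler: it does nothing, it only needs to exist so that
 * SIGALRM is caught instead of terminating the process. */
static void on_alarm(int signo)
{
    (void)signo;
}

/* Hypothetical helper: install on_alarm for SIGALRM and ask the kernel to
 * restart interruptible system calls (SA_RESTART) rather than failing them
 * with EINTR. Returns 0 on success, -1 on error, like sigaction(). */
static int install_restarting_alarm_handler(void)
{
    struct sigaction sa;

    memset(&sa, 0, sizeof sa);
    sa.sa_handler = on_alarm;
    sa.sa_flags = SA_RESTART;
    sigemptyset(&sa.sa_mask);   /* block no extra signals inside the handler */
    return sigaction(SIGALRM, &sa, NULL);
}
```

Even with the flag set, not every call is restarted in every situation, so an application may still want to check for EINTR itself.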

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-25 Thread Ken Hornstein
I have a report from a user who is getting EINTR when a SIGALRM goes off on his write(). It isn't unexpected to get SIGALRM because he called the alarm, but he also has SA_RESTART set. I can't remember whose responsibility it is to restart the call, syscall or wherever, but it seems

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-25 Thread John Hammond
On 02/25/2011 11:39 AM, Andreas Dilger wrote: On 2011-02-25, at 6:28, Brian J. Murrell br...@whamcloud.com wrote: On 11-02-25 06:18 AM, Francois wrote: I continue to parse debug logs and keep them posted. I don't understand why you don't just fix your application to handle a perfectly

[Lustre-discuss] clients gets EINTR from time to time

2011-02-24 Thread Francois Chassaing
Dear list members, We are using Lustre 1.8.5 (upgraded from 1.8.4) running on 1 MGS, 3 OSS over DDR IB, and 2 patched clients mounted with the flock option. We are experiencing issues with an application that gets an EINTR when trying to write to a file. Those errors happen randomly on both

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-24 Thread Brian J. Murrell
On 11-02-24 05:50 AM, Francois Chassaing wrote: Dear list members, Hi, We are experiencing issues with an application that gets an EINTR when trying to write to a file. If I understand that errno properly, that is to be expected. Those errors happen randomly on both clients, Well, not

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-24 Thread Francois Chassaing
/ Stockholm / Vienna Subject: Re: [Lustre-discuss] clients gets EINTR from time to time On 11-02-24 05:50 AM, Francois Chassaing wrote: Dear list members, Hi, We are experiencing issues with an application that gets an EINTR when trying to write to a file. If I understand that errno properly

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-24 Thread Brian J. Murrell
On 11-02-24 08:16 AM, Francois Chassaing wrote: Well, as I understand your point and I do also understand that this signal is not a malfunction, No, but not handling it properly is. Interpreting an EINTR as the disk must be full (i.e. a fatal error) is wrong. my question was regarding to

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-24 Thread Francois Chassaing
@lists.lustre.org Sent: Thursday, 24 February 2011 14:29:27 GMT +01:00 Amsterdam / Berlin / Bern / Rome / Stockholm / Vienna Subject: Re: [Lustre-discuss] clients gets EINTR from time to time On 11-02-24 08:16 AM, Francois Chassaing wrote: Well, as I understand your point and I do also understand

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-24 Thread Ken Hornstein
OK, the app is used to deal with standard disks, that is why it is not handling the EINTR signal properly. I think you're misunderstanding what a signal is in the Unix sense. EINTR isn't a signal; it's a return code from the write() system call that says, Hey, you got a signal in the middle of
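
To illustrate the point that EINTR is an errno value reported by write() and not a signal, here is a minimal sketch of the usual way an application copes with it: retry the call when it fails with EINTR, and treat only other errors as fatal. The helper name write_all is hypothetical and is not taken from the application discussed in this thread.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <unistd.h>

/* Hypothetical helper: write exactly len bytes to fd.
 * Retries when write() fails with EINTR and continues after short writes.
 * Returns 0 on success, -1 on a real error (errno is left as set by write()). */
static int write_all(int fd, const void *buf, size_t len)
{
    const char *p = buf;

    assert(buf != NULL || len == 0);
    while (len > 0) {
        ssize_t n = write(fd, p, len);
        if (n < 0) {
            if (errno == EINTR)
                continue;   /* interrupted before any data was written: retry */
            return -1;      /* genuine failure, for example ENOSPC or EIO */
        }
        p += n;             /* short write: advance and write the rest */
        len -= (size_t)n;
    }
    return 0;
}
```

With a loop like this, an interrupted write is no longer mistaken for a fatal condition such as a full disk.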

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-24 Thread Francois Chassaing
Chassaing f...@weborama.com Cc: lustre-discuss@lists.lustre.org Sent: Thursday, 24 February 2011 15:54:24 GMT +01:00 Amsterdam / Berlin / Bern / Rome / Stockholm / Vienna Subject: Re: [Lustre-discuss] clients gets EINTR from time to time OK, the app is used to deal with standard disks, that is why

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-24 Thread Ken Hornstein
As for your questions: - I have to mention that I have always had this issue, and this is why I've upgraded from 1.8.4 to 1.8.5, hoping this would solve it. Ah, okay, I misunderstood that; my apologies. - I will try to have that SA_RESTART flag set in the app... if I can find where the signal

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-24 Thread DEGREMONT Aurelien
: Ken Hornstein k...@cmf.nrl.navy.mil To: Francois Chassaing f...@weborama.com Cc: lustre-discuss@lists.lustre.org Sent: Thursday, 24 February 2011 15:54:24 GMT +01:00 Amsterdam / Berlin / Bern / Rome / Stockholm / Vienna Subject: Re: [Lustre-discuss] clients gets EINTR from time to time OK

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-24 Thread Brian J. Murrell
On 11-02-24 11:57 AM, DEGREMONT Aurelien wrote: Hello Hi, From my understanding, Lustre can return EINTR for some I/O error cases. I think that should/would be an EIO. I think that when a client gets evicted in the middle of one of its RPCs, it can return EINTR to the caller. An evicted

Re: [Lustre-discuss] clients gets EINTR from time to time

2011-02-24 Thread Kevin Van Maren
:00 Amsterdam / Berlin / Bern / Rome / Stockholm / Vienna Subject: Re: [Lustre-discuss] clients gets EINTR from time to time OK, the app is used to deal with standard disks, that is why it is not handling the EINTR signal properly. I think you're misunderstanding what