Re: [Lustre-discuss] Time needed to enable quota

2011-06-16 Thread Roland Laifer
On Wed, Jun 15, 2011 at 01:30:08PM +0100, Guy Coates wrote: On 15/06/11 13:14, Frank Heckes wrote: Hi all, we're planning to enable quota on our Lustre file systems running with version 1.8.4. We like to estimate the downtime needed to run quotacheck. Hi, We recently did a

Re: [Lustre-discuss] Error when mv from Lustre to system

2011-06-16 Thread styr
It works. :) Thanks Christian and Andreas for your time. - Mail Original - De: Christian Becker christian.bec...@math.tu-dortmund.de À: Andreas Dilger adil...@whamcloud.com Cc: s...@free.fr, lustre-discuss@lists.lustre.org Envoyé: Mercredi 15 Juin 2011 19h53:54 GMT +01:00 Amsterdam /

[Lustre-discuss] Path lost when accessing files

2011-06-16 Thread styr
Hi Lustre users, we actually a little problems with jobs running on our cluster and using Lustre. Sometimes, we have these errors : forrtl: No such file or directory forrtl: severe (29): file not found, unit 213, file �@/suivi.d000 It does not only happen with forttl but also sometimes

[Lustre-discuss] lfs quotacheck -ug /lfs01/ sleeps

2011-06-16 Thread Mohamed Adel
Dear all, I'm trying to enable quota on my lustre file system. Issuing the lfs quotacheck -ug /lfs01/ command doesn't produce anything. And ps aux | grep lfs command shows that the process is sleeping. I don't know where to go from here. Any idea to discover what went wrong? thanks in advance,

Re: [Lustre-discuss] lfs quotacheck -ug /lfs01/ sleeps

2011-06-16 Thread Ashley Pittman
On 16 Jun 2011, at 11:33, Mohamed Adel wrote: Dear all, I'm trying to enable quota on my lustre file system. Issuing the lfs quotacheck -ug /lfs01/ command doesn't produce anything. And ps aux | grep lfs command shows that the process is sleeping. I don't know where to go from here.

Re: [Lustre-discuss] lfs quotacheck -ug /lfs01/ sleeps

2011-06-16 Thread Mohamed Adel
Dear Ashley, Thanks for your quick response. This is correct, look for a kernel process called quotacheck on the Lustre servers, when all those threads have exited then lfs should also exit. As came up yesterday this could take a few tens of minutes. Issuing ps aux | grep quotacheck

Re: [Lustre-discuss] lfs quotacheck -ug /lfs01/ sleeps

2011-06-16 Thread Ashley Pittman
On 16 Jun 2011, at 11:54, Mohamed Adel wrote: Dear Ashley, Thanks for your quick response. This is correct, look for a kernel process called quotacheck on the Lustre servers, when all those threads have exited then lfs should also exit. As came up yesterday this could take a few tens

Re: [Lustre-discuss] $MOUNT2 in acc-sm

2011-06-16 Thread Brian J. Murrell
On 11-06-15 05:58 PM, Jay Lan wrote: I found my problem! I defined MOUNT=/mnt/nbp0 and MOUNT2=/mnt/nbp0-2. Bad idea!!! The sanity_mount_check* scripts use `grep` to search for $MOUNT and $MOUNT2. Since $MOUNT is a substring of $MOUNT2, `grep` on situations return wrong count! That

Re: [Lustre-discuss] $MOUNT2 in acc-sm

2011-06-16 Thread Peter Jones
A slight typo - http://jira.whamcloud.com On 11-06-16 5:07 AM, Brian J. Murrell wrote: snip That sounds like a bug. Can you please file a ticket at http://jira.whamcould.com/ detailing your problem and solution? -- Peter Jones Whamcloud, Inc. www.whamcloud.com

Re: [Lustre-discuss] $MOUNT2 in acc-sm

2011-06-16 Thread Brian J. Murrell
On 11-06-16 10:15 AM, Peter Jones wrote: A slight typo - http://jira.whamcloud.com Thanks Peter. On 11-06-16 5:07 AM, Brian J. Murrell wrote: snip That sounds like a bug. Can you please file a ticket at http://jira.whamcould.com/ detailing your problem and solution?

[Lustre-discuss] LustreError: 26019:0:(file.c:3143:ll_inode_revalidate_fini()) failure -2 inode

2011-06-16 Thread fenix . serega
Hi Lustre 1.8 A lot of LustreErrors on client: ustreError: 8747:0:(file.c:3143:ll_inode_revalidate_fini()) Skipped 6 previous similar messages LustreError: 8747:0:(file.c:3143:ll_inode_revalidate_fini()) failure -2 inode 63486047 LustreError: 8747:0:(file.c:3143:ll_inode_revalidate_fini())

[Lustre-discuss] Unexpect file system error during normal system works

2011-06-16 Thread Piotr Przybylo
We have a problem with lustre, in connection with this I wanted to ask you, can you help us ? We have a unexpect file system error during normal system working. / Jun 13 15:00:30 ossw12 kernel: LDISKFS-fs error (device dm-9): mb_free_blocks: double-free of inode 82041293's block 346591170(bit

Re: [Lustre-discuss] Unexpect file system error during normal system works

2011-06-16 Thread Wojciech Turek
Hi Piotr, Which lustre version is this? Also which version of e2fsprogs are you using? Is the back end disk a software RAID or HW raid? If you can not see any errors on your hardware I would recommend to run fsck few times until it does does not find any problems. I also highly recommend to

[Lustre-discuss] What exactly is punch statistic?

2011-06-16 Thread Mervini, Joseph A
Hi, I have been covertly trying for a long time to find out what punch means as far a lustre llobdstat output but have not really found anything definitive. Can someone answer that for me? (BTW: I am not alone in my ignorance... :) ) Thanks. Joe Mervini Sandia National Laboratories High

Re: [Lustre-discuss] What exactly is punch statistic?

2011-06-16 Thread Cliff White
It is called when truncating a file - afaik it is showing you the number of truncates, more or less. cliffw On Thu, Jun 16, 2011 at 10:52 AM, Mervini, Joseph A jame...@sandia.govwrote: Hi, I have been covertly trying for a long time to find out what punch means as far a lustre llobdstat

Re: [Lustre-discuss] Path lost when accessing files

2011-06-16 Thread Peter Kjellström
On Thursday, June 16, 2011 03:30:38 PM Sebastien Piechurski wrote: Hi, This problem is documented in bug 23978 (http://bugzilla.lustre.org/show_bug.cgi?id=23978). To summarize: the fortran runtime is making a call to getcwd() to get the full path to a file which was given as a relative