Re: [lustre-discuss] server_bulk_callback errors until server reboots

2018-06-07 Thread White, Cliff
On 6/7/18, 7:00 AM, "lustre-discuss on behalf of Hebenstreit, Michael" wrote: Hello I have now 2 Lustre systems that suddenly show this error - on a single OST the kernel log is filling with messages [58858.365663] LustreError: 123642:0:(events.c:447:server_bulk_callbac

Re: [lustre-discuss] HPC Head node clustering and lustre

2017-11-21 Thread White, Cliff
If the Lustre filesystem is mounted as a client on the head node(s), there should be no concerns over the failover of those nodes. And no real need to failover Lustre, it can be mounted as a client on both nodes. Much like a common NFS share, but better locking. If the head node is a Lustre serv

Re: [lustre-discuss] how homogenous should a lustre cluster be?

2017-03-20 Thread White, Cliff
Comments inlne. From: lustre-discuss mailto:lustre-discuss-boun...@lists.lustre.org>> on behalf of "E.S. Rosenberg" mailto:esr+lus...@mail.hebrew.edu>> Date: Monday, March 20, 2017 at 10:19 AM To: "lustre-discuss@lists.lustre.org" mailto:lustre-discuss@l

Re: [Lustre-discuss] targets start order in Lustre 2.4.3

2014-05-23 Thread White, Cliff
In a failover situation, any target can be stopped and restarted without impact on other nodes. The startup order in the manual is for a cold startup/full shutdown situation, and does not apply to a running filesystem and failover. You should not have the ordering directive, I think. In particul

Re: [Lustre-discuss] Which NID to use?

2014-03-03 Thread White, Cliff
he preferred solution. Best, cliffw Regards, Patrick On Fri, 28 Feb 2014 21:20:58 +, White, Cliff wrote: On 2/28/14, 1:17 AM, "Chan Ching Yu Patrick" mailto:cyc...@clustertech.com>> wrote: Hi Mohr, The reason why I made this setup is I'm not sure how Lustr

Re: [Lustre-discuss] Which NID to use?

2014-02-28 Thread White, Cliff
On 2/28/14, 1:17 AM, "Chan Ching Yu Patrick" wrote: >Hi Mohr, > >The reason why I made this setup is I'm not sure how Lustre selects the >interface in mult-rail environment. > >Especially when all node have Infiniband and Ethernet, how can I ensure >Infiniband is used between client and OSS? The

Re: [Lustre-discuss] OST Failover Configuration (Active/Active) verification

2014-01-31 Thread White, Cliff
On 1/30/14, 9:21 PM, "Peter Mistich" wrote: >hello, > >anyone here can answer a questions about OST Failover Configuration >(Active/Active) I think I understand but want to make sure. > >I configure 2 oss servernames = node1 and node2 with 2 shared drives >/dev/sdb and /dev/sdc and on node1 >

Re: [Lustre-discuss] lustre 1.8.5 client failed to mount lustre

2013-10-17 Thread White, Cliff
/proc/syus/lnet >does not exist. That's not the problem. What error messages does 'modprobe -v lustre' give? > > >-Weilin > >-Original Message- >From: White, Cliff [mailto:cliff.wh...@intel.com] >Sent: Thursday, October 17, 2013 10:59 AM >To: Weili

Re: [Lustre-discuss] lustre 1.8.5 client failed to mount lustre

2013-10-17 Thread White, Cliff
From: Weilin Chang mailto:weilin.ch...@huawei.com>> Date: Thursday, October 17, 2013 10:25 AM To: Chan Ching Yu Patrick mailto:cyc...@clustertech.com>> Cc: Weilin Chang mailto:weilin.ch...@huawei.com>>, "lustre-discuss@lists.lustre.org" mailto:lustre-discu

Re: [Lustre-discuss] OSS misconfig and client connect

2013-07-31 Thread White, Cliff
On 7/31/13 10:37 AM, "James Robnett" wrote: > >I'm now suspicious that I need to unmount all the OSSes (for >correctness), unmount the MDS and run > >tunefs.lustre --writeconf /dev/md0 > >on it to clear the logs and then remount. > >Note we have a combined MDS/MGS. Yes. Since the configuration i

Re: [Lustre-discuss] Small files

2013-07-03 Thread White, Cliff
Worth noting – unless your IO requirements are quite strict, in most cases you won't need a large number of striping policies. The 'best' stripe for any large IO task is usually dependent on the particular hardware/network/workload involved. If you are saturating part of the system, such as netw

Re: [Lustre-discuss] Lustre over two TCP interfaces

2013-06-25 Thread White, Cliff
From: Alfonso Pardo mailto:alfonso.pa...@ciemat.es>> Date: Tuesday, June 25, 2013 5:22 AM To: Michael Shuey mailto:sh...@purdue.edu>> Cc: WC-Discuss mailto:wc-discuss.migrat...@intel.com>>, "lustre-discuss@lists.lustre.org" mailto:lustre-discuss@lists.lust

Re: [Lustre-discuss] Using NFS to mount lustre

2013-06-21 Thread White, Cliff
From: Teik Hooi Beh mailto:th...@thbeh.com>> Date: Friday, June 21, 2013 1:29 AM To: Parinay Kondekar mailto:parinay_konde...@xyratex.com>> Cc: "lustre-discuss@lists.lustre.org" mailto:lustre-discuss@lists.lustre.org>> Subject: Re: [Lustre-discuss] Using NF

Re: [Lustre-discuss] lustre-tests and lustre-iokit

2013-06-20 Thread White, Cliff
Forgot one thing. In Lustre 2.1.5 the iokit scripts are in the 'lustre' rpm and are installed in /usr/bin The kit was moved to a separate RPM for later releases. cliffw From: "", Patrick mailto:cyc...@clustertech.com>> Date: Thursday, June 20, 2013 3:51 PM To: "lustre-discuss@lists.lustre.org

Re: [Lustre-discuss] lustre-tests and lustre-iokit

2013-06-20 Thread White, Cliff
Lustre-tests contains all the scripts that are used to test the Lustre code base. Very low-level stuff. It is in a separate RPM, as most users have no need of it. It will create /usr/lib64/lustre/tests. Lustre-iokit is a set of shell scripts that are used to test a Lustre file system for perform

Re: [Lustre-discuss] Performance Question

2013-04-17 Thread White, Cliff
That's a real 'it depends' kind of question :) If you are currently maxing out the server hardware in some fashion, then yes, provided your network has the capacity. However, performance can also be bottlenecked by network and client capacity. It should be fairly easy to determine how busy your cur

Re: [Lustre-discuss] LNET over multiple NICs

2012-12-20 Thread White, Cliff
On 12/19/12 2:03 PM, "Alexander Oltu" wrote: > >> I have no experience in doing multirail on ethernet, sorry. The >> principle is exactly the same as for Infiniband, but as Infiniband >> interfaces cannot be bonded (except for IPoIB which is not of >> interest when considering performance), I