Anything in dmesg? We need to know _why_ the network failed to start.

Chris Horn

From: Kurt Strosahl <[email protected]>
Date: Wednesday, October 2, 2019 at 1:55 PM
To: Chris Horn <[email protected]>, "[email protected]" <[email protected]>
Subject: Re: [lustre-discuss] Lustre rpm install creating a file that breaks lustre

The lnet modules load, but when I start the lnet service it says that the network is down. I backed everything out, removed the file, and then started the lnet service again, and it worked properly.

________________________________
From: Chris Horn <[email protected]>
Sent: Wednesday, October 2, 2019 2:48 PM
To: Kurt Strosahl <[email protected]>; [email protected] <[email protected]>
Subject: [EXTERNAL] Re: [lustre-discuss] Lustre rpm install creating a file that breaks lustre

Might be best to open a ticket for this. What was the nature of the failure?

Chris Horn

From: lustre-discuss <[email protected]> on behalf of Kurt Strosahl <[email protected]>
Date: Wednesday, October 2, 2019 at 1:30 PM
To: "[email protected]" <[email protected]>
Subject: [lustre-discuss] Lustre rpm install creating a file that breaks lustre

Good Afternoon,

While getting Lustre 2.10.8 running on a RHEL 7.7 system, I found that the RPM install was putting a file in /etc/modprobe.d that was preventing lnet from starting properly. The file is ko2iblnd.conf, which contains the following:

    alias ko2iblnd-opa ko2iblnd
    options ko2iblnd-opa peer_credits=128 peer_credits_hiw=64 credits=1024 concurrent_sends=256 ntx=2048 map_on_demand=32 fmr_pool_size=2048 fmr_flush_trigger=512 fmr_cache=1 conns_per_peer=4
    install ko2iblnd /usr/sbin/ko2iblnd-probe

Our system is running InfiniBand, not Omni-Path, so I'm not sure why this file is being put in place. Removing the file allows lnet to start properly.

w/r,
Kurt J. Strosahl
System Administrator: Lustre, HPC
Scientific Computing Group, Thomas Jefferson National Accelerator Facility
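
For anyone reading this thread later, here is a minimal Python sketch of how the three directives in the quoted ko2iblnd.conf are read under the modprobe.d rules. The helper name parse_modprobe_conf is hypothetical and the parser is deliberately simplified (no line continuations, only the three keywords seen here). It says nothing about what /usr/sbin/ko2iblnd-probe does internally. It only shows that the "install" line makes modprobe run that script in place of a direct load of ko2iblnd, and that the tuning options are attached to the ko2iblnd-opa alias rather than to ko2iblnd itself.

```python
# The three directives quoted in this thread, reproduced verbatim.
CONF_TEXT = """\
alias ko2iblnd-opa ko2iblnd
options ko2iblnd-opa peer_credits=128 peer_credits_hiw=64 credits=1024 concurrent_sends=256 ntx=2048 map_on_demand=32 fmr_pool_size=2048 fmr_flush_trigger=512 fmr_cache=1 conns_per_peer=4
install ko2iblnd /usr/sbin/ko2iblnd-probe
"""


def parse_modprobe_conf(text):
    """Group alias/options/install directives by the name they apply to.

    Hypothetical helper for illustration: it handles only the three
    keywords used above and ignores backslash line continuations.
    """
    parsed = {"alias": {}, "options": {}, "install": {}}
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines and full-line comments.
        if not line or line.startswith("#"):
            continue
        fields = line.split(None, 2)
        if len(fields) < 3 or fields[0] not in parsed:
            continue
        keyword, name, rest = fields
        if keyword == "options":
            # Store the module parameters as a key -> value mapping.
            parsed[keyword][name] = dict(
                item.split("=", 1) for item in rest.split() if "=" in item
            )
        else:
            # alias: target module name; install: command run by modprobe.
            parsed[keyword][name] = rest
    return parsed


parsed = parse_modprobe_conf(CONF_TEXT)

for module, command in parsed["install"].items():
    print(f"modprobe {module}: runs '{command}' instead of a direct load")
for alias, target in parsed["alias"].items():
    print(f"{alias} is an alias for {target}")
for name, opts in parsed["options"].items():
    print(f"{name}: {len(opts)} options set")
```

Because every load of ko2iblnd goes through the install command, a failure inside that script would surface as an lnet start failure, which is one reason the dmesg output requested above is the useful next piece of evidence.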
_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
