Re: [gpfsug-discuss] nodes being ejected out of the cluster

2017-01-11 Thread Jan-Frode Myklebust
pport says so > in your PMR), so if that's been the recommendation, I suggest you look at > it. > > Changelog on ESS 4.0.4 (no idea what ESS level you are running) > > > c) Support of MLNX_OFED_LINUX-3.2-2.0.0.1 > - Updated from MLNX_OFED_LINUX-3.1-1.0.6.1 (ESS 4.0,

Re: [gpfsug-discuss] nodes being ejected out of the cluster

2017-01-11 Thread Damir Krstic
s you should do not update those solo (unless support says so > in your PMR), so if that's been the recommendation, I suggest you look at > it. > > Changelog on ESS 4.0.4 (no idea what ESS level you are running) > > > c) Support of MLNX_OFED_LINUX-3.2-2.0.0.1 > - Updated from M

Re: [gpfsug-discuss] nodes being ejected out of the cluster

2017-01-11 Thread Jan-Frode Myklebust
the IPoIB in connected > mode or datagram ... but as I said, please discuss this within the PMR .. > there are to much dependencies to discuss this here .. > > > cheers > > > Mit freundlichen Grüßen / Kind regards > > > Olaf Weiser > > EMEA Storage Competence C

Re: [gpfsug-discuss] nodes being ejected out of the cluster

2017-01-11 Thread Damir Krstic
5 <+358%2050%203112585> > > "If you continually give you will continually have." Anonymous > > > > - Original message - > From: "Olaf Weiser" <olaf.wei...@de.ibm.com> > Sent by: gpfsug-discuss-boun...@spectrumscale.org > To: gpfsug main discussion

Re: [gpfsug-discuss] nodes being ejected out of the cluster

2017-01-11 Thread Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]
. Do you have a subnet defined for your IPoIB network or are your nodes daemon interfaces already set to their IPoIB interface? Have you checked your SM logs? From: Damir Krstic Sent: 1/11/17, 9:39 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] nodes being ejected out

Re: [gpfsug-discuss] nodes being ejected out of the cluster

2017-01-11 Thread Olaf Weiser
most likely, there's smth wrong with your IB fabric ... you say, you run ~ 700 nodes ? ...Are you running with verbsRdmaSendenabled ? ,if so, please consider to disable  - and discuss this within the PMR another issue, you may check is  - Are you running the IPoIB in connected mode or datagram ...