I admit I didn’t do a whole lot of troubleshooting. We don’t run NFS so I can’t 
speak about that.

Initially the server looked like it came back OK, although “Node starting up..” 
was shown in the output of mmlscluster --ces. At that time I was not sure whether 
that was a) expected behaviour and/or b) related to GPFS 4.2.1-2.

Once the node went back into service I had no complaints from customers about 
connectivity issues. The next morning I shut down a second CES node in order 
to upgrade it, but I observed that the first one went into a failed state 
(might have been a nasty coincidence!):

[root@icgpfs-ces1 yum.repos.d]# mmces state show -a
NODE          AUTH      AUTH_OBJ   NETWORK     NFS        OBJ        SMB       CES
icgpfs-ces1   FAILED    DISABLED   HEALTHY     DISABLED   DISABLED   DEPEND    STARTING
icgpfs-ces2   DEPEND    DISABLED   SUSPENDED   DEPEND     DEPEND     DEPEND    DEPEND
icgpfs-ces3   HEALTHY   DISABLED   HEALTHY     DISABLED   DISABLED   HEALTHY   HEALTHY
icgpfs-ces4   HEALTHY   DISABLED   HEALTHY     DISABLED   DISABLED   HEALTHY   HEALTHY

(icgpfs-ces1 was the node running RHEL 7.3.)
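A check like the one above can be scripted before touching the next node. The following is only a minimal sketch: it assumes the `mmces state show -a` output has been captured as text with one unwrapped row per node, and the function names (`parse_ces_state`, `unhealthy_nodes`) are hypothetical helpers, not part of any GPFS tooling. Treating HEALTHY and DISABLED as acceptable states is also an assumption.

```python
# Sketch: parse captured "mmces state show -a" output and list the
# components on each node that are neither HEALTHY nor DISABLED.
SAMPLE = """\
NODE          AUTH      AUTH_OBJ   NETWORK     NFS        OBJ        SMB       CES
icgpfs-ces1   FAILED    DISABLED   HEALTHY     DISABLED   DISABLED   DEPEND    STARTING
icgpfs-ces2   DEPEND    DISABLED   SUSPENDED   DEPEND     DEPEND     DEPEND    DEPEND
icgpfs-ces3   HEALTHY   DISABLED   HEALTHY     DISABLED   DISABLED   HEALTHY   HEALTHY
icgpfs-ces4   HEALTHY   DISABLED   HEALTHY     DISABLED   DISABLED   HEALTHY   HEALTHY
"""


def parse_ces_state(text):
    """Return {node: {component: state}} from whitespace-separated rows."""
    rows = [line.split() for line in text.splitlines() if line.strip()]
    # Locate the header row; anything before it (e.g. a shell prompt) is ignored.
    header_index = next(i for i, fields in enumerate(rows) if fields[0] == "NODE")
    columns = rows[header_index][1:]
    states = {}
    for fields in rows[header_index + 1:]:
        if len(fields) != len(columns) + 1:
            continue  # skip wrapped or malformed rows rather than guess
        states[fields[0]] = dict(zip(columns, fields[1:]))
    return states


def unhealthy_nodes(states, acceptable=("HEALTHY", "DISABLED")):
    """Return only the nodes that have at least one component in another state."""
    result = {}
    for node, components in states.items():
        bad = {name: state for name, state in components.items()
               if state not in acceptable}
        if bad:
            result[node] = bad
    return result


if __name__ == "__main__":
    for node, bad in unhealthy_nodes(parse_ces_state(SAMPLE)).items():
        print(node, bad)
```

Run against the output above, this flags icgpfs-ces1 and icgpfs-ces2 and leaves icgpfs-ces3 and icgpfs-ces4 out of the result.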

Also, in mmces event show -N icgpfs-ces1 --time day the following error was 
logged about twice per minute:

icgpfs-ces1  2016-12-06 06:32:04.968269 GMT  wnbd_restart  INFO  WINBINDD process was not running. Trying to start it
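To confirm a rate such as "about twice per minute", the event lines can be bucketed by minute. This is a rough sketch only: it assumes each event is on a single unwrapped line in the layout shown above (node, timestamp, timezone, event name, severity, message), and `parse_event` and `events_per_minute` are hypothetical helper names. The sample contains just the one line quoted in this message.

```python
import re
from collections import Counter
from datetime import datetime

# Layout assumed from the single line quoted above:
# node, "YYYY-MM-DD HH:MM:SS.ffffff", timezone, event, severity, message
EVENT_RE = re.compile(
    r"^(?P<node>\S+)\s+"
    r"(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d+)\s+"
    r"(?P<tz>\S+)\s+(?P<event>\S+)\s+(?P<severity>\S+)\s+(?P<message>.*)$"
)

SAMPLE_LINE = (
    "icgpfs-ces1  2016-12-06 06:32:04.968269 GMT  wnbd_restart  INFO  "
    "WINBINDD process was not running. Trying to start it"
)


def parse_event(line):
    """Return a dict of fields for one event line, or None if it does not match."""
    match = EVENT_RE.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    fields["ts"] = datetime.strptime(fields["ts"], "%Y-%m-%d %H:%M:%S.%f")
    return fields


def events_per_minute(lines):
    """Count occurrences of each (event name, minute) pair."""
    counts = Counter()
    for line in lines:
        event = parse_event(line)
        if event is not None:
            minute = event["ts"].replace(second=0, microsecond=0)
            counts[(event["event"], minute)] += 1
    return counts


if __name__ == "__main__":
    for (name, minute), count in sorted(events_per_minute([SAMPLE_LINE]).items()):
        print(minute.isoformat(), name, count)
```

Feeding a full day's output through `events_per_minute` would show whether the winbindd restarts are steady or arrive in bursts.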

I moved the CES IP from icgpfs-ces2 to icgpfs-ces3 prior to suspending -ces2.

It was at about that point that I decided to abandon the planned upgrade of 
-ces2, resume the node and then suspend -ces1.
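The move/suspend/resume steps described above can be written down as a dry-run plan before running anything. This is a sketch under assumptions: the `mmces address move`, `mmces node suspend` and `mmces node resume` invocations reflect the usual form of those subcommands but should be checked against the mmces documentation for your release, the IP address 192.0.2.10 is a placeholder (the real address is not given here), and `drain_plan` / `resume_plan` are hypothetical helper names. Nothing is executed; the commands are only printed.

```python
import shlex


def drain_plan(node, target, ces_ips):
    """Commands to move CES IPs off `node` onto `target`, then suspend `node`."""
    plan = [
        ["mmces", "address", "move", "--ces-ip", ip, "--ces-node", target]
        for ip in ces_ips
    ]
    plan.append(["mmces", "node", "suspend", "-N", node])
    return plan


def resume_plan(node):
    """Command to bring a suspended CES node back into service."""
    return [["mmces", "node", "resume", "-N", node]]


if __name__ == "__main__":
    # Placeholder IP: the actual CES address is not stated in this thread.
    steps = drain_plan("icgpfs-ces2", "icgpfs-ces3", ["192.0.2.10"])
    steps += resume_plan("icgpfs-ces2")
    steps += drain_plan("icgpfs-ces1", "icgpfs-ces3", [])
    for command in steps:
        print(shlex.join(command))  # dry run: print only, do not execute
```

Printing the plan first makes it easy to review the order of operations, which matters when one node is already in a failed state.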

Downgrading the kernel/OS/redhat-release RPMs back to 7.2 went well, but when 
I tried to start CES again the node reported “Node failed”. I then rebuilt it 
completely and restored it to the cluster, and it appears to be fine.

Sorry I can’t be any more specific than that but I hope it helps.

Thanks
Richard

From: [email protected] 
[mailto:[email protected]] On Behalf Of Ravi K Komanduri
Sent: 07 December 2016 06:46
To: [email protected]
Cc: [email protected]
Subject: Re: [gpfsug-discuss] CES ON RHEL7.3

Sobey,

Could you describe the problems you faced with the CES environment on RHEL 
7.3? Are they related to the kernel or to the Ganesha environment?

Your thoughts/inputs would help us fix them.

We are currently working on RHEL 7.3 support for the CES environment.

With Regards,
Ravi K Komanduri
GPFS team
IBM



From:        "Sobey, Richard A" <[email protected]>
To:        "'[email protected]'" <[email protected]>
Date:        12/07/2016 11:59 AM
Subject:        [gpfsug-discuss] CES ON RHEL7.3
Sent by:        [email protected]
________________________________



A word of wisdom: do not try to run CES on RHEL 7.3 ☺ Although it appears to 
work, a few things break and it becomes a bit unpredictable, as I found out the 
hard way. I didn’t intend to run 7.3, of course, as I knew it wasn’t supported.

Richard
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

