The node successfully gets its IP from DHCP at bootup. I can verify this with Wireshark, but I don’t believe it is a DHCP issue. I tried two different nodes while in this state and both exhibit the same behavior (getting stuck on the HTTP GET). I found someone with a similar issue with the web server, but unfortunately there was no fix: https://stackoverflow.com/questions/21849794/apache2-runs-fine-for-a-while-then-stops-serving-content-error-when-restarting
-Keith
____________________
Keith Hannum
keith.han...@lmco.com

From: Mark Gurevich <gurev...@us.ibm.com>
Sent: Tuesday, January 22, 2019 9:49 AM
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Subject: EXTERNAL: Re: [xcat-user] xCat Diskless booting - http server doesn't respond

Could it be that the DHCP lease for this node has expired after a few days? While this node is hanging, trying to get the image from the server, are you able to provision a different diskless node with the same OS image?

Mark Gurevich
Poughkeepsie Development Lab
HPC Software Development - xCAT
"If we knew what it was we were doing, it would not be called research, would it?" --Albert Einstein

From: "Hannum, Keith" <keith.han...@lmco.com>
To: "xCAT-user@lists.sourceforge.net" <xCAT-user@lists.sourceforge.net>
Date: 01/21/2019 07:40 PM
Subject: [xcat-user] xCat Diskless booting - http server doesn't respond
________________________________

I have xCAT deployed on a CentOS 6.8 node as my management node. The nodes were booting fine, but some changes were made to the server. I don’t have the details of the changes, but I am having an issue with xCAT now. If the server is left running for a few days and a diskless node is then rebooted, the node can no longer get its image over HTTP. It hangs forever while getting the image, printing dots after the HTTP GET, i.e. http://172.20.0.11/tftpboot/xcat/xnba/nodes/..... and the dots go on forever. If I reboot the management node, the diskless clients can immediately get their image, but after a few days it stops working again. The httpd service is running.
I'm not sure what to check for debugging and was hoping for some direction on where to start troubleshooting. Thanks.
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user