We set the debug mode to 2 to get extra data, but so far I haven't seen
anything that sticks out. I always get an exit code of 0 on 'syncfiles',
but I'll dig around some more. Also, there are no service nodes on the
cluster.
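
For anyone repeating the check described in the quoted messages below (files showing up in /etc/ but not in /mnt/sysimage/etc/ during install), here is a minimal sketch of how it could be scripted from a long-sleeping postscript. The function name `check_synced` and the file names in the usage line are hypothetical and are not part of xCAT.

```shell
#!/bin/sh
# Hypothetical helper (not an xCAT command): report files that exist under
# the live root but are missing under the install target root.
# Usage: check_synced LIVE_ROOT TARGET_ROOT RELATIVE_PATH...
check_synced() {
    live_root=$1
    target_root=$2
    shift 2
    missing=0
    for f in "$@"; do
        # Only flag files that were actually written to the live root.
        if [ -e "$live_root/$f" ] && [ ! -e "$target_root/$f" ]; then
            echo "MISSING in target: $f"
            missing=$((missing + 1))
        fi
    done
    # Succeed only when nothing is missing from the target root.
    [ "$missing" -eq 0 ]
}
```

During the install this could be called as `check_synced "" /mnt/sysimage etc/hosts etc/resolv.conf`, substituting the paths from your own synclist. It only compares file presence and does not explain why the files land in the wrong root.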
Thanks,
Brian Joiner
On Sun, Aug 12, 2018 at 10:05 PM, Song BJ Yang <yang...@cn.ibm.com> wrote:
> hi Brian,
>
> Good catch. Based on the description, I think it may be similar to the
> issue described in that ticket.
>
> 2 hints:
>
> 1. Would you please take a look at the file `/var/log/xcat/xcat.log` on
> the compute node after reboot? This file contains some logs from the
> postscripts. If the information is not sufficient to pinpoint the real
> cause, you can enable `site.xcatdebugmode` with `chdef -t site -o
> clustersite xcatdebugmode=1` and reprovision the node to get more verbose
> information.
>
> 2. Is it a hierarchical cluster with service nodes? If yes, have you
> upgraded xCATsn on the SN?
>
> best regards
> ------------------------------------------------------------------------------
> YANG Song (杨嵩)
> IBM China System Technology Laboratory
> Tel: 86-10-82452903
> Email: yang...@cn.ibm.com
> Address: Building 28, ZhongGuanCun Software Park,
> No.8, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC
>
>
>
> ----- Original message -----
> From: Brian Joiner <martinitime1...@gmail.com>
> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
> Cc:
> Subject: Re: [xcat-user] Syncfiles getting deleted after reboot....
> Date: Sat, Aug 11, 2018 7:31 AM
>
> Yang Song,
>
> Could it be related to https://sourceforge.net/p/xcat/bugs/4579/ ?
>
> I checked the remoteshell script and it has /usr/sbin/sshd at the bottom,
> so I'm not sure what's happening.
>
> On Fri, Aug 10, 2018 at 1:22 PM, Brian Joiner <martinitime1...@gmail.com>
> wrote:
>
> Song,
>
> Yes there are otherpkgs that run after the reboot. I'll try to get the OS
> and node defs to you at some point today. The client did respond back
> after doing some investigation of his own:
>
> "..it appears that the script is running without a chroot and writing the
> files to the genimage transient file system." I did a further test by
> creating a 600-second sleep postscript, and found that the syncfiles are in
> /etc/ but not in /mnt/sysimage/etc/.
>
> I put a test file in /mnt/sysimage/etc/ and it survived the reboot. None
> of the 'syncfiles' were there, but my test file was.
>
> Why would syncfiles not write to the correct directory during deployment?
> I'm concerned that something didn't go right during the upgrade.
>
>
> Thanks,
> Brian Joiner
>
> On Thu, Aug 9, 2018 at 10:10 PM, Song BJ Yang <yang...@cn.ibm.com> wrote:
>
> Hi Brian Joiner,
>
> Are there any packages specified in `otherpkglist` and `otherpkgdir`, which
> will be installed by `otherpkgs` during the post-installation reboot?
>
> Would you please provide the osimage definition and node definition? Thanks.
>
> ----- Original message -----
> From: Brian Joiner <martinitime1...@gmail.com>
> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
> Cc:
> Subject: [xcat-user] Syncfiles getting deleted after reboot....
> Date: Wed, Aug 8, 2018 8:31 AM
>
>
> Hardware: Dell
> Deployment OS: RHEL 6.8
> No changes were made to OS definition, other than adding the syncfile list
>
>
> Our client upgraded their xCAT version from 2.7 to 2.14.1 and we're seeing
> some bizarre behavior when deploying the nodes.
>
> Just to make everything as clean as possible, I created a separate group
> with no postscripts (so only the default postscripts run), and removed all
> other postscripts and postbootscripts from the node definition.
>
> What's happening is: during initial deployment, 'syncfiles' copies over the
> files, and I've verified with 'ls' that they exist. Then the normal
> post-install reboot occurs. After the reboot, all of the synced files are
> GONE: multiple files, in multiple directories (mostly in /etc). I even
> created a dummy test file to make sure, and it's there during install but
> not after the reboot. Syncfiles always exits with 0.
>
> updatenode -F will resync the files, and they survive a reboot.
>
> This problem is so strange, I've never seen anything like it. Any ideas?
>
> --
> Brian Joiner
> ------------------------------------------------------------------------------
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, Slashdot.org! http://sdm.link/slashdot
> _______________________________________________
> xCAT-user mailing list
> xCAT-user@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/xcat-user