Please show output of 'lsxcatd -a'.
 
Is there any error/warning message when running the remoteshell postscript?
 
Could you ssh to the compute node when the system is running Postscript.
 
As a workaround, you can move the postscript syncfiles to postbootscripts attribute.
 

Thanks
Best Regards
----------------------------------------------------------------------
Wang Xiaopeng (王晓朋)
IBM China System Technology Laboratory
Tel: 86-10-82453455
Email: w...@cn.ibm.com
Address: 28,ZhongGuanCun Software Park,No.8 Dong Bei Wang West Road, Haidian District Beijing P.R.China 100193
 
 
----- Original message -----
From: "Dr. Markus Hillenbrand" <hille...@rhrk.uni-kl.de>
To: "xcat-user@lists.sourceforge.net" <xcat-user@lists.sourceforge.net>
Cc:
Subject: [xcat-user] Postscripts during stateful installation of SL7.2 not working
Date: Tue, Apr 26, 2016 11:27 PM
 
Hi,

I am trying to deploy compute nodes with a stateful installation of SL7.2.

The packages are installing fine and all but one postscripts are being
executed.

The "syncfiles" script is not working correctly and issues some SSH errors:

Apr 26 15:23:49 node802 xcat: ++ /xcatpost/startsyncfiles.awk
Apr 26 15:23:49 hpcmanage2 xcat[8514]: DEBUG xcatd: connection from node802
Apr 26 15:23:49 hpcmanage2 xcat[8514]: DEBUG xcatd: open new process :
xCATd SSL: syncfiles for node802
Apr 26 15:23:49 hpcmanage2 xcat[8514]: xCAT: Allowing syncfiles from node802
Apr 26 15:23:49 node802 sshd[52987]: error: Could not load host key:
/etc/ssh/ssh_host_dsa_key
Apr 26 15:23:49 node802 sshd[52987]: WARNING: /etc/ssh/moduli does not
exist, using fixed modulus
Apr 26 15:23:49 node802 sshd[52987]: Accepted none for root from
10.255.3.206 port 43852 ssh2
Apr 26 15:23:49 node802 sshd[52987]: Received disconnect from
10.255.3.206: 11: disconnected by user
[...]
Apr 26 15:23:54 node802 sshd[53038]: WARNING: /etc/ssh/moduli does not
exist, using fixed modulus
Apr 26 15:23:54 node802 sshd[53038]: Accepted none for root from
10.255.3.206 port 43905 ssh2
Apr 26 15:23:55 node802 sshd[53038]: Received disconnect from
10.255.3.206: 11: disconnected by user
Apr 26 15:23:55 hpcmanage2 xcat[8514]: DEBUG xcatd: close connection
with node802
Apr 26 15:23:55 node802 xcat: + returncode=0
Apr 26 15:23:55 node802 xcat: + '[' 0 -eq 0 ']'
Apr 26 15:23:55 node802 xcat: + quit=yes
Apr 26 15:23:55 node802 xcat: + let count=count-1
Apr 26 15:23:55 node802 xcat: + '[' yes = no ']'
Apr 26 15:23:55 node802 xcat: + '[' 0 -eq 0 ']'
Apr 26 15:23:55 node802 xcat: + logger -t xcat -p local4.info
'./syncfiles: Perform Syncing File action successfully'
Apr 26 15:23:55 node802 xcat: ./syncfiles: Perform Syncing File action
successfully
Apr 26 15:23:55 node802 xcat: + exit 0
Apr 26 17:23:55 node802 xcat: Tue Apr 26 17:23:55 CEST 2016 postscript
syncfiles return with 0

I guess our xCAT master (10.255.3.206) is not able to ssh to the compute
node during installation.

After the installation process a "updatenode node802 -F" will sync all
the files normally.

My osimage looks like

# lsdef -t osimage Elwe-SL7.2-compute
Object name: Elwe-SL7.2-compute
     imagetype=linux
     osarch=x86_64
     osdistroname=SL7.2-x86_64
     osname=Linux
     osvers=SL7.2
     otherpkgdir=/install/post/otherpkgs/SL7.2/x86_64
     pkgdir=/install/SL7.2/x86_64
     pkglist=/install/rhrk/xcat/Elwe-SL7/compute.pkglist
     profile="">     provmethod=install
     synclists=/install/rhrk/xcat/Elwe-SL7/compute.synclist
     template=/install/rhrk/xcat/Elwe-SL7/compute.tmpl

We are using xCAT-2.11.1-snap201604140932.x86_64 on RHEL6.7.

With SL6.7 this was working fine, but now I am struck and will be
grateful for any help.



Markus

 
------------------------------------------------------------------------------
Find and fix application performance issues faster with Applications Manager
Applications Manager provides deep performance insights into multiple tiers of
your business applications. It resolves application problems quickly and
reduces your MTTR. Get your free trial!
https://ad.doubleclick.net/ddm/clk/302982198;130105516;z
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user

------------------------------------------------------------------------------
Find and fix application performance issues faster with Applications Manager
Applications Manager provides deep performance insights into multiple tiers of
your business applications. It resolves application problems quickly and
reduces your MTTR. Get your free trial!
https://ad.doubleclick.net/ddm/clk/302982198;130105516;z
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user

Reply via email to