I have an x86_64 management node running CentOS 7.9 and most of my cluster is 
x86_64 nodes running the same OS. I’m trying to test RHEL8 by installing it on 
one of our ppc64le nodes, but I’ve found that the remoteshell postscript is 
failing to install the correct SSH hostkeys on the node during installation and 
when run manually after the node comes up after the install completes. 

I’ve enabled xcatdebugmode and this is what I see in the logs from the install 
when that postscript runs:

 

Mar 16 14:09:40 m002 xcat.deployment.postscript INFO Running postscript: 
remoteshell

Mar 16 14:09:40 m002 xcat[36244]: INFO  Install: rsyslog version 8 setup

Mar 16 14:09:40 m002 xcat[36268]: INFO  remoteshell:  setup 
/etc/ssh/sshd_config and ssh_config

Mar 16 14:09:40 m002 xcat[36273]: INFO  Install: setup root .ssh

Mar 16 14:09:41 m002 xcat[36280]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:41 m002 xcat[36310]: INFO  remoteshell:xcatflowrequest received 
response return=0

Mar 16 14:09:41 m002 xcat[36324]: INFO  remoteshell: getting ssh_host_dsa_key

Mar 16 14:09:41 m002 xcat[36326]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:41 m002 xcat[36356]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:41 m002 xcat[36368]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:42 m002 xcat[36398]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:42 m002 xcat[36410]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:42 m002 xcat[36440]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:42 m002 xcat[36452]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:42 m002 xcat[36482]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:42 m002 xcat[36494]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:42 m002 xcat[36524]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:42 m002 xcat[36536]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:42 m002 xcat[36566]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:42 m002 xcat[36578]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:42 m002 xcat[36608]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:42 m002 xcat[36620]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:42 m002 xcat[36650]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:42 m002 xcat[36662]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:43 m002 xcat[36692]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:43 m002 xcat[36710]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:43 m002 xcat[36740]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:43 m002 xcat[36754]: INFO  ssh_rsa_hostkey

Mar 16 14:09:43 m002 xcat[36756]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:43 m002 xcat[36786]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:43 m002 xcat[36798]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:43 m002 xcat[36828]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:43 m002 xcat[36840]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:43 m002 xcat[36870]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:43 m002 xcat[36882]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:43 m002 xcat[37017]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:43 m002 xcat[37029]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:43 m002 xcat[37143]: INFO  remoteshell:xcatflowrequest return=0

Mar 16 14:09:44 m002 xcat[37155]: INFO  remoteshell: sending xcatflowrequest 
172.18.10.201 3001

Mar 16 14:09:44 m002 xcat[37520]: INFO  remoteshell:xcatflowrequest return=2

Mar 16 14:09:44 m002 xcat[37521]: INFO  remoteshell: error from 
xcatflowrequest, will not use flow control

Mar 16 14:10:39 m002 xcat[37616]: INFO  ssh_ecdsa_hostkey

Mar 16 14:12:56 m002 xcat[37796]: INFO  remoteshell: gathering ssh_root_pub_key

Mar 16 14:12:56 m002 xcat[37802]: INFO  ssh_root_pub_key

Mar 16 14:15:28 m002 xcat[38065]: INFO  remoteshell:sshbetweennodes is yes

Mar 16 14:15:28 m002 xcat[38076]: INFO  remoteshell: gathering ssh_root_key

Mar 16 14:15:28 m002 xcat[38080]: INFO  ssh_root_key

Mar 16 14:18:02 m002 xcat.deployment.postscript INFO postscript remoteshell 
return with 0

 

It looks to me like it has the correct return code, but the hostkey files are 
not correct after reboot.

 

I found an old post that suggested to run the getcredentials.awk script 
manually after starting the miniserver on the compute node:

/xcatpost/allowcred.awk &

USEOPENSSLFORXCAT=yes XCATSERVER=172.18.10.201:3001 
/xcatpost/getcredentials.awk ssh_rsa_hostkey

 

If I do that from an x86_64 CentOS 7.9 node it returns output that includes a 
hostkey, but if I run the same thing from my ppc64le RHEL 8.2 node it returns 
no data, but has an exitcode of 0. 

Any thoughts on what might be wrong or what else I can check in order to fix 
this?

 

Mike Robbert

Cyberinfrastructure Specialist, Cyberinfrastructure and Advanced Research 
Computing

Information and Technology Solutions (ITS)

303-273-3786 | mrobb...@mines.edu  

Our values: Trust | Integrity | Respect | Responsibility

Attachment: smime.p7s
Description: S/MIME cryptographic signature

_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user

Reply via email to