I have an x86_64 management node running CentOS 7.9 and most of my cluster is x86_64 nodes running the same OS. I’m trying to test RHEL8 by installing it on one of our ppc64le nodes, but I’ve found that the remoteshell postscript is failing to install the correct SSH hostkeys on the node during installation and when run manually after the node comes up after the install completes.
I’ve enabled xcatdebugmode and this is what I see in the logs from the install when that postscript runs: Mar 16 14:09:40 m002 xcat.deployment.postscript INFO Running postscript: remoteshell Mar 16 14:09:40 m002 xcat[36244]: INFO Install: rsyslog version 8 setup Mar 16 14:09:40 m002 xcat[36268]: INFO remoteshell: setup /etc/ssh/sshd_config and ssh_config Mar 16 14:09:40 m002 xcat[36273]: INFO Install: setup root .ssh Mar 16 14:09:41 m002 xcat[36280]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:41 m002 xcat[36310]: INFO remoteshell:xcatflowrequest received response return=0 Mar 16 14:09:41 m002 xcat[36324]: INFO remoteshell: getting ssh_host_dsa_key Mar 16 14:09:41 m002 xcat[36326]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:41 m002 xcat[36356]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:41 m002 xcat[36368]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:42 m002 xcat[36398]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:42 m002 xcat[36410]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:42 m002 xcat[36440]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:42 m002 xcat[36452]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:42 m002 xcat[36482]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:42 m002 xcat[36494]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:42 m002 xcat[36524]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:42 m002 xcat[36536]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:42 m002 xcat[36566]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:42 m002 xcat[36578]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:42 m002 xcat[36608]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:42 m002 xcat[36620]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:42 m002 xcat[36650]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:42 m002 xcat[36662]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:43 m002 xcat[36692]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:43 m002 xcat[36710]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:43 m002 xcat[36740]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:43 m002 xcat[36754]: INFO ssh_rsa_hostkey Mar 16 14:09:43 m002 xcat[36756]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:43 m002 xcat[36786]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:43 m002 xcat[36798]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:43 m002 xcat[36828]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:43 m002 xcat[36840]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:43 m002 xcat[36870]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:43 m002 xcat[36882]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:43 m002 xcat[37017]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:43 m002 xcat[37029]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:43 m002 xcat[37143]: INFO remoteshell:xcatflowrequest return=0 Mar 16 14:09:44 m002 xcat[37155]: INFO remoteshell: sending xcatflowrequest 172.18.10.201 3001 Mar 16 14:09:44 m002 xcat[37520]: INFO remoteshell:xcatflowrequest return=2 Mar 16 14:09:44 m002 xcat[37521]: INFO remoteshell: error from xcatflowrequest, will not use flow control Mar 16 14:10:39 m002 xcat[37616]: INFO ssh_ecdsa_hostkey Mar 16 14:12:56 m002 xcat[37796]: INFO remoteshell: gathering ssh_root_pub_key Mar 16 14:12:56 m002 xcat[37802]: INFO ssh_root_pub_key Mar 16 14:15:28 m002 xcat[38065]: INFO remoteshell:sshbetweennodes is yes Mar 16 14:15:28 m002 xcat[38076]: INFO remoteshell: gathering ssh_root_key Mar 16 14:15:28 m002 xcat[38080]: INFO ssh_root_key Mar 16 14:18:02 m002 xcat.deployment.postscript INFO postscript remoteshell return with 0 It looks to me like it has the correct return code, but the hostkey files are not correct after reboot. I found an old post that suggested to run the getcredentials.awk script manually after starting the miniserver on the compute node: /xcatpost/allowcred.awk & USEOPENSSLFORXCAT=yes XCATSERVER=172.18.10.201:3001 /xcatpost/getcredentials.awk ssh_rsa_hostkey If I do that from an x86_64 CentOS 7.9 node it returns output that includes a hostkey, but if I run the same thing from my ppc64le RHEL 8.2 node it returns no data, but has an exitcode of 0. Any thoughts on what might be wrong or what else I can check in order to fix this? Mike Robbert Cyberinfrastructure Specialist, Cyberinfrastructure and Advanced Research Computing Information and Technology Solutions (ITS) 303-273-3786 | mrobb...@mines.edu Our values: Trust | Integrity | Respect | Responsibility
smime.p7s
Description: S/MIME cryptographic signature
_______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xcat-user