Re: [xcat-user] Confignetworks and default route

2018-01-09 Thread Yuan Y Bai
 
I missed one line in my last mail:

confignetwork migrates routes to the new interface. You can also use nicextraparams in the nics table to customize settings for a specific interface; the nicextraparams content is added to the ifcfg-xxx file.
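As a rough illustration of the nicextraparams suggestion, the sketch below only prints a chdef command for review and runs nothing. The helper name, the node name (cn1), the interface (bond0) and the MTU setting are placeholders rather than values from this thread, and the nicextraparams.<nic> attribute form is assumed from the xCAT nics table documentation, so check it against your xCAT version.

```shell
# Hypothetical helper: print the chdef command that would set
# nicextraparams for one interface of one node. Nothing is executed.
nicextraparams_cmd() {
    node=$1      # xCAT node name (placeholder)
    nic=$2       # interface name as it appears in the nics table
    params=$3    # KEY=VALUE text to be appended to the ifcfg file
    printf 'chdef %s nicextraparams.%s="%s"\n' "$node" "$nic" "$params"
}
```

For example, `nicextraparams_cmd cn1 bond0 MTU=9000` prints `chdef cn1 nicextraparams.bond0="MTU=9000"`. The node would presumably need confignetwork rerun before the ifcfg file picks the setting up.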
 
 
Best Regards
--
Yuan Bai (白媛)
CSTL HPC System Management Development
Tel: 86-10-82451401
E-mail: by...@cn.ibm.com
Address: IBM ZGC Campus, Ring Building 28, ZhongGuanCun Software Park, No.8 Dong Bei Wang West Road, Haidian District, Beijing, P.R. China 100193
IBM环宇大厦 北京市海淀区东北旺西路8号, 中关村软件园28号楼 邮编: 100193
 
 
 


--
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] Debugmode solves stateful installation hang after postscripts

2018-01-09 Thread Casandra H Qiu

We had a similar issue (the reboot hung after the postscripts) on an x86
system; changing the boot mode to "Legacy Only" fixed it.

I will ask a teammate to look into this as well. You can follow this issue
on GitHub:

https://github.com/xcat2/xcat-core/issues/4604


Thanks,
Casandra
...
Casandra Hong Qiu
Phone: (845) 433-9291, t/l 293-9291
Office: Building 8, 3-B-04
cxh...@us.ibm.com






Re: [xcat-user] Confignetworks and default route

2018-01-09 Thread Yuan Y Bai
Hi Nathan,
 
Thanks Russ.
We cannot configure a gateway in the nics table. You can configure a gateway in the networks table for specific networks; confignetwork also uses the gateway from the networks table, but confignetworks
If you want to configure the default gateway as a static gateway, you can use the makeroutes command or the setroute script after running confignetwork. Here is my draft doc for these two: https://github.com/xcat2/xcat-core/pull/4580/commits/e9fd1c9e345997e409b54229be159bafadc3de73
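The sequence described above might look roughly like the sketch below, which prints the commands instead of running them. The helper name, the route object name (defaultroute), the mask, and every argument are illustrative assumptions; the route attributes and the `makeroutes -n` form follow my reading of the xCAT routes table and makeroutes documentation and should be verified against the draft doc linked above.

```shell
# Hypothetical helper: print (do not run) commands for a static default route.
default_route_cmds() {
    net=$1    # name of the network object in the networks table
    gw=$2     # gateway IP address
    node=$3   # node or noderange to apply the route to
    # The gateway is stored in the networks table, not the nics table.
    printf 'chdef -t network -o %s gateway=%s\n' "$net" "$gw"
    # Define a route object and attach it to the node.
    printf 'chdef -t route defaultroute net=default mask=255.0.0.0 gateway=%s\n' "$gw"
    printf 'chdef %s routenames=defaultroute\n' "$node"
    # Apply once confignetwork has finished on the node.
    printf 'makeroutes -n %s\n' "$node"
}
```

Printing first makes it easy to review the four commands before pasting them on the management node. Running setroute as a postscript would be the alternative to the final makeroutes line.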
 
 


Re: [xcat-user] Confignetworks and default route

2018-01-09 Thread Russ Auld
The gateway field should be used to set the default route. Make sure there's just one gateway set if you use multiple NICs, otherwise the last one will win.
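One way to check the "just one gateway" advice is to count how many network objects carry a gateway value. The sketch below is a filter over text on stdin; it assumes the input looks like `lsdef -t network -i gateway` output, with one indented `gateway=...` line per object. That layout is an assumption, and a cluster-wide count is only a rough proxy for the networks a given node actually sits on.

```shell
# Hypothetical filter: count network objects with a non-empty gateway.
# Expects lsdef-style "    gateway=VALUE" lines on stdin.
count_gateways() {
    awk -F= '/^[ \t]*gateway=/ && $2 != "" { n++ } END { print n + 0 }'
}
```

Possible use on the management node: `lsdef -t network -i gateway | count_gateways`. A result above 1 suggests reviewing which networks should really carry a gateway.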



[xcat-user] How to inject updates to osimages for stateless nodes?

2018-01-09 Thread Kevin Keane
Generally speaking, osimages are created with the copycds command from an
ISO image.

This will, of course, create an image that is based on the snapshot when
the OS first came out.

With the Meltdown and Spectre vulnerabilities, I would like to update my
image with all the latest packages. We are currently using RH 6.7, and I
want to update it to RH 6.9 plus all the latest updates.

What is the recommended method to do that?

Thanks!

-- 

___
Kevin Keane | Systems Architect | University of San Diego ITS |
kke...@sandiego.edu
Maher Hall, 192 | 5998 Alcalá Park | San Diego, CA 92110-2492 | 619.260.6859


[xcat-user] Confignetworks and default route

2018-01-09 Thread Nathan Harper
Hi,

We've been using confignetworks post OS install to take the installnic and
bond it with another interface.

As the default gateway is set by DHCP, is there some config I'm missing in
the nics table to get it to set the default gateway?

-- 
*Nathan Harper* // IT Systems Lead

*e: *nathan.har...@cfms.org.uk   *t*: 0117 906 1104  *m*:  0787 551 0891
*w: *www.cfms.org.uk
CFMS Services Ltd // Bristol & Bath Science Park // Dirac Crescent // Emersons
Green // Bristol // BS16 7FR

CFMS Services Ltd is registered in England and Wales No 05742022 - a
subsidiary of CFMS Ltd
CFMS Services Ltd registered office // 43 Queens Square // Bristol // BS1
4QP


Re: [xcat-user] Debugmode solves stateful installation hang after postscripts

2018-01-09 Thread Dudu Edri
The host was with Boot Mode of "UEFI and Legacy". I performed the steps you 
advised me but it didn't work, same outcome as before.

As a side note, I did the steps you said on a physical node. We also manage vms 
(running on VMware ESXi) with that xCAT. Their state is the same as with nodes 
- we run nodeset and reboot - and they get stuck on the same step as the 
physical nodes. Everything works for them as well when we enable xcatdebugmode. 
So the boot mode might not be the issue.

What do you think?
Thanks

On Jan 8, 2018 5:37 PM, Casandra H Qiu  wrote:

Can you check the boot mode of the node?
# pasu nodename show

If the output is BootModes.SystemBootMode=UEFI and Legacy,
you may need to set it to "Legacy Only" with this command:
# pasu nodename set BootModes.SystemBootMode "Legacy Only"
then run "rinstall".

If the output is UEFI Mode, you need to run
# rsetboot nodename net -u
# rpower nodename reset
rinstall will not work in this case.
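The two cases above can be summarized as a small decision helper. This is only a sketch: the function name is made up, it prints the suggested commands rather than running them, and it matches the boot mode strings exactly as they are quoted in this mail.

```shell
# Hypothetical helper: print the suggested commands for a node, given the
# SystemBootMode value reported by "pasu nodename show".
bootmode_cmds() {
    node=$1
    mode=$2
    case $mode in
        "UEFI and Legacy")
            # Switch to legacy boot, then reinstall as usual.
            printf 'pasu %s set BootModes.SystemBootMode "Legacy Only"\n' "$node"
            printf 'rinstall %s\n' "$node"
            ;;
        "UEFI Mode")
            # rinstall is said not to work here; force a UEFI network boot.
            printf 'rsetboot %s net -u\n' "$node"
            printf 'rpower %s reset\n' "$node"
            ;;
        *)
            echo "no suggestion for boot mode: $mode" >&2
            return 1
            ;;
    esac
}
```

For instance, `bootmode_cmds cn1 "UEFI Mode"` prints the rsetboot and rpower lines for cn1.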


Thanks,
Casandra
...
Casandra Hong Qiu
Phone: (845) 433-9291, t/l 293-9291
Office: Building 8, 3-B-04
cxh...@us.ibm.com




From: Dudu Edri 
To: xCAT Users Mailing list 
Date: 01/08/2018 10:22 AM
Subject: Re: [xcat-user] Debugmode solves stateful installation hang after 
postscripts





Yes, Intel x86-64.

Thanks

On Jan 8, 2018 17:03, Casandra H Qiu  wrote:

Is this on x86?


Thanks,
Casandra
...
Casandra Hong Qiu
Phone: (845) 433-9291, t/l 293-9291
Office: Building 8, 3-B-04
cxh...@us.ibm.com




From: Dudu Edri 
To: "xcat-user@lists.sourceforge.net" 
Date: 01/08/2018 09:56 AM
Subject: Re: [xcat-user] Debugmode solves stateful installation hang after 
postscripts




In addition to Daniel's response, I would like to add more information. I work 
with Daniel and his mail is related to me as well, of course.

While debugging the issue, we added a lot of debug messages to the postscripts 
and postbootscripts, in order to determine the exact step that the installation 
gets stuck on. These debug messages are written to a file, which is accessible 
through SSH (the node is accessible via SSH while running the postscripts).

While examining the file during the installation, we can see that it fills up 
with debug messages. We have a debug message on each script's beginning and 
end. The last debug message is the ending of the latest script of 
"postscripts". Since the node does not reboot, we don't reach the 
postbootscripts at all.

In addition to that, by examining the output of "lsdef  -i currchain" 
during the hang, we get "boot". When trying to forcibly reboot the node, it 
just boots up as expected, runs the postbootscripts and finishes. It seems that 
the node does acknowledge its state to xCAT, but it does not reboot.
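To repeat the currchain check while a node hangs, a small filter such as the following could pull the value out of the lsdef output. It assumes the output contains an indented `currchain=VALUE` line, which is how lsdef normally prints attributes as far as I know. The function names and the node name are placeholders.

```shell
# Hypothetical filter: extract the currchain value from lsdef-style text on stdin.
currchain_of() {
    awk -F= '/^[ \t]*currchain=/ { print $2 }'
}

# Succeeds when the value matches what was seen during the hang ("boot").
reached_boot_step() {
    [ "$(currchain_of)" = "boot" ]
}
```

Possible use on the management node, with cn1 standing in for the real node name: `lsdef cn1 -i currchain | reached_boot_step && echo "cn1 reported boot but has not rebooted"`.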

As Daniel stated in his previous mail, turning on "xcatdebugmode" on the "site" 
table causes the whole process to work.

We would like to understand what is causing this problem.

Thanks

On Jan 8, 2018 16:27, Daniel Letai  wrote:
I can ssh to the node, but when trying to ssh back from the node to 
the xCAT server it requires a password.
ssh_keys is set to postbootscripts. Should I move it to postscripts?

On 01/01/2018 17:36, Russ Auld wrote:
Ensure that the node can ssh back to the MN in the anaconda environment. 
The updateflag.awk script can hang trying to update the node's status at 
the end of postscripts.


On Jan 1, 2018 9:32 AM, Daniel Letai 
 wrote:
Hello,

I have encountered a strange issue where sending any node to rinstall "hangs" 
after finishing the postscripts - it never reboots, and therefore never 
continues to the postbootscripts.

Trying to diagnose the issue led to the strange bit.

Setting xcatdebugmode=1 in site table SOLVED the issue while still not showing 
any error in any log.

We have verified this is indeed the case - setting it to 0 reverts to non 
functioning rinstall, re-setting to 1 and rinstall works without an issue.
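For anyone wanting to reproduce the toggle, the sketch below prints the site table change for review. The helper name is made up. `clustersite` is the usual name of the site object, but confirm it on your management node. Only the values 0 and 1 mentioned in this mail are accepted.

```shell
# Hypothetical helper: print the command that sets xcatdebugmode in the site table.
xcatdebugmode_cmd() {
    case $1 in
        0|1)
            printf 'chdef -t site clustersite xcatdebugmode=%s\n' "$1"
            ;;
        *)
            echo "usage: xcatdebugmode_cmd 0|1" >&2
            return 1
            ;;
    esac
}
```

`xcatdebugmode_cmd 1` prints `chdef -t site clustersite xcatdebugmode=1`. The current value can be inspected with `lsdef -t site clustersite -i xcatdebugmode`.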

We would like to work without debugmode - what might be the issue and how can 
we solve this?

xCAT version - 2.13.8

xCAT node OS - RHEL 7.4

Nodes OS - RHEL 6.5

Thanks,

Daniel Letai


