Thanks to your help, discovery is now working well, thank you! I had been
pulled off this project for a while.
Now for the next issue. Discovery works beautifully on the command line,
but a few of the commands fail when I try to script the discovery.
I perform the following steps:
- pre-create my nodes
- run bmcdiscover
- makehosts, makedns, makedhcp.
- rsetboot <nodename> net
- rpower <nodename> reset
- wait until discovery is complete
- nodeset <nodename> osimage=<...>
And my node boots!
Now my problem is that I want to do this in a script. The commands rsetboot
and rpower work fine when I run them manually on a console, but don't work
when I run them via a script - the only difference I can think of is that
the script does not have a tty. The actual error message I am getting is
"Error: ERROR: Insufficient resources to create new session (wait for
existing sessions to timeout)" Note: I am reasonably sure the error message
is misleading. I can't think of any "resource" that would really be
"insufficient" and there are no other "sessions" either.
How can I go about troubleshooting this?
_______________________________________________________________________
Kevin Keane | Systems Architect | University of San Diego ITS |
kke...@sandiego.edu
Maher Hall, 192 |5998 Alcalá Park | San Diego, CA 92110-2492 | 619.260.6859
<%28619%29%20260-2298>
*REMEMBER! **No one from IT at USD will ever ask to confirm or supply your
password*.
These messages are an attempt to steal your username and password. Please
do not reply to, click the links within, or open the attachments of these
messages. Delete them!
On Mon, Mar 19, 2018 at 7:29 PM, Yuan Y Bai <by...@cn.ibm.com> wrote:
> Hi Kevin,
>
> In xCAT discovery docs, there are 2 subnets for recommended way of
> setting up a cluster with multiple networks to make discovery example easy.
>
> 1, In site table, you should configure master using xCAT MN ip in the
> provision network.
>
> 2, In networks table, here gives you 2 cases:
>
> *In case 1*: if there are 2 subnets, one is for hardware control, one is
> for provision ip:
>
> Hardware control network: 50.0.0.0/255.255.0.0 ——> you can assign
> 50.0.100.1-50.0.100.200 in the dynamicrange in networks table, this
> dynamicrange is for DHCP server during discovery, you can see 'tabdump -d
> networks'
> Provision network: 10.0.0.0/255.0.0.0
>
> You can add 2 networks entries in networks table, like:
> “50_0_0_0-255_255_0_0”,”50.0.0.0”,”255.255.0.0”,,,,,,,,”50.
> 0.100.1-50.0.100.200",,,,,,,,,
> "10_0_0_0-255_0_0_0","10.0.0.0","255.0.0.0",,,,"<xcatmaster>",,,,,,,,,,,,,
>
> *In case 2*: if there is only one network 50.0.0.0/16 for hardware
> control and provision ip, you can add dynamicrange in networks table during
> discovery process, after BMC is discovered and configured static ip, you
> can remove the dynamic range from the networks table in case it affects
> provision process.
>
> Before and during discovery process, networks table:
> “50_0_0_0-255_255_0_0”,”50.0.0.0”,”255.255.0.0”,,,,"<
> xcatmaster>",,,,”50.0.100.1-50.0.100.200",,,,,,,,,
>
> After discovery is finished, you should remove dynamic range from networks
> table:
> “50_0_0_0-255_255_0_0”,”50.0.0.0”,”255.255.0.0”,,,,"<
> xcatmaster>",,,,,,,,,,,,,
>
> 3, After better understanding how configure site table and networks table,
> you can re-read the discovery doc to define the node based on the doc and
> have a try to discovery process, have a fun with it.
>
> Best Regards
> --------------------------------------------------
> Yuan Bai (白媛)
>
> CSTL HPC System Management Development
> Tel:86-10-82451401
> E-mail: by...@cn.ibm.com
> Address: IBM ZGC Campus. Ring Building 28,
> ZhongGuanCun Software Park,No.8 Dong Bei Wang West Road, Haidian District,
> Beijing P.R.China 100193
>
> IBM环宇大厦
> 北京市海淀区东北旺西路8号,中关村软件园28号楼
> 邮编:100193
>
>
>
> ----- Original message -----
> From: Kevin Keane <kke...@sandiego.edu>
> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
> Cc:
> Subject: [xcat-user] Understanding the BMC networking
> Date: Tue, Mar 20, 2018 4:55 AM
>
> Hi,
>
> I'm trying to understand the recommended way of setting up a cluster with
> multiple networks.
>
> Specifically, I note the recommendation that the BMC should have a static
> IP address *in a different subnet* from the node's main IP address, and
> also in a different subnet from the DHCP-assigned BMC address during
> discovery. Does that imply that the same physical network should have two,
> or even three, different subnets running on it?
>
> http://xcat-docs.readthedocs.io/en/stable/guides/admin-
> guides/manage_clusters/ppc64le/discovery/mtms/discovery_using_defined.html
> <https://urldefense.proofpoint.com/v2/url?u=http-3A__xcat-2Ddocs.readthedocs.io_en_stable_guides_admin-2Dguides_manage-5Fclusters_ppc64le_discovery_mtms_discovery-5Fusing-5Fdefined.html&d=DwMFaQ&c=jf_iaSHvJObTbx-siA1ZOg&r=uiTcPxjMR44SPRNNb6l_nA&m=30EtCiiIAgBokFzHEiMyaasazbWLD-mtLiDe0eOfTc8&s=Y1UWzFDQN_ILkvL4gVl72RWrR-ldMn7PtbC_3_iLRxw&e=>
>
> In this example, there are actually three subnets involved.
>
> The DHCP server apparently serves the subnet 50.0.100.0/24
> <https://urldefense.proofpoint.com/v2/url?u=http-3A__50.0.100.0_24&d=DwMFaQ&c=jf_iaSHvJObTbx-siA1ZOg&r=uiTcPxjMR44SPRNNb6l_nA&m=30EtCiiIAgBokFzHEiMyaasazbWLD-mtLiDe0eOfTc8&s=A4_G4o_C7Ne3v-Y4QF3k4QWpgCZLe850mMEbLXnVm7g&e=>
> The BMC will be configured with a static IP on 50.0.101.0/24
> <https://urldefense.proofpoint.com/v2/url?u=http-3A__50.0.101.0_24&d=DwMFaQ&c=jf_iaSHvJObTbx-siA1ZOg&r=uiTcPxjMR44SPRNNb6l_nA&m=30EtCiiIAgBokFzHEiMyaasazbWLD-mtLiDe0eOfTc8&s=83y26Cf7zBsWEyXl3oVVk6GyqdSstLXXCtCCGjMhxro&e=>
> And the node's IP will be on the 10.0.100.0/24
> <https://urldefense.proofpoint.com/v2/url?u=http-3A__10.0.100.0_24&d=DwMFaQ&c=jf_iaSHvJObTbx-siA1ZOg&r=uiTcPxjMR44SPRNNb6l_nA&m=30EtCiiIAgBokFzHEiMyaasazbWLD-mtLiDe0eOfTc8&s=q7PeV0LU4MnbZYcbzB51ysmbhkWubh57Ue2Gy8VnBiU&e=>
> subnet.
>
> Assuming that the BMC does not have a dedicated NIC, but is shared with
> the first NIC, how would I configure the various tables in xCAT?
>
> It looks like I would need to touch the network table, the site table, and
> the node object for the management node?
>
> Thanks!
>
> --
>
> _______________________________________________________________________
> Kevin Keane | Systems Architect | University of San Diego ITS |
> kke...@sandiego.edu
> Maher Hall, 192 |5998 Alcalá Park | San Diego, CA 92110-2492 |
> 619.260.6859 <%28619%29%20260-2298>
> ------------------------------------------------------------
> ------------------
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, Slashdot.org! https://urldefense.proofpoint.
> com/v2/url?u=http-3A__sdm.link_slashdot&d=DwICAg&c=jf_
> iaSHvJObTbx-siA1ZOg&r=uiTcPxjMR44SPRNNb6l_nA&m=
> 30EtCiiIAgBokFzHEiMyaasazbWLD-mtLiDe0eOfTc8&s=
> t4ldPrLbBJvXBMyEMXHcMfDGf0M9_ymf-p9S2adT7BU&e=
> _______________________________________________
> xCAT-user mailing list
> xCAT-user@lists.sourceforge.net
> https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.
> sourceforge.net_lists_listinfo_xcat-2Duser&d=DwICAg&
> c=jf_iaSHvJObTbx-siA1ZOg&r=uiTcPxjMR44SPRNNb6l_nA&m=
> 30EtCiiIAgBokFzHEiMyaasazbWLD-mtLiDe0eOfTc8&s=FVaZSQ-
> a5z5U5WjjWVtl-KrSeEdzrNI-Q_7viujiiXE&e=
>
>
>
>
> ------------------------------------------------------------
> ------------------
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, Slashdot.org! http://sdm.link/slashdot
> _______________________________________________
> xCAT-user mailing list
> xCAT-user@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/xcat-user
>
>
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user