Hello Kevin,
Normally I split the discvoery part from the installation part when I set up
clusters.
This is all a matter of personal choice.
My guess is you might need a slight pause between commands which are talking to
the the BMC device.
Maybe try a “sleep 2” between the rsetboot and rpower commands?
Particularly with rpower the BMC can take a little while to respond to
sucessive comnmands if run quickly.
You can sometimes see this if you run “rpower <node> on” then run “rpower
<node> stat” immediately after.
You can of course configure the xCAT chain table to automatically install your
nodes once they are discovered if that is something you want to do:
https://xcat-docs.readthedocs.io/en/stable/advanced/chain/run_tasks_during_discovery.html
There could be other vendor-specific reasons you could see the insufficient
resources message but see how you get on with that.
Regards,
David
From: Kevin Keane <kke...@sandiego.edu>
Sent: Friday, May 25, 2018 1:23 AM
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Subject: [External] Re: [xcat-user] Understanding the BMC networking
Thanks to your help, discovery is now working well, thank you! I had been
pulled off this project for a while.
Now for the next issue. Discovery works beautifully on the command line, but a
few of the commands fail when I try to script the discovery.
I perform the following steps:
- pre-create my nodes
- run bmcdiscover
- makehosts, makedns, makedhcp.
- rsetboot <nodename> net
- rpower <nodename> reset
- wait until discovery is complete
- nodeset <nodename> osimage=<...>
And my node boots!
Now my problem is that I want to do this in a script. The commands rsetboot and
rpower work fine when I run them manually on a console, but don't work when I
run them via a script - the only difference I can think of is that the script
does not have a tty. The actual error message I am getting is "Error: ERROR:
Insufficient resources to create new session (wait for existing sessions to
timeout)" Note: I am reasonably sure the error message is misleading. I can't
think of any "resource" that would really be "insufficient" and there are no
other "sessions" either.
How can I go about troubleshooting this?
_______________________________________________________________________
Kevin Keane | Systems Architect | University of San Diego ITS |
kke...@sandiego.edu<mailto:kke...@sandiego.edu>
Maher Hall, 192 |5998 Alcalá Park | San Diego, CA 92110-2492 |
619.260.6859<tel:%28619%29%20260-2298>
REMEMBER! No one from IT at USD will ever ask to confirm or supply your
password.
These messages are an attempt to steal your username and password. Please do
not reply to, click the links within, or open the attachments of these
messages. Delete them!
On Mon, Mar 19, 2018 at 7:29 PM, Yuan Y Bai
<by...@cn.ibm.com<mailto:by...@cn.ibm.com>> wrote:
Hi Kevin,
In xCAT discovery docs, there are 2 subnets for recommended way of setting up
a cluster with multiple networks to make discovery example easy.
1, In site table, you should configure master using xCAT MN ip in the provision
network.
2, In networks table, here gives you 2 cases:
In case 1: if there are 2 subnets, one is for hardware control, one is for
provision ip:
Hardware control network: 50.0.0.0/255.255.0.0<http://50.0.0.0/255.255.0.0> ——>
you can assign 50.0.100.1-50.0.100.200 in the dynamicrange in networks table,
this dynamicrange is for DHCP server during discovery, you can see 'tabdump -d
networks'
Provision network: 10.0.0.0/255.0.0.0<http://10.0.0.0/255.0.0.0>
You can add 2 networks entries in networks table, like:
“50_0_0_0-255_255_0_0”,”50.0.0.0”,”255.255.0.0”,,,,,,,,”50.0.100.1-50.0.100.200",,,,,,,,,
"10_0_0_0-255_0_0_0","10.0.0.0","255.0.0.0",,,,"<xcatmaster>",,,,,,,,,,,,,
In case 2: if there is only one network 50.0.0.0/16<http://50.0.0.0/16> for
hardware control and provision ip, you can add dynamicrange in networks table
during discovery process, after BMC is discovered and configured static ip, you
can remove the dynamic range from the networks table in case it affects
provision process.
Before and during discovery process, networks table:
“50_0_0_0-255_255_0_0”,”50.0.0.0”,”255.255.0.0”,,,,"<xcatmaster>",,,,”50.0.100.1-50.0.100.200",,,,,,,,,
After discovery is finished, you should remove dynamic range from networks
table:
“50_0_0_0-255_255_0_0”,”50.0.0.0”,”255.255.0.0”,,,,"<xcatmaster>",,,,,,,,,,,,,
3, After better understanding how configure site table and networks table, you
can re-read the discovery doc to define the node based on the doc and have a
try to discovery process, have a fun with it.
Best Regards
--------------------------------------------------
Yuan Bai (白媛)
CSTL HPC System Management Development
Tel:86-10-82451401
E-mail: by...@cn.ibm.com<mailto:by...@cn.ibm.com>
Address: IBM ZGC Campus. Ring Building 28,
ZhongGuanCun Software Park,No.8 Dong Bei Wang West Road, Haidian District,
Beijing P.R.China 100193
IBM环宇大厦
北京市海淀区东北旺西路8号,中关村软件园28号楼
邮编:100193
----- Original message -----
From: Kevin Keane <kke...@sandiego.edu<mailto:kke...@sandiego.edu>>
To: xCAT Users Mailing list
<xcat-user@lists.sourceforge.net<mailto:xcat-user@lists.sourceforge.net>>
Cc:
Subject: [xcat-user] Understanding the BMC networking
Date: Tue, Mar 20, 2018 4:55 AM
Hi,
I'm trying to understand the recommended way of setting up a cluster with
multiple networks.
Specifically, I note the recommendation that the BMC should have a static IP
address *in a different subnet* from the node's main IP address, and also in a
different subnet from the DHCP-assigned BMC address during discovery. Does that
imply that the same physical network should have two, or even three, different
subnets running on it?
http://xcat-docs.readthedocs.io/en/stable/guides/admin-guides/manage_clusters/ppc64le/discovery/mtms/discovery_using_defined.html<https://urldefense.proofpoint.com/v2/url?u=http-3A__xcat-2Ddocs.readthedocs.io_en_stable_guides_admin-2Dguides_manage-5Fclusters_ppc64le_discovery_mtms_discovery-5Fusing-5Fdefined.html&d=DwMFaQ&c=jf_iaSHvJObTbx-siA1ZOg&r=uiTcPxjMR44SPRNNb6l_nA&m=30EtCiiIAgBokFzHEiMyaasazbWLD-mtLiDe0eOfTc8&s=Y1UWzFDQN_ILkvL4gVl72RWrR-ldMn7PtbC_3_iLRxw&e=>
In this example, there are actually three subnets involved.
The DHCP server apparently serves the subnet
50.0.100.0/24<https://urldefense.proofpoint.com/v2/url?u=http-3A__50.0.100.0_24&d=DwMFaQ&c=jf_iaSHvJObTbx-siA1ZOg&r=uiTcPxjMR44SPRNNb6l_nA&m=30EtCiiIAgBokFzHEiMyaasazbWLD-mtLiDe0eOfTc8&s=A4_G4o_C7Ne3v-Y4QF3k4QWpgCZLe850mMEbLXnVm7g&e=>
The BMC will be configured with a static IP on
50.0.101.0/24<https://urldefense.proofpoint.com/v2/url?u=http-3A__50.0.101.0_24&d=DwMFaQ&c=jf_iaSHvJObTbx-siA1ZOg&r=uiTcPxjMR44SPRNNb6l_nA&m=30EtCiiIAgBokFzHEiMyaasazbWLD-mtLiDe0eOfTc8&s=83y26Cf7zBsWEyXl3oVVk6GyqdSstLXXCtCCGjMhxro&e=>
And the node's IP will be on the
10.0.100.0/24<https://urldefense.proofpoint.com/v2/url?u=http-3A__10.0.100.0_24&d=DwMFaQ&c=jf_iaSHvJObTbx-siA1ZOg&r=uiTcPxjMR44SPRNNb6l_nA&m=30EtCiiIAgBokFzHEiMyaasazbWLD-mtLiDe0eOfTc8&s=q7PeV0LU4MnbZYcbzB51ysmbhkWubh57Ue2Gy8VnBiU&e=>
subnet.
Assuming that the BMC does not have a dedicated NIC, but is shared with the
first NIC, how would I configure the various tables in xCAT?
It looks like I would need to touch the network table, the site table, and the
node object for the management node?
Thanks!
--
_______________________________________________________________________
Kevin Keane | Systems Architect | University of San Diego ITS |
kke...@sandiego.edu<mailto:kke...@sandiego.edu>
Maher Hall, 192 |5998 Alcalá Park | San Diego, CA 92110-2492 |
619.260.6859<tel:%28619%29%20260-2298>
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org!
https://urldefense.proofpoint.com/v2/url?u=http-3A__sdm.link_slashdot&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=uiTcPxjMR44SPRNNb6l_nA&m=30EtCiiIAgBokFzHEiMyaasazbWLD-mtLiDe0eOfTc8&s=t4ldPrLbBJvXBMyEMXHcMfDGf0M9_ymf-p9S2adT7BU&e=
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net<mailto:xCAT-user@lists.sourceforge.net>
https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.sourceforge.net_lists_listinfo_xcat-2Duser&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=uiTcPxjMR44SPRNNb6l_nA&m=30EtCiiIAgBokFzHEiMyaasazbWLD-mtLiDe0eOfTc8&s=FVaZSQ-a5z5U5WjjWVtl-KrSeEdzrNI-Q_7viujiiXE&e=
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net<mailto:xCAT-user@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/xcat-user
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user