Hi, Thomas
For "0c:c4:7a:4d:85:a8|0c:c4:7a:58:c7:6a!*NOIP*", so "0c:c4:7a:58:c7:6a" is the MAC address of eth2, right?
Will you pls show me the result of "tabdump discoverydata | grep 0c:c4:7a:58:c7:6a -A20"?
Thx!
Best Regards,
-----------------------------------
Zhao Er Tao
IBM China System and Technology Laboratory, Beijing
Tel:(86-10)82450485
Email: erta...@cn.ibm.com
Address: 1/F, 28 Building,ZhongGuanCun Software Park,
No.8 DongBeiWang West Road, Haidian District,
Beijing, 100193, P.R.China
-----------------------------------
Zhao Er Tao
IBM China System and Technology Laboratory, Beijing
Tel:(86-10)82450485
Email: erta...@cn.ibm.com
Address: 1/F, 28 Building,ZhongGuanCun Software Park,
No.8 DongBeiWang West Road, Haidian District,
Beijing, 100193, P.R.China
----- Original message -----
From: Thomas HUMMEL <thomas.hum...@pasteur.fr>
To: xcat-user@lists.sourceforge.net
Cc:
Subject: Re: [xcat-user] Discovery race condition ?
Date: Thu, Mar 21, 2019 2:19 AM
On 2/25/19 10:04 AM, Er Tao Zhao wrote:
> Hi, Thomas
> Yeah, I agree with Kevin that if you not plan to bond eth0 and eth2,
> you'd better set then in different VLAN.
> But if eth0 and eth2 are in same vlan, xCAT can deal with it after xCAT 2.13
> Will you pls show me the output of `lsdef tars-113`
> And can `tars-113-eth0` or `tars-113-eth2` can be resolved to the same
> ip with `tars-113` in your DNS? And can be get from MN?
> For example `ping tars-113-eth0` or `ping tars-113-eth2` can get same ip
> in node `xcat-tars`
Hello,
as I said, sorry for the delay I've been in vacation then busy.
A quick reminder :
Nodes are normally physically configured like this :
conf 1:
a) eth0 -> switch A (1G) : ipmi (ipmi subnet) traffic allowed
b) eth2 -> switch B (10G) : only data (data subnet) traffic allowed
and logically configured to get switch-based discovered using swith B
I did configure on purpose one node/port like this :
conf 2:
a) eth0 -> switch A (1G) : data (data subnet) + ipmi (ipmi subnet)
traffic allowed
b) eth2 -> switch B (10G) : data (data subnet) only traffic allowed
precisely to be able to handle (by knowing what's happening and/or
forcing eth2 install) a switch misconfiguration which would result to
the conf2 above [and at the time I wrote my initial message, a BIOS boot
order misconfiguration too, but I'm not so sure it has something to do
with the MAC address ending up in the mac table]
What I'm seeing on the console is basically, once loaded and having got
an ip address on the BOOTIF nic ('Aquiring network address message'),
genesis getting 2 ips from the dynamic range by DHCP, one for each nic
after the 'Beginin node discovery process' message and the final MAC
registration in the mac table beeing for the eth0 nic despite the fact that
- discoverydata entry for tars-113 was cleared beforehand
- switch-based discovery attributes where those of eth2 (switch B)
So the node gets finally netbooted via eth0, which is not what I'd want.
- lsdef tars-113 beforehand was :
---
# <xCAT data object stanza file>
tars-113:
objtype=node
addkcmdline=ipv6.disable=1 biosdevname=0 net.ifnames=0
rd.driver.blacklist=nouveau nouveau.modeset=0
arch=x86_64
bmc=10.6.96.115
bmcpassword=XXX
bmcport=0
bmcusername=XXXX
chain=runcmd=bmcsetup,runimage=https://urldefense.proofpoint.com/v2/url?u=http-3A__xcat-2Dtars_install_sum-5Factivate_sum-5Factivate.tgz-2Cosimage-3Dcentos6.10-2Dx86-5F64-2Dnetboot-2Dcompute-2Dprod&d=DwIGaQ&c=jf_iaSHvJObTbx-siA1ZOg&r=GXZJILRYuh625D1P_w_8vxyNDAEGDWoFmPVwQ0NBUcY&m=R1A2OwgmdSDPRQP3MXYSuzUCIWGpAWk8bpM5NoHWq3Q&s=UJEv9keKLIZgJb82yOkFWl-SO8dvtOLom0qkSIJ_DtM&e=
groups=tars-compute,tars-ipmi,tars,standard,b10
ip=192.168.128.115
mgt=ipmi
os=centos6.10
profile=""> provmethod=centos6.10-x86_64-netboot-compute-prod
supportedarchs=x86,x86_64
switch=b10b4.dc1.pasteur.fr
switchport=8
- lsdef afterwards is
Object name: tars-113
addkcmdline=ipv6.disable=1 biosdevname=0 net.ifnames=0
rd.driver.blacklist=nouveau nouveau.modeset=0
arch=x86_64
bmc=10.6.96.115
bmcpassword=XXXX
bmcport=0
bmcusername=XXXX
chain=runcmd=bmcsetup,osimage=centos6.10-x86_64-netboot-compute-prod
cpucount=12
cputype=Intel(R) Xeon(R) CPU E5-2620 v3 @ 2.40GHz
currstate=netboot centos6.10-x86_64-compute
disksize=sda:256GB
groups=tars-compute,tars-ipmi,tars,standard,b10
initrd=xcat/osimage/centos6.10-x86_64-netboot-compute-prod/initrd-stateless.gz
ip=192.168.128.115
kcmdline=imgurl=https://urldefense.proofpoint.com/v2/url?u=http-3A__-21myipfn-21-3A80__install_netboot_centos6.10_x86-5F64_compute_prod_rootimg.gz&d=DwIGaQ&c=jf_iaSHvJObTbx-siA1ZOg&r=GXZJILRYuh625D1P_w_8vxyNDAEGDWoFmPVwQ0NBUcY&m=R1A2OwgmdSDPRQP3MXYSuzUCIWGpAWk8bpM5NoHWq3Q&s=yfck6JfRE_2gEtvh0FAJDU-dFXebaciN9sc4OvYC27Y&e=
XCAT=!myipfn!:3001 NODE=tars-113 FC=0
kernel=xcat/osimage/centos6.10-x86_64-netboot-compute-prod/kernel
mac=0c:c4:7a:4d:85:a8|0c:c4:7a:58:c7:6a!*NOIP*
memory=258373MB
mgt=ipmi
netboot=xnba
os=centos6.10
postbootscripts=otherpkgs
profile=""> provmethod=centos6.10-x86_64-netboot-compute-prod
serial=E162178X5A02118
status=booted
statustime=03-20-2019 17:55:02
supportedarchs=x86,x86_64
switch=b10b4.dc1.pasteur.fr
switchport=8
- neither tars-113-eth0 nor tars-113-eth2 (fully qualified or not) can
be resolved
Again, the actual need is not to have 2 nics on the same subnet but to
have some way to choose which MAC will get discovered if so.
Does BIOS PXE order makes any difference to what nic gets into the mac
address at the end ?
Which nic does genesis picks up between the 2 to put in the mac address ?
Does a discovery happen on each nic ?
It seems like genesis actually did the switch-based discovery but
registered the first MAC it saw, thus the mismatch
Thanks.
--
Thomas H.
_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user
_______________________________________________ xCAT-user mailing list xCAT-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xcat-user