$ nodedeploy MYHOST
MYHOST: pending: ubuntu-20.04.6-x86_64-default

I have U22.04 available already as well if testing with that is useful. 

The server in question isn’t used for anything special currently. My hope is 
that once I get some basic stuff going with the SuperMicro hardware we can 
start upgrading our Lenovo systems.

> On Nov 10, 2023, at 14:25, Jarrod Johnson <jjohns...@lenovo.com> wrote:
> 
> It should cloud-init as a matter of course, just like for the kickstart 
> installs...
> 
> What does nodedeploy <node> look like when you hit interactive?  May need to 
> look into this more directly next week...
> 
>> From: David Magda <dmagda+x...@ee.torontomu.ca>
>> Sent: Friday, November 10, 2023 2:16 PM
>> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
>> Subject: Re: [xcat-user] [External] Re: xCAT-Confluent
>>  
>> Ah, silly me: bad copy-paste.
>> 
>> That command gives:
>> 
>>          File "/opt/confluent/bin/confluent_selfcheck", line 241
>>            for rsp in sess.read(f'/nodes/{args.node}/attributes/all’):
>>                                                                              
>>                     ^
>>         SyntaxError: invalid syntax
>> 
>> Regardless, I (re-)ran the `nodeattrib` correctly, but that did not help. I 
>> then removed all the “filename=…” stanzas in dhcpd.conf, did a restart, and 
>> the system got (AFAICT) an IP from DHCPd, but Confluent gave it the PXE boot 
>> parameters and the system launched into the Ubuntu 20.04 installer. The 
>> console is prompting me a bunch of questions.
>> 
>> So I’ve think I’ve finally managed to muddle through this part of the 
>> documentation:
>> 
>>         https://hpc.lenovo.com/users/documentation/confluentosdeploy.html
>> 
>> Is there any documentation about automating Ubuntu installs with Confluent? 
>> Does Confluent handle any cloud-init stuff (which was run during the boot 
>> process), or is there some other method to send things that partitioning and 
>> packing information to Ubuntu?
>> 
>> 
>> > On Nov 10, 2023, at 11:01, Jarrod Johnson <jjohns...@lenovo.com> wrote:
>> > 
>> > The attribute name is plural, with s at the end.  
>> > deployment.useinsecureprotocols rather than deployment.useinsecureprotocol.
>> > 
>> > confluent_selfcheck -n MYHOST
>> > 
>> > Say anything interesting?
>> > 
>> >> From: David Magda <dmagda+x...@ee.torontomu.ca>
>> >> Sent: Friday, November 10, 2023 10:50 AM
>> >> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
>> >> Subject: Re: [xcat-user] [External] Re: xCAT-Confluent
>> >>  
>> >> Looking in that file there was:
>> >> 
>> >>         Nov 09 09:02:06 {"info": "Boot attempt by MYHOST detected in 
>> >> insecure 
>> >>         mode, but insecure mode is disabled.  Set the attribute 
>> >>         `deployment.useinsecureprotocols` to `firmware` or `always` to 
>> >> enable 
>> >>         support, or use UEFI HTTP boot with HTTPS." }
>> >> 
>> >> Trying to tweak that attribute, I got:
>> >> 
>> >>         $  nodeattrib MYHOST deployment.useinsecureprotocol=firmware
>> >>         Error: Bad Request - deployment.useinsecureprotocol attribute on 
>> >> node MYHOST is invalid
>> >> 
>> >> I tried using nodegroupattrib as well on a group that the host was in, 
>> >> and got:
>> >> 
>> >>         Error: Bad Request - deployment.useinsecureprotocol attribute is 
>> >> invalid
>> >> 
>> >> I then edited the reply_dhcp4(() function in 
>> >> /opt/confluent/lib/python/confluent/discovery/protocols/pxe.py  to change 
>> >> the default check to remove the “return;" in the "if insecuremode == 
>> >> 'never' and not httpboot:" stanza so that it would continue going. The 
>> >> log message still appears (so I know the code is getting there), but the 
>> >> events file now has:
>> >> 
>> >>         Nov 09 09:18:34 {"info": "Offering PXE boot without address, 
>> >> served from 172.17.15.254 to MYHOST"}
>> >> 
>> >> And the system is still booting xCat (I have commented out 
>> >> "gpxe.no-pxedhcp 1" in dhcpd.conf and restarted).
>> >> 
>> >> Not running the dhcpd at all simply has the system timeout on its PXE 
>> >> attempt. I told Confluent about the particular IP address the system 
>> >> should have:
>> >> 
>> >>         $ nodeattrib MYHOST net.ipv4_address=172.17.15.223/21
>> >> 
>> >> And that did not help.
>> >> 
>> >> Per "lsof -i udp", Confluent is listening on (amongst many other ports) 
>> >> *:bootps, *:dhcpv6-server, *:pxe (etc).
>> >> 
>> >> Should I edit my dhcpd.conf and rip out things like:
>> >> 
>> >>         […]
>> >>         if option user-class-identifier = "xNBA" and option 
>> >> client-architecture = 00:00 { #x86, xCAT Network Boot Agent
>> >>                 always-broadcast on;
>> >>                 filename = "…"
>> >>         […]
>> >> 
>> >> to try to see if that will get things going with Confluent? Or are things 
>> >> expected to work with all of that?
>> >> 
>> >> 
>> >> 
>> >> > On Nov 8, 2023, at 16:19, Jarrod Johnson <jjohns...@lenovo.com> wrote:
>> >> > 
>> >> > tail /var/log/confluent/events for a hint on why it might be ignoring 
>> >> > the request.
>> >> > 
>> >> >> From: David Magda <dma...@ee.torontomu.ca>
>> >> >> Sent: Wednesday, November 8, 2023 2:46 PM
>> >> >> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
>> >> >> Subject: Re: [xcat-user] [External] Re: xCAT-Confluent
>> >> >>  
>> >> >> 
>> >> >> I did a “service dhcpd stop” and a “service confluent restart”, and 
>> >> >> the SuperMicro did not receive any reply to the DHCP/PXE packets it 
>> >> >> was sending out. I then did a “service dhcpd start” and the 
>> >> >> “xcat/genesis” file was loaded.
>> >> >> 
>> >> >> The dhcpd.conf did have "gpxe.no-pxedhcp”, but removing it and 
>> >> >> restarting did not change any behaviour. I noticed that 
>> >> >> “http://IP:80/tftpboot/xcat/xnba/nets/172.17.8.0_21” is being 
>> >> >> referenced.
>> >> >> 
>> >> >> Per “lsof -i udp”, the Confluent is listening on *:bootps, so I’m not 
>> >> >> sure why it is not answering. I had run a “nodedeploy MYHOST -n
>> >> >> ubuntu-20.04.6-x86_64-default” earlier.
>> >> >> 
>> >> >> $ nodeattrib MYHOST
>> >> >> MYHOST: console.method: ipmi
>> >> >> MYHOST: deployment.apiarmed: once
>> >> >> MYHOST: deployment.pendingprofile: ubuntu-20.04.6-x86_64-default
>> >> >> MYHOST: deployment.profile:
>> >> >> MYHOST: deployment.stagedprofile:
>> >> >> MYHOST: deployment.state:
>> >> >> MYHOST: deployment.state_detail:
>> >> >> MYHOST: groups: prox,ipmi,all,everything
>> >> >> MYHOST: hardwaremanagement.manager: MYHOST-ipmi
>> >> >> MYHOST: net.hwaddr: ac:1f:AA:BB:CC:DD
>> >> >> MYHOST: net.ipv4_method: dhcp
>> >> >> MYHOST: secret.hardwaremanagementpassword: ********
>> >> >> MYHOST: secret.hardwaremanagementuser: ********
>> >> >> 
>> >> >> 
>> >> >> > On Nov 7, 2023, at 13:40, Jarrod Johnson  wrote:
>> >> >> > 
>> >> >> > If dhcpd.conf is set to not send any 'filename', it's best.  If you 
>> >> >> > don't need a dhcp server, then you can turn it off.  There's also
>> >> >> > 
>> >> >> > If you have a dhcp server with a dynamic range on it, then:
>> >> >> > nodeattrib  net.ipv4_method=firmwaredhcp
>> >> >> > 
>> >> >> > If you have a dhcp server with static reservations, you could either 
>> >> >> > have dhcp continue, or disallow dhcp for the confluent node.
>> >> >> > 
>> >> >> > If you have no dhcp server, then it should just do the right thing 
>> >> >> > directly.
>> >> >> > 
>> >> >> > If you want to use dhcp ongoing, then 'net.ipv4_method=dhcp', 
>> >> >> > however you own the IPAM sort of responsibility totally.
>> >> >> > 
>> >> >> > If your dhcp has:
>> >> >> >   option gpxe.no-pxedhcp 1;
>> >> >> > Please remove that to let confluent merge an offer with an 
>> >> >> > uncoordinated dhcp server.
>> >> >> > 
>> >> >> > I need to do a deeper right up on the detail about dhcp interaction, 
>> >> >> > how it is now optional, and how it can coexist with an unmanaged 
>> >> >> > dhcp server and free the dhcp server from 'filename'
>> >> >> > 
>> >> >> >> From: David Magda 
>> >> >> >> Sent: Tuesday, November 7, 2023 9:27 AM
>> >> >> >> To: xCAT Users Mailing list 
>> >> >> >> Subject: Re: [xcat-user] [External] Re: xCAT-Confluent
>> >> >> >> 
>> >> >> >> After running the first few commands, I have 
>> >> >> >> /tftpboot/confluent/x86_64/ipxe* and /var/lib/confluent/public/{os, 
>> >> >> >> distribution}/ubuntu* present, along with genesis-x86_64/.
>> >> >> >> 
>> >> >> >> However the contents of the RHEL/CentOS /etc/dhcp/dhcpd.conf are 
>> >> >> >> such that “filename” is “xcat/xnba.*”, so that’s what gets loaded.
>> >> >> >> 
>> >> >> >> Do I need to tweak the dhcpd.conf just for the test system I’m 
>> >> >> >> playing with, or should a completely new dhcpd.conf file be put in 
>> >> >> >> place for using Confluent? (Moving the current one out of the way, 
>> >> >> >> perhaps temporarily until I get an understanding of Confluent so I 
>> >> >> >> can revert to xCat if need-be.)
>> >> >> >> 
>> >> >> >>> On Oct 26, 2023, at 11:33, Jarrod Johnson  wrote:
>> >> >> >>> 
>> >> >> >>> I will say that EL7 hasn't been tested and thus we haven't pushed 
>> >> >> >>> updates since 3.8.0, but 3.8.0 should be plenty.
>> >> >> >>> 
>> >> >> >>> The confluent you have going is already enough to start examining 
>> >> >> >>> OS deployment profiles.  If you would like to, you can use 
>> >> >> >>> commands like osdeploy initialize and osdeploy import and even 
>> >> >> >>> imgutil build, and it won't mess with xCAT.
>> >> >> >>> 
>> >> >> >>> When you get to nodedeploy​, that is the time when you have to 
>> >> >> >>> start planning around potential disruption as xCAT and confluent 
>> >> >> >>> might fight over who gets to deploy a system, and that can be 
>> >> >> >>> confusing.  We should document formally how to mask a node from 
>> >> >> >>> xCAT ('!*NOIP*' in mac table) to let one kick the tires with a 
>> >> >> >>> node...
>> >> >> >>> 
>> >> >> >>> I can help look at a few people kicking tires, certainly seems 
>> >> >> >>> worthy of documentation or video example...
>> >> >> >>>> From: David Magda 
>> >> >> >>>> Sent: Thursday, October 26, 2023 11:22 AM
>> >> >> >>>> To: xCAT Users Mailing list 
>> >> >> >>>> Subject: [External] Re: [xcat-user] xCAT-Confluent
>> >> >> >>>> 
>> >> >> >>>> Yes, there was perhaps auto-completion with regards 
>> >> >> >>>> Confluent/Confluence.
>> >> >> >>>> I currently have a (legacy?) ‘joint’ xCAT-Confluent (3.6) 
>> >> >> >>>> installation on RHEL 7 that I inherited; if one wants to fully 
>> >> >> >>>> move from xCAT to Confluent, is there document on how to 
>> >> >> >>>> ‘extract’ oneself from xCAT? I don’t see anything that jumps out 
>> >> >> >>>> at:
>> >> >> >>>>        https://hpc.lenovo.com/users/
>> >> >> >>>>        https://hpc.lenovo.com/users/documentation/
>> >> >> >>>> Should I simply abandon the previous installation and do a fresh 
>> >> >> >>>> install? While there is some documentation, the system leans 
>> >> >> >>>> towards being heavily vendor-used so people completely new to it 
>> >> >> >>>> have a steep learning curve (xCAT is/was also challenging to get 
>> >> >> >>>> into since it was fairly vendor-focused).
>> >> >> >> […]
>> >> 
>> 
>> 
>> _______________________________________________
>> xCAT-user mailing list
>> xCAT-user@lists.sourceforge.net
>> https://apc01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.sourceforge.net%2Flists%2Flistinfo%2Fxcat-user&data=05%7C01%7Cjjohnson2%40lenovo.com%7C792090eb799c44203d5f08dbe221c79a%7C5c7d0b28bdf8410caa934df372b16203%7C0%7C0%7C638352407016733478%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=HnVKyN2mc6qmLTaPkQafrcs5ZZ3UV9tp%2B9xFz6jf0bE%3D&reserved=0
>> _______________________________________________
>> xCAT-user mailing list
>> xCAT-user@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/xcat-user
> 


_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user

Reply via email to