[xcat-user] xCAT as simple PXE Server for Unattended installations

2017-10-07 Thread Vinícius Ferrão
Hello,

I would like to use xCAT as a simple baremetal provisioner.

As far as I understand from the documentation, the deployed machines must talk 
to the xCAT server after installation (please correct me if I’m wrong), so what 
happens if the xCAT server goes away? Is there a way to safely ditch the server 
after installation?

The idea of using xCAT is just to avoid manually installing and deploying DHCP, 
PXE, TFTP, a local repository, Kickstart files, etc. 

Thanks,
V.
--
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] Local scratch for stateless compute nodes

2017-11-27 Thread Vinícius Ferrão
ing.
> 
>  
> 
> Gilad Berman
> HPC Architect
> Lenovo EMEA
> +972-52-2554262 <tel:+972%2052-255-4262>
> gber...@lenovo.com <mailto:gber...@lenovo.com>
>  
> Lenovo.com  <http://www.lenovo.com/>
> Twitter <http://twitter.com/lenovo> | Facebook 
> <http://www.facebook.com/lenovo> | Instagram <https://instagram.com/lenovo> | 
> Blogs <http://blog.lenovo.com/> | Forums <http://forums.lenovo.com/> 
> 
>  
>  
> From: Russ Auld [mailto:russa...@comcast.net <mailto:russa...@comcast.net>] 
> Sent: Monday, November 27, 2017 4:50 PM
> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net 
> <mailto:xcat-user@lists.sourceforge.net>>
> Subject: Re: [xcat-user] Local scratch for stateless compute nodes
>  
> If you're using netboot and local disk, then isn't that "statelite"?
> Do the statelite instructions not work? 
>  
> On Nov 27, 2017 9:26 AM, Gilad Berman <gber...@lenovo.com 
> <mailto:gber...@lenovo.com>> wrote:
> All,
> 
>  
> 
> I would like to join this question –
> 
> Does localdisk even work with stateless? From the docs it seems that it should 
> be supported (because it is under stateless), however –
> 
> - the instructions are taken from statelite and refer to statelite 
> code (litefile)
> 
> - The rc.localdisk code is under statelite
> 
> - In the linuximage man – “Partitionfile - Only available for diskful 
> osimages and statelite osimages(localdisk enabled)“
> 
>  
> 
> A very quick trial on my stateless nodes results in nothing :), it seems there 
> is simply no reference to localdisk with stateless.
> 
>  
> 
> So, can someone please help clarify it?
> 
>  
> 
> ** as always, there is a chance I missed something very basic and it should 
> be working :)
> 
>  
> 
> THX in advance!
> 
>  
> 
> 
> Gilad Berman
> HPC Architect
> Lenovo EMEA
> 
> +972-52-2554262 <tel:+972%2052-255-4262>
> gber...@lenovo.com <mailto:gber...@lenovo.com>
> 
> From: Vinícius Ferrão [mailto:fer...@versatushpc.com.br] 
> Sent: Wednesday, November 22, 2017 4:09 AM
> To: xcat-user@lists.sourceforge.net
> Subject: [xcat-user] Local scratch for stateless compute nodes
> 
> Hello,
> 
> I would like to enable swap and local /tmp on my stateless nodes, but after 
> following the documentation on the following link nothing appears to work: 
> http://xcat-docs.readthedocs.io/en/stable/advanced/hierarchy/provision/diskless_sn.html
> 
> I’m aware that the documentation is for service nodes and not for compute 
> nodes, but I was thinking the procedure would be similar.
> 
> At this point I’m with this settings on osimage:
> [root@headnode xcat]# lsdef -t osimage centos7.4-x86_64-netboot-compute
>     exlist=/opt/xcat/share/xcat/netboot/centos/compute.centos7.exlist
>     imagetype=linux
>     osarch=x86_64
>     osdistroname=centos7.4-x86_64
>     osname=Linux
>     osvers=centos7.4
>     otherpkgdir=/install/post/otherpkgs/centos7.4/x86_64
>     partitionfile=/install/custom/netboot/centos7.4-x86_64-netboot-compute/partitionfile
>     permission=755
>     pkgdir=/install/centos7.4/x86_64
>     pkglist=/opt/xcat/share/xcat/netboot/centos/compute.centos7.pkglist
>     postinstall=/opt/xcat/share/xcat/netboot/centos/compute.centos7.postinstall
>     profile=compute
>     provmethod=netboot
>     rootimgdir=/install/netboot/centos7.4/x86_64/compute
>     synclists=/install/custom/netboot/compute.synclist
> 
> And the content of the partition file is the following:
> cat /install/custom/netboot/centos7.4-x86_64-netboot-compute/partitionfile
> enable=yes
> enablepart=yes
> 
> [disk]
> dev=/dev/sda
> clear=yes
> parts=10,90
> 
> [swapspace]
> dev=/dev/sda1
> 
> [localspace]
> dev=/dev/sda2
> fstype=xfs

Re: [xcat-user] Local scratch for stateless compute nodes

2017-11-29 Thread Vinícius Ferrão
> From: Yuan Y Bai [mailto:by...@cn.ibm.com]
> Sent: Tuesday, November 28, 2017 7:30 AM
> To: xcat-user@lists.sourceforge.net
> Cc: xcat-user@lists.sourceforge.net
> Subject: Re: [xcat-user] Local scratch for stateless compute nodes
> 
>  
> 
> Hello,
> 
>  
> 
> Please refer to "Enabling the localdisk Option" section under "diskless 
> installation" section : 
> http://xcat-docs.readthedocs.io/en/latest/guides/admin-guides/manage_clusters/ppc64le/diskless/customize_image/localdisk.html
>  
>  
> 
> The main diskless installation doc link is here:
> 
> http://xcat-docs.readthedocs.io/en/latest/guides/admin-guides/manage_clusters/ppc64le/diskless/index.html
>  
>  
> 
>  
> 
> Best Regards
> --
> Yuan Bai (白媛)
> 
> CSTL HPC System Management Development
> Tel:86-10-82451401 
> E-mail: by...@cn.ibm.com <mailto:by...@cn.ibm.com>
> Address: IBM ZGC Campus. Ring Building 28,
> ZhongGuanCun Software Park,No.8 Dong Bei Wang West Road, Haidian District,
> Beijing P.R.China 100193
> 
> IBM环宇大厦
> 北京市海淀区东北旺西路8号,中关村软件园28号楼
> 邮编:100193
> 
>  
> 
>  
> 
> - Original message -
> From: Vinícius Ferrão <fer...@versatushpc.com.br 
> <mailto:fer...@versatushpc.com.br>>
> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net 
> <mailto:xcat-user@lists.sourceforge.net>>
> Cc:
> Subject: Re: [xcat-user] Local scratch for stateless compute nodes
> Date: Tue, Nov 28, 2017 12:40 PM
>  
> 
> Hello,
> 
>  
> 
> The major problem is the missing documentation on how to use the local disks 
> on a stateless (diskless) installation. Tricks are being deployed instead.
> 
>  
> 
> Trying to mimic the steps of a statelite installation is not sufficient to 
> make it work on stateless deployments.
> 
>  
> 
> V.
> 
>  
> 
> On 28 Nov 2017, at 00:15, Yuan Y Bai <by...@cn.ibm.com 
> <mailto:by...@cn.ibm.com>> wrote:
> 
>  
> 
> That is correct, diskless and statelite both support local disk for different 
> purposes now.
> 
>  
> 
> Best Regards
> --
> Yuan Bai (白媛)
> 
> CSTL HPC System Management Development
> Tel:86-10-82451401 
> E-mail: by...@cn.ibm.com <mailto:by...@cn.ibm.com>
> Address: IBM ZGC Campus. Ring Building 28,
> ZhongGuanCun Software Park,No.8 Dong Bei Wang West Road, Haidian District,
> Beijing P.R.China 100193
> 
> IBM环宇大厦
> 北京市海淀区东北旺西路8号,中关村软件园28号楼
> 邮编:100193
> 
>  
> 
>  
> 
> - Original message -
> From: Vinícius Ferrão <fer...@versatushpc.com.br 
> <mailto:fer...@versatushpc.com.br>>
> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net 
> <mailto:xcat-user@lists.sourceforge.net>>
> Cc:
> Subject: Re: [xcat-user] Local scratch for stateless compute nodes
> Date: Tue, Nov 28, 2017 12:14 AM
>  
> 
>  
> 
> Hello,
> 
>  
> 
> According to the documentation, they differ in a specific way:
> 
>  
> 
> . Stateless: nodes boot from a RAMdisk OS image downloaded from the xCAT mgmt 
> node or service node at boot time.
> 
>  
> 
> . Statelite: nodes boot from an NFS-root diskless OS image.
> 
>  
> 
> Both support local disks for different purposes. What we are targeting is 
> local disks just for scratch and swap. They will not hold any state, and 
> this is fully supported according to the documentation.
> 
>  
> 
> Here’s the documentation: 
> https://sourceforge.net/p/xcat/wiki/XCAT_Overview,_Architecture,_and_Planning/#xcat-cluster-node-types
>  

Re: [xcat-user] Local scratch for stateless compute nodes

2017-11-27 Thread Vinícius Ferrão
Hello,

The major problem is the missing documentation on how to use the local disks on 
a stateless (diskless) installation. Tricks are being deployed instead.

Trying to mimic the steps of a statelite installation is not sufficient to make 
it work on stateless deployments.

V.

> On 28 Nov 2017, at 00:15, Yuan Y Bai <by...@cn.ibm.com> wrote:
> 
> That is correct, diskless and statelite both support local disk for different 
> purposes now.
>  
> Best Regards
> --
> Yuan Bai (白媛)
> 
> CSTL HPC System Management Development
> Tel:86-10-82451401
> E-mail: by...@cn.ibm.com
> Address: IBM ZGC Campus. Ring Building 28,
> ZhongGuanCun Software Park,No.8 Dong Bei Wang West Road, Haidian District,
> Beijing P.R.China 100193
> 
> IBM环宇大厦
> 北京市海淀区东北旺西路8号,中关村软件园28号楼
> 邮编:100193
>  
>  
> - Original message -
> From: Vinícius Ferrão <fer...@versatushpc.com.br>
> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
> Cc:
> Subject: Re: [xcat-user] Local scratch for stateless compute nodes
> Date: Tue, Nov 28, 2017 12:14 AM
>  
>  
> Hello,
>  
> According to the documentation, they differ in a specific way:
>  
> . Stateless: nodes boot from a RAMdisk OS image downloaded from the xCAT mgmt 
> node or service node at boot time.
>  
> . Statelite: nodes boot from an NFS-root diskless OS image.
>  
> Both support local disks for different purposes. What we are targeting is 
> local disks just for scratch and swap. They will not hold any state, and 
> this is fully supported according to the documentation.
>  
> Here’s the documentation: 
> https://sourceforge.net/p/xcat/wiki/XCAT_Overview,_Architecture,_and_Planning/#xcat-cluster-node-types
>  
>  
> Thanks,
> V.
>  
> Sent from my iPhone
> 
> On 27 Nov 2017, at 12:50, Russ Auld <russa...@comcast.net 
> <mailto:russa...@comcast.net>> wrote:
>  
>> If you're using netboot and local disk, then isn't that "statelite"?
>> Do the statelite instructions not work? 
>>  
>> On Nov 27, 2017 9:26 AM, Gilad Berman <gber...@lenovo.com 
>> <mailto:gber...@lenovo.com>> wrote:
>> All,
>>  
>> I would like to join this question –
>> 
>> Does localdisk even work with stateless? From the docs it seems that it should 
>> be supported (because it is under stateless), however –
>> 
>> - the instructions are taken from statelite and refer to statelite 
>> code (litefile)
>> 
>> - The rc.localdisk code is under statelite
>> 
>> - In the linuximage man – “Partitionfile - Only available for 
>> diskful osimages and statelite osimages(localdisk enabled)“
>> 
>>  
>> A very quick trial on my stateless nodes results in nothing :), it seems there 
>> is simply no reference to localdisk with stateless.
>> 
>>  
>> So, can someone please help clarify it?
>> 
>>  
>> ** as always, there is a chance I missed something very basic and it should 
>> be working :)
>> 
>>  
>> THX in advance!
>> 
>>  
>> 
>> 
>> Gilad Berman
>> HPC Architect
>> Lenovo EMEA
>> 
>> +972-52-2554262
>> gber...@lenovo.com <mailto:gber...@lenovo.com>
>>  
>>  

Re: [xcat-user] SELinux - why not?

2017-12-01 Thread Vinícius Ferrão
AFAIK, it cannot generate BIND files correctly:

[root@cluster ~]# makedns -n
Warning: SELINUX is not disabled. The makedns command will not be able to 
generate a complete DNS setup. Disable SELINUX and run the command again.
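
A quick way to see which SELinux mode a management node is in before running makedns is sketched below. This is only an illustration: the function name selinux_mode is made up for this example, and it assumes the standard getenforce utility is what reports the mode; where that utility is missing the sketch prints Unknown rather than guessing.

```shell
#!/bin/sh
# Print the current SELinux mode: Enforcing, Permissive, or Disabled.
# Falls back to "Unknown" when the getenforce utility is not installed.
# selinux_mode is a hypothetical helper name, not an xCAT command.
selinux_mode() {
    if command -v getenforce >/dev/null 2>&1; then
        getenforce
    else
        echo Unknown
    fi
}

# Example use before generating DNS files:
#   [ "$(selinux_mode)" = "Enforcing" ] && echo "makedns will warn about SELinux"
```

This only reports the mode; it does not say which xCAT operations SELinux actually interferes with, which is the open question in this thread.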

Sent from my iPhone

> On 30 Nov 2017, at 15:31, Kevin Keane  wrote:
> 
> As far as I understand it, the recommendation is to disable SELinux with 
> xCAT. There actually is a warning built into makedns about this.
> 
> As a professional paranoid, I always prefer to have as many of the security 
> mechanisms as possible running, and I want to understand exactly why a 
> particular mechanism is not recommended.
> 
> In xCAT, what, specifically, does SELinux interfere with?
> 
> Thanks!
> 
> -- 
> ___
> Kevin Keane | Systems Architect | University of San Diego ITS | 
> kke...@sandiego.edu
> Maher Hall, 192 |5998 Alcalá Park | San Diego, CA 92110-2492 | 619.260.6859
> 


[xcat-user] Local scratch for stateless compute nodes

2017-11-21 Thread Vinícius Ferrão
Hello,

I would like to enable swap and local /tmp on my stateless nodes, but after 
following the documentation on the following link nothing appears to work:
http://xcat-docs.readthedocs.io/en/stable/advanced/hierarchy/provision/diskless_sn.html
 


I’m aware that the documentation is for service nodes and not for compute 
nodes, but I was thinking the procedure would be similar.

At this point I have these settings on the osimage:
[root@headnode xcat]# lsdef -t osimage centos7.4-x86_64-netboot-compute
exlist=/opt/xcat/share/xcat/netboot/centos/compute.centos7.exlist
imagetype=linux
osarch=x86_64
osdistroname=centos7.4-x86_64
osname=Linux
osvers=centos7.4
otherpkgdir=/install/post/otherpkgs/centos7.4/x86_64
partitionfile=/install/custom/netboot/centos7.4-x86_64-netboot-compute/partitionfile
permission=755
pkgdir=/install/centos7.4/x86_64
pkglist=/opt/xcat/share/xcat/netboot/centos/compute.centos7.pkglist
postinstall=/opt/xcat/share/xcat/netboot/centos/compute.centos7.postinstall
profile=compute
provmethod=netboot
rootimgdir=/install/netboot/centos7.4/x86_64/compute
synclists=/install/custom/netboot/compute.synclist

And the content of the partition file is the following:
cat /install/custom/netboot/centos7.4-x86_64-netboot-compute/partitionfile
enable=yes
enablepart=yes

[disk]
dev=/dev/sda
clear=yes
parts=10,90

[swapspace]
dev=/dev/sda1

[localspace]
dev=/dev/sda2
fstype=xfs
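
One sanity check that can be run against a partition file like the one above is to add up the entries on its parts= line. The sketch below assumes every entry is a plain percentage of the disk, as in parts=10,90; it does not handle size ranges. The function name parts_total is hypothetical and is not part of xCAT.

```shell
#!/bin/sh
# Sum the comma-separated entries of the first "parts=" line in a
# partition file and print the total.
# Assumption: every entry is a plain percentage (e.g. parts=10,90).
# parts_total is a hypothetical helper name, not an xCAT command.
parts_total() {
    # $1: path to the partition file
    awk -F= '$1 == "parts" {
        n = split($2, p, ",")
        total = 0
        for (i = 1; i <= n; i++) total += p[i]
        print total
        exit
    }' "$1"
}

# Example:
#   parts_total /install/custom/netboot/centos7.4-x86_64-netboot-compute/partitionfile
```

A total above 100 would point at a typo in the file; a sensible total says nothing about whether the stateless image ever reads the file, which is the actual problem described here.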

Finally the following commands were executed:
chtab priority=7.1 policy.commands=getpartition policy.rule=allow
chtab litefile.image=centos7.4-x86_64-netboot-compute litefile.file=/var/log/ litefile.options=localdisk
chtab litefile.image=centos7.4-x86_64-netboot-compute litefile.file=/tmp/ litefile.options=localdisk

After a new genimage/packimage, the local scratch space does not appear to be 
created or working.

Does anyone know what to do next?

Thanks,
V.



[xcat-user] xCAT on SGI ICE X

2018-08-28 Thread Vinícius Ferrão
Hello,

I would like to know if anyone is running xCAT on SGI ICE X. Since the SGI ICE 
X needs the CMC drivers, I’m unsure whether xCAT can be used to control the 
machines instead of the proprietary software from the now-defunct SGI.

Thanks,




[xcat-user] Proper way to upgrade images

2018-11-08 Thread Vinícius Ferrão
Hello,

I would like to open a discussion on the proper way to upgrade images to 
different OS releases.

For example, I have these images on my system:

centos7.4-x86_64-install-compute  (osimage)
centos7.4-x86_64-netboot-compute  (osimage)
centos7.4-x86_64-statelite-compute  (osimage)

Those are generated by the copycds command; I only use the netboot one. As you 
can see, it was generated from a CentOS 7.4 DVD.

Now that 7.5 is out, I would like to upgrade these images to 7.5.

I can just run yum with the installroot argument and fire up an upgrade. But 
would this properly update the image?

If yes, the naming would become confusing, since the image has gone to 7.5 but 
the name stays 7.4.

Someone may say that I need to reimport the images with copycds once again, but 
doing this, I would lose all the customisations of the image and the software 
installed.

So, what’s the right procedure? What am I missing?

Thanks,
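
One possible way around the naming problem is to clone the osimage definition under a new name rather than reimporting it. The sketch below only does the text rewriting on a definition stanza of the kind lsdef -z prints; bump_stanza_version is a hypothetical helper name. Whether this is the procedure the xCAT developers recommend is not something this thread establishes.

```shell
#!/bin/sh
# Rewrite "centos<old>" to "centos<new>" everywhere in an xCAT definition
# stanza read from stdin and write the result to stdout.
# bump_stanza_version is a hypothetical helper name, not an xCAT command.
# $1: old version (e.g. 7.4)   $2: new version (e.g. 7.5)
bump_stanza_version() {
    old=$(printf '%s' "$1" | sed 's/\./\\./g')   # escape dots for the sed pattern
    sed "s/centos${old}/centos$2/g"
}

# Possible use on a management node (not run here):
#   lsdef -t osimage -o centos7.4-x86_64-netboot-compute -z \
#       | bump_stanza_version 7.4 7.5 | mkdef -z
```

Every path in the stanza is rewritten too (pkgdir, rootimgdir, partitionfile and so on), so the corresponding 7.5 directories would have to exist or be copied before genimage is run against the new definition.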





Re: [xcat-user] Using online repos with xCAT

2018-11-12 Thread Vinícius Ferrão
Kevin, you might want to check upstream_sync.py, for mirroring RHN:
https://github.com/pyther/upstream_sync 


Let me hijack this thread: what would be the correct procedure to ditch 
copycds? What would then create the image entries in the definition tables?

Thanks,

> On 12 Nov 2018, at 15:26, Kevin Keane  wrote:
> 
> Thank you! I'm not sure when I'd get around to doing such a PR.
> 
> For the time being, my plan B is to build a local repository, similar to what 
> copycds does - but I'll use reposync to retrieve the RPMs from the 
> SSL-protected repository.
> 
> Even with that limitation, your pointer to being able to use repository URLs 
> is still invaluable - I can use that approach for the EPEL and other 
> repositories.
> 
> Thanks!
> ___
> Kevin Keane | Systems Architect | University of San Diego ITS | 
> kke...@sandiego.edu 
> Maher Hall, 192 |5998 Alcalá Park | San Diego, CA 92110-2492 | 619.260.6859
> 
> REMEMBER! No one from IT at USD will ever ask to confirm or supply your 
> password.
> These messages are an attempt to steal your username and password. Please do 
> not reply to, click the links within, or open the attachments of these 
> messages. Delete them!
> 
> 
> 
> 
> 
> On Mon, Nov 12, 2018 at 2:55 AM Song BJ Yang wrote:
> Hi Kevin,
>  
> We do not have such support at the moment, sorry for the inconvenience. One 
> possible solution is to use the `postinstall` attribute to inject scripts that 
> compose additional yum config files and RHN certificates and invoke yum commands 
> to install/update packages from RHN. We will consider this feature request, 
> but cannot promise its priority or date.
>  
> or if you would like to complete some mechanism to support this, it will be 
> much appreciated if you can create a PR on this
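
The postinstall idea mentioned above could look roughly like the sketch below: a helper that drops a yum .repo file with client-certificate settings into the image root. The function name write_cert_repo, the repository id, the URL and the certificate paths are all hypothetical; sslverify, sslclientcert and sslclientkey are standard yum repository options. How the script learns the image root is not stated in this thread, so it is passed in as an argument here.

```shell
#!/bin/sh
# Write a yum .repo file that uses a client certificate into an image root.
# write_cert_repo and every value passed to it are hypothetical examples.
write_cert_repo() {
    rootdir=$1    # image root directory the repo file is written under
    repoid=$2     # repository id, also used as the file name
    baseurl=$3    # https URL of the certificate-protected repository
    cert=$4       # client certificate path as seen from inside the image
    key=$5        # client key path as seen from inside the image

    mkdir -p "$rootdir/etc/yum.repos.d" || return 1
    cat > "$rootdir/etc/yum.repos.d/$repoid.repo" <<EOF
[$repoid]
name=$repoid
baseurl=$baseurl
enabled=1
sslverify=1
sslclientcert=$cert
sslclientkey=$key
EOF
}

# Possible call from a postinstall script (placeholder values):
#   write_cert_repo "$installroot" rhel-sat https://satellite.example.com/repo \
#       /etc/pki/client.pem /etc/pki/client.key
```

The certificate and key files themselves would still have to be copied into the image at the paths named in the repo file.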
>  
>  
> best regards
> --
> YANG Song (杨嵩)
> IBM China System Technology Laboratory
> Tel: 86-10-82452903
> Email: yang...@cn.ibm.com 
> Address: Building 28, ZhongGuanCun Software Park,
> No.8, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC
> 
> 北京市海淀区东北旺西路8号中关村软件园28号楼
> 邮编: 100193
>  
>  
> - Original message -
> From: Kevin Keane <kke...@sandiego.edu>
> To: xCAT Users Mailing list
> Cc:
> Subject: Re: [xcat-user] Using online repos with xCAT
> Date: Sat, Nov 10, 2018 8:36 AM
>  
> Thank you so much for pointing out that pkgdir can also take a repository URL! 
> That will turn out to be invaluable in our situation, once I get it to work.
>  
> My next question is whether there is a way to inject a setting into the yum 
> configuration file that genimage uses? We are a RedHat shop and have enough 
> RedHat subscriptions for all our nodes, of course. We keep our yum repository 
> on a Satellite server. But that only works with the configuration in 
> /etc/yum.repos.d, not with the transient one that genimage generates on 
> the fly in /tmp. The actual problem is that the yum repository is protected 
> with a client certificate, which has to be added to the repository.
>  
> ___
> Kevin Keane | Systems Architect | University of San Diego ITS | 
> kke...@sandiego.edu 
> Maher Hall, 192 |5998 Alcalá Park | San Diego, CA 92110-2492 | 619.260.6859
> 
> 
>  
>  
> On Tue, Sep 4, 2018 at 8:10 PM Song BJ Yang wrote:
> Hi Keith,
>  
> >  The references I see to creating repos (base, or updates) on the xCAT 
> > management node all indicate that only local repos are supported (e.g. 
> > "baseurl=file:///..").
>  
> For Red Hat diskless osimages, we do support an online repo in pkgdir, for 
> example: 
> `chdef -t osimage -o myosimage -p pkgdir=http://rh.com/updates`
> --
> YANG Song (杨嵩)
> IBM China System Technology Laboratory
> Tel: 86-10-82452903
> Email: yang...@cn.ibm.com 
> Address: Building 28, ZhongGuanCun Software Park,
> No.8, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC
> 
> 北京市海淀区东北旺西路8号中关村软件园28号楼
> 邮编: 100193
>  
>  
> - Original message -
> From: "Bin XA Xu" <bx...@cn.ibm.com>
> To: xcat-user@lists.sourceforge.net 
> Cc: xcat-user@lists.sourceforge.net 
> Subject: Re: [xcat-user] Using online repos with xCAT
> 

Re: [xcat-user] Using chrony instead of ntp?

2018-11-29 Thread Vinícius Ferrão
Thank you Gong.

I will report back if anything goes off track.

> On 29 Nov 2018, at 04:10, GONG Jie  wrote:
> 
> Hello Vinícius Ferrão,
> 
> Please make sure chronyd is installed on the node you intend to use it. And 
> redo the xCAT ntp setup procedure on that node.
> 
> For the xCAT management node, do the following,
>   1. Make sure chronyd is installed on xCAT management node.
>   2. Run `makentp`
> 
> For xCAT compute nodes, do the following,
>   1. Make sure chronyd is installed on xCAT compute nodes you intend to use 
> chronyd
>   2. Run `xdsh compute_nodes setupntp`, where compute_nodes is your node 
> range.
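
Before redoing the NTP setup as described above, it can be useful to check which time daemon is actually present on a host. The sketch below is only an illustration; detect_time_daemon is a made-up name, and it looks for the binaries on the current PATH, which may miss a daemon installed under a directory such as /usr/sbin when the check is run as an unprivileged user.

```shell
#!/bin/sh
# Report which time daemon binary is found on PATH.
# Prints "chronyd", "ntpd", or "none".
# detect_time_daemon is a hypothetical helper name, not an xCAT command.
detect_time_daemon() {
    if command -v chronyd >/dev/null 2>&1; then
        echo chronyd
    elif command -v ntpd >/dev/null 2>&1; then
        echo ntpd
    else
        echo none
    fi
}

# Possible use on the management node before the makentp step quoted above:
#   [ "$(detect_time_daemon)" = "chronyd" ] && makentp
```

chronyd is checked first because, per this thread, xCAT uses chrony when it finds it and otherwise falls back to ntp.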
> 
> GONG Jie (龚颉)
> China Systems & Technology Lab, Beijing
> Tel: +86-10-82454979
> Email: gong...@linux.vnet.ibm.com <mailto:gong...@linux.vnet.ibm.com>
> Address: Ring Building, No.28, Zhongguancun Software Park, No.8 Dongbeiwang 
> West Road, Haidian District, Beijing, P.R.China 100193
> 
> 
> 
>> On Nov 29, 2018, at 05:36, Vinícius Ferrão <fer...@versatushpc.com.br> wrote:
>> 
>> Hello Casandra,
>> 
>> Thanks for sharing the correct info. For an xCAT already deployed on 
>> RHEL7, is it possible to switch it from ntpd to chronyd?
>> 
>> Which steps are needed?
>> 
>> Thanks,
>> 
>> Sent from my iPhone
>> 
>> On 28 Nov 2018, at 19:24, Casandra H Qiu <cxh...@us.ibm.com> wrote:
>> 
>>> If xCAT finds chronyd on the system, it will use chrony; otherwise 
>>> it will fall back to ntp.
>>> 
>>> 
>>> Thanks,
>>> Casandra Qiu
>>> 
>>> ...
>>> Casandra Hong Qiu
>>> Phone: (845) 433-9291, t/l 293-9291
>>> Office: Building 8, 3-B-04
>>> cxh...@us.ibm.com <mailto:cxh...@us.ibm.com>
>>> 
>>> 
>>> 
>>> 
>>> From: Ryan Novosielski <novos...@rutgers.edu>
>>> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
>>> Date: 11/28/2018 03:49 PM
>>> Subject: Re: [xcat-user] Using chrony instead of ntp?
>>> 
>>> 
>>> 
>>> 
>>> You happen to know if this is possible to make use of also on RHEL7 without 
>>> much fuss?
>>> 
>>> Sent from my iPhone
>>> 
>>> On Nov 28, 2018, at 15:16, Casandra H Qiu <cxh...@us.ibm.com> wrote:
>>> xCAT changed its code in 2.14.3 to use chrony instead of ntp for RHEL8 
>>> support. 
>>> 
>>> 
>>> Thanks,
>>> Casandra Qiu
>>> 
>>> ...
>>> Casandra Hong Qiu
>>> Phone: (845) 433-9291, t/l 293-9291
>>> Office: Building 8, 3-B-04
>>> cxh...@us.ibm.com <mailto:cxh...@us.ibm.com>
>>> 
>>> 
>>> 
>>> 
>>> From: Kevin Keane <kke...@sandiego.edu>
>>> To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
>>> Date: 11/28/2018 02:21 PM
>>> Subject: [xcat-user] Using chrony instead of ntp?
>>> 
>>> 
>>> 
>>> 
>>> xCAT is designed to use ntp, of course. But RedHat makes chrony the default 
>>> (and in the upcoming RHEL 8, ntpd will not even be included any more).
>>> 
>>> What do I need to consider when using chrony instead of ntpd? Obviously, I 
>>> can't run makentp, and would have to configure it manually. Is there 
>>> anything else I need to pay attention to?
>>> 
>>> Thanks!
>>> ___
>>> Kevin Keane | Systems Architect | University of San Diego ITS | 
>>> kke...@sandiego.edu <mailto:kke...@sandiego.edu>
>>> Maher Hall, 192 |5998 Alcalá Park | San Diego, CA 92110-2492 | 619.260.6859
>>> 

[xcat-user] Dell Networking OS9 and OS6 Switches on xCAT

2018-12-05 Thread Vinícius Ferrão
Hello,

I would like to know if anyone has deployed xCAT with switches from this series; 
it’s basically the Dell N and Dell S lines.

According to the documentation 
(https://xcat-docs.readthedocs.io/en/stable/advanced/networks/ethernet_switches/ethernet_switches.html#switch-management),
 xCAT only supports 4 vendors out of the box, but there are instructions to add 
other devices.

So, if anyone has managed to put these switches under xCAT control, what was the 
procedure?

Thanks,





Re: [xcat-user] Dell Networking OS9 and OS6 Switches on xCAT

2018-12-06 Thread Vinícius Ferrão
Hi Carl,

I wasn’t expecting such a complete answer. That’s awesome.

So you don’t have to create a switch definition as described here, in the Add 
New Switch Types section: 
https://xcat-docs.readthedocs.io/en/stable/advanced/networks/ethernet_switches/ethernet_switches.html#switch-management

I see that you run makegocons, so is there a way to run rcons against the switch 
and get into its serial console?

Thanks,

> On 5 Dec 2018, at 20:32, Carl  wrote:
> 
> We have xCAT working fine with Dell 3048-ON switches (currently running 9.14).
> 
> We mostly used 
> https://xcat-docs.readthedocs.io/en/stable/advanced/networks/switchdiscover/switch_based_switch_discovery.html
> as a reference.
> 
> However, I use SNMP version 2c, as I was never very successful with v3 and we 
> don't need the additional security.
> 
> The other trap is that the documentation says that SNMPv1 is used by default, 
> but from my observations v3 is used by default. 
> 
> On the switch you need the following configuration
> snmp-server community public ro
> 
> I am unsure which traps need to be enabled; for completeness, here are all of 
> the ones that we have enabled. I would appreciate it if someone else on the 
> list could say which ones are needed by xCAT:
> snmp-server enable traps bgp
> snmp-server enable traps snmp authentication coldstart linkdown linkup 
> syslog-reachable syslog-unreachable
> snmp-server enable traps vrrp
> snmp-server enable traps lacp
> snmp-server enable traps entity
> snmp-server enable traps stack
> snmp-server enable traps stp
> snmp-server enable traps ecfm
> snmp-server enable traps vlt
> snmp-server enable traps xstp
> snmp-server enable traps isis
> snmp-server enable traps config
> snmp-server enable traps envmon cam-utilization fan supply temperature
> snmp-server enable traps ecmp
> 
> 
> Then it's just a matter of setting up the switch definition and pointing the 
> hosts at it. The following is based on some nodes that were recently added to 
> my system, so you would need to modify it for your environment:
> # create the switch definition with snmp settings
> mkdef -t node 1g05 groups=switches ip=100.64.24.27 nodetype=switch snmppassword=public snmpversion=2c
> # create a node group that defines the switchport based on the node number 
> mkdef -t group 1g05-d03-41 switch=1g05 switchport='|\D+(\d+)|($1-60)|'
> # create the node definitions
> mkdef -t node comp[061-080] groups=compute,1g05-d03-41,all,ipmi
> makehosts
> makedns
> makegocons
> 
> Then you can turn on the nodes; they should be discovered and set up. 
> 
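
The switchport expression in the group definition above, '|\D+(\d+)|($1-60)|', reads as "take the number in the node name and subtract 60", so comp061 through comp080 would land on ports 1 through 20. The sketch below reproduces only that arithmetic in plain shell so a mapping can be checked by hand; switchport_for is a hypothetical helper name and this is not how xCAT itself evaluates the expression.

```shell
#!/bin/sh
# Compute the switch port implied by "node number minus 60" for a node name
# such as comp061. Leading zeros are stripped first so the shell does not
# treat the number as octal.
# switchport_for is a hypothetical helper name, not an xCAT command.
switchport_for() {
    # $1: node name, e.g. comp061
    num=$(printf '%s' "$1" | sed 's/^[^0-9]*//; s/^0*//')
    echo $(( ${num:-0} - 60 ))
}

# Example:
#   for n in comp061 comp070 comp080; do
#       echo "$n -> port $(switchport_for "$n")"
#   done
```

The offset of 60 is specific to the node range in this example; a different range on another switch would need its own group and its own offset.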
> Cheers,
> 
> Carl.
> 
> 
> 
> On Thu, 6 Dec 2018 at 03:07, Vinícius Ferrão <fer...@versatushpc.com.br> wrote:
> Hello,
> 
> I would like to know if anyone has deployed xCAT with switches from this series; 
> it’s basically the Dell N and Dell S lines.
> 
> According to the documentation 
> (https://xcat-docs.readthedocs.io/en/stable/advanced/networks/ethernet_switches/ethernet_switches.html#switch-management),
> xCAT only supports 4 vendors out of the box, but there are instructions to 
> add other devices.
> 
> So, if anyone has managed to put these switches under xCAT control, what was the 
> procedure?
> 
> Thanks,
> 
> 
> 

___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


[xcat-user] Sequential-based discovery when BMC is on management network

2018-12-01 Thread Vinícius Ferrão
Hello,

We have some machines with shared BMC on the first network interface.

With other cluster management software (Rocks, for instance) things get really 
messy when the BMC is discovered as a compute node during the process.

So, how does xCAT handle this scenario? Does it properly discover the BMC as a 
BMC device, or does it require some kind of intervention?

I took a look at the docs, but they only cover scenarios with a segregated BMC 
network: 
https://xcat-docs.readthedocs.io/en/stable/guides/admin-guides/manage_clusters/ppc64le/discovery/seq_discovery.html

Thanks,


Sent from my iPhone
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] Using chrony instead of ntp?

2018-11-28 Thread Vinícius Ferrão
Hello Casandra,

Thanks for sharing the correct info. For an xCAT that is already deployed on 
RHEL7, is it possible to switch it over to chronyd instead of ntpd?

Which steps are needed?

Thanks,

Sent from my iPhone

> On 28 Nov 2018, at 19:24, Casandra H Qiu  wrote:
> 
> If xCAT finds chronyd on the system, it will use chrony; otherwise 
> it falls back to ntp.
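
As a rough illustration of the fallback described above (use chrony if chronyd is present, otherwise ntp), something like the following could be used to see which branch a given management node would take. This is only a sketch: it is not xCAT's actual detection code, and the function name pick_time_service is made up for the example.

```shell
# Sketch only: not xCAT's real logic, just the "chronyd if present,
# otherwise ntpd" rule expressed as a PATH lookup.
pick_time_service() {
    if command -v chronyd >/dev/null 2>&1; then
        echo chronyd
    else
        echo ntpd
    fi
}

echo "time service that would be selected: $(pick_time_service)"
```

Whether xCAT looks for the binary, the package, or the systemd unit is not stated in this thread, so treat the PATH lookup as an assumption.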
> 
> 
> Thanks,
> Casandra Qiu
> 
> ...
> Casandra Hong Qiu
> Phone: (845) 433-9291, t/l 293-9291
> Office: Building 8, 3-B-04
> cxh...@us.ibm.com
> 
> 
> 
> 
> From: Ryan Novosielski 
> To: xCAT Users Mailing list 
> Date: 11/28/2018 03:49 PM
> Subject: Re: [xcat-user] Using chrony instead of ntp?
> 
> 
> 
> 
> You happen to know if this is possible to make use of also on RHEL7 without 
> much fuss?
> 
> Sent from my iPhone
> 
> On Nov 28, 2018, at 15:16, Casandra H Qiu  wrote:
> xCAT modified its code in 2.14.3 to use chrony instead of ntp for RHEL8 
> support. 
> 
> 
> Thanks,
> Casandra Qiu
> 
> ...
> Casandra Hong Qiu
> Phone: (845) 433-9291, t/l 293-9291
> Office: Building 8, 3-B-04
> cxh...@us.ibm.com
> 
> 
> 
> 
> From: Kevin Keane 
> To: xCAT Users Mailing list 
> Date: 11/28/2018 02:21 PM
> Subject: [xcat-user] Using chrony instead of ntp?
> 
> 
> 
> xCAT is designed to use ntp, of course. But RedHat makes chrony the default 
> (and in the upcoming RHEL 8, ntpd will not even be included any more).
> 
> What do I need to consider when using chrony instead of ntpd? Obviously, I 
> can't run makentp, and would have to configure it manually. Is there anything 
> else I need to pay attention to?
> 
> Thanks!
> ___
> Kevin Keane | Systems Architect | University of San Diego ITS | 
> kke...@sandiego.edu
> Maher Hall, 192 |5998 Alcalá Park | San Diego, CA 92110-2492 | 619.260.6859
> 
> 
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] Removing BIND from xCAT

2019-09-15 Thread Vinícius Ferrão via xCAT-user
Jon, thank you so much.

I will take some time to test these changes with your comments. I will report 
back to the list.

Really, this message was awesome.

Thank you one more time.
V.

> On 12 Sep 2019, at 06:11, Jon Diprose  wrote:
> 
> Hi Vinícius,
>  
> I am looking at exactly this at the moment. My experience so far is that:
>  
> - xCAT’s ‘makedns -e’ uses TSIG to update at least the first dns server in 
> the master’s /etc/resolv.conf
> - xCAT’s TSIG key appears to be hmac-md5
> + I’d like to know if I could go to hmac-sha512 instead but I think that may 
> be hardcoded as the hashing function declaration isn’t in the omapi entry of 
> the password table, just the secret
> - https://www.freeipa.org/page/Howto/DNS_updates_and_zone_transfers_with_TSIG 
> basically applies
> - add the xcat_key stanza to the /etc/named.conf files and ‘rndc reload’ on 
> all FreeIPA replicas
> - for the relevant FreeIPA forward zones the update-policy ‘grant xcat_key 
> zonesub A CNAME;’ is required in addition to whatever is already there
> + if you are doing that at the command line, ‘ipa dnszone-show zone.name. 
> --all’ shows the existing policy
> + note that ‘ipa dnszone-mod zone.name. --update-policy …’ replaces and does 
> not append
> - for the relevant FreeIPA reverse zones the update-policy ‘grant xcat_key 
> zonesub PTR;’ is required in addition to whatever is already there
> - those may not be the most appropriate policy wordings but they work for me
> - ‘ipa dnszone-mod zone.name. --dynamic-update true’ is required for both 
> forward and reverse zones
> - the ‘@’ records and Authoritative Server settings that FreeIPA creates by 
> default may need adjusting if those defaults are not reachable by your xCAT 
> master
> - you can test the talking-to-FreeIPA bit without any of the xCAT stuff using 
> the ‘nsupdate’ command
> - I haven’t yet attempted to enrol my xCAT master as an IPA client so I’ve no 
> idea if kinit’ing with appropriate privilege would make the TSIG key work 
> unnecessary - I don’t know if xCAT can speak GSS-TSIG
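
Since the list above notes that ‘ipa dnszone-mod zone.name. --update-policy …’ replaces the policy rather than appending to it, here is a small sketch of how the combined policy string could be assembled before it is applied. The helper append_grant and the sample existing policy are assumptions for illustration; the real existing policy is whatever ‘ipa dnszone-show zone.name. --all’ reports. The ipa command is only printed, not executed.

```shell
# Hypothetical helper: add a grant to an existing update-policy string
# unless that exact grant is already there.
append_grant() {
    ag_existing=$1
    ag_grant=$2
    case "$ag_existing" in
        *"$ag_grant"*) printf '%s\n' "$ag_existing" ;;          # already present
        "")            printf '%s\n' "$ag_grant" ;;             # no policy yet
        *)             printf '%s %s\n' "$ag_existing" "$ag_grant" ;;
    esac
}

# Assumed example of a policy already set on the forward zone
existing='grant EXAMPLE.COM krb5-self * A;'
policy=$(append_grant "$existing" 'grant xcat_key zonesub A CNAME;')

# Print, rather than run, the command that would apply the merged policy
printf 'ipa dnszone-mod zone.name. --dynamic-update true --update-policy "%s"\n' "$policy"
```

For a reverse zone the same helper would be called with 'grant xcat_key zonesub PTR;' instead.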
>  
>  
> ‘makedns -e’ now almost works for me - it updates all the IPA DNS records 
> that I am expecting from my xCAT config and a few more I wasn’t expecting 
> from having manually added stuff to my /etc/hosts, all without touching the 
> existing local config. It is still returning an exit code of 1 so there’s 
> still something to track down, but I think that is now down to 
> inconsistencies and oddities in my xCAT config and /etc/hosts file, 
> complicated by my particular setup not being authoritative for some domains I 
> use.
>  
> I also ship fully-populated /etc/hosts files to all our xCAT-managed nodes, 
> so I’m hoping for a seamless changeover when redirecting the nodes to the 
> FreeIPA DNS instances instead of the one on the xCAT master.
>  
> I hope that helps and I’d appreciate hearing about anything you learn along 
> the way!
>  
> Jon
>  
> --
> Dr. Jonathan Diprose <j...@well.ox.ac.uk>
> Tel: 01865 287837
> Research Computing Manager
> Henry Wellcome Building for Genomic Medicine Roosevelt Drive, Headington, 
> Oxford OX3 7BN
>  
> From: Vinícius Ferrão via xCAT-user [mailto:xcat-user@lists.sourceforge.net] 
> Sent: 11 September 2019 15:32
> To: xCAT Users Mailing list
> Cc: Vinícius Ferrão
> Subject: [xcat-user] Removing BIND from xCAT
>  
> Hello,
>  
> I came across this documentation page:
> https://xcat-docs.readthedocs.io/en/stable/advanced/domain_name_resolution/domain_name_resolution.html#option-2-use-a-dns-that-is-outside-of-the-cluster
>  
> It says specifically that I can use an external DNS server.
>  
> So my question is: with this option, does xCAT not use the shipped BIND at all?
>  
> Can it coexist with another BIND daemon on the same machine?
>  
> I’m interested in installing FreeIPA and enabling DNS integrated Zones, so 
> FreeIPA handles the DNS service.
>  
> Thanks,
>  
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


[xcat-user] xCAT forcibly disabling SELinux and firewalld

2019-09-26 Thread Vinícius Ferrão via xCAT-user
Hello,

When installing xCAT on EL7 with yum install xCAT, it just puts SELinux in 
permissive mode and disables firewalld.

It does not even ask about it. It just does.

[root@headnode ~]# getenforce 
Permissive
[root@headnode ~]# systemctl status firewalld
● firewalld.service - firewalld - dynamic firewall daemon
   Loaded: loaded (/usr/lib/systemd/system/firewalld.service; disabled; vendor preset: enabled)
   Active: inactive (dead)
     Docs: man:firewalld(1)

Sep 26 02:55:55 headnode.cluster.iq.ufrj.br systemd[1]: Starting firewalld - dynamic firewall daemon...
Sep 26 02:55:56 headnode.cluster.iq.ufrj.br systemd[1]: Started firewalld - dynamic firewall daemon.
Sep 26 03:09:18 headnode.cluster.iq.ufrj.br systemd[1]: Stopping firewalld - dynamic firewall daemon...
Sep 26 03:09:21 headnode.cluster.iq.ufrj.br systemd[1]: Stopped firewalld - dynamic firewall daemon.

Is there a way to avoid this behaviour?

Thanks,

PS: I’m aware of the consequences of firewalld and SELinux in xCAT environments.
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] [External] xCAT forcibly disabling SELinux and firewalld

2019-09-26 Thread Vinícius Ferrão via xCAT-user
[...] grep Subject /etc/xcat/cert/server-cert.pem 2>&1

Running command on headnode.cluster.iq.ufrj.br: find /usr/share/zoneinfo -xtype f -exec cmp -s /etc/localtime {} \; -print | grep -v posix | grep -v SystemV | grep -v right | grep -v localtime  2>&1

Running command on headnode.cluster.iq.ufrj.br: /install/postscripts/syslog 2>&1

syslog has been set up.
Running command on headnode.cluster.iq.ufrj.br: systemctl set-environment TERM=xterm-256color 2>&1

Imported TERM=xterm-256color into systemd.
Running command on headnode.cluster.iq.ufrj.br: systemctl enable named 2>&1

Could not enable dns server.
Running command on headnode.cluster.iq.ufrj.br: XCATBYPASS=Y /opt/xcat/sbin/makenetworks 2>&1

The makenetworks command was run with no error.
httpd has been restarted.
Running command on headnode.cluster.iq.ufrj.br: systemctl enable httpd 2>&1

httpd has been enabled.
Running command on headnode.cluster.iq.ufrj.br: systemctl enable dhcpd 2>&1

SELINUX is not disabled, disabling it now...
Running command on headnode.cluster.iq.ufrj.br: echo 0 > /sys/fs/selinux/enforce 2>&1

Running command on headnode.cluster.iq.ufrj.br: sed -i 's/^SELINUX=.*$/SELINUX=disabled/' /etc/selinux/config 2>&1

Running command on headnode.cluster.iq.ufrj.br: type -P SuSEfirewall2 >/dev/null 2>&1

Running command on headnode.cluster.iq.ufrj.br: type -P SuSEfirewall2 >/dev/null 2>&1

Running command on headnode.cluster.iq.ufrj.br: systemctl disable firewalld 2>&1

xCAT is now running, it is recommended to tabedit networks
and set a dynamic ip address range on any networks where nodes
are to be discovered. Then, run makedhcp -n to create a new dhcpd
configuration file, and /etc/init.d/dhcpd restart. Either examine sample
configuration templates, or write your own, or specify a value per
node with nodeadd or tabedit.
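
To see what the sed substitution in the output above does without touching a real system, it can be replayed on a scratch file. This is only a sketch: the sample file contents are made up, and `sed -i` without a suffix assumes GNU sed, as found on EL7.

```shell
# Work on a scratch copy, never on the real /etc/selinux/config
scratch=$(mktemp)
printf '%s\n' '# sample config' 'SELINUX=enforcing' 'SELINUXTYPE=targeted' > "$scratch"

# Same substitution as in the xcatconfig output
sed -i 's/^SELINUX=.*$/SELINUX=disabled/' "$scratch"

# Read the resulting mode back; SELINUXTYPE= is left untouched by the pattern
selinux_mode=$(sed -n 's/^SELINUX=//p' "$scratch")
echo "SELINUX is now: $selinux_mode"
rm -f "$scratch"
```

The edit to the config file only takes effect at the next boot, while the `echo 0 > /sys/fs/selinux/enforce` step switches the running system to permissive, which matches getenforce reporting Permissive right after the install.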


On 26 Sep 2019, at 13:11, Vinícius Ferrão via xCAT-user 
<xcat-user@lists.sourceforge.net> wrote:

Hello Jarrod, so you’re the guy who can help out :)

The message was originally about firewalld and SELinux, but it can be extended 
to a lot of other things, like the automatic names that xCAT creates for 
networks and things like that.

So I think a package without automatic setup would be really welcome. But is 
there a place where everything that the xCAT package install does is 
documented?

If not, is there a way to get this from the package? I think the commands would 
be sufficient.

In a complex or custom environment, manual installation, with other tools 
automating things, is a good idea.

Sent from my iPhone

On 26 Sep 2019, at 03:54, Jarrod Johnson <jjohns...@lenovo.com> wrote:


I've been considering removing all of that from executing on rpm install (also 
enabling services to start on boot just by installing rpm)

It was added for convenience of not asking to run a setup after install but it 
is inconsistent with general rpm behavior and limits ability to use flags to 
customize behavior.

On the flip side, this would be a change that people would have to learn and 
would surprise new installs.

I might make a variant of the xCAT meta package with no auto setup so that 
people won't be surprised unless they opt into the other package.

Looking for thoughts.

For wider information, it doesn't yet have os deployment, but confluent has 
been developing and designing specifically with firewall and selinux in mind, 
as well as trying to mitigate the initial setup complexity that drove us to 
create xcatconfig in the first place.  For example no more tls certs required 
for local access and os import will no longer loop mount isos (one of the 
biggest selinux problems) and avoid rewriting other service etc files in daemon 
context.  More straightforward network usage and a documented set of firewalld 
commands.


Re: [xcat-user] [External] xCAT forcibly disabling SELinux and firewalld

2019-09-26 Thread Vinícius Ferrão via xCAT-user
Hello Kevin, I answered at the same time.

Take a look at that answer; there’s a single command that does everything. It 
really boils down to not running that command. In the xCAT package it only runs 
on the first install:

. /etc/profile.d/xcat.sh
if [ "$1" = "1" ]; then # Only if installing for the first time
    $RPM_INSTALL_PREFIX0/sbin/xcatconfig -i
else
    if [ -r "/tmp/xcat/installservice.pid" ]; then
        mv /tmp/xcat/installservice.pid /var/run/xcat/installservice.pid
    fi
    if [ -r "/tmp/xcat/udpservice.pid" ]; then
        mv /tmp/xcat/udpservice.pid /var/run/xcat/udpservice.pid
    fi
    if [ -r "/tmp/xcat/mainservice.pid" ]; then
        mv /tmp/xcat/mainservice.pid /var/run/xcat/mainservice.pid
    fi
    # (the scriptlet continues; the excerpt ends here)
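
The `"$1" = "1"` test in the excerpt above relies on the usual RPM scriptlet convention: a post-install scriptlet receives the number of installed instances of the package as its first argument, so it is 1 on a first install and 2 or more on an upgrade. A minimal sketch of that branching, with a made-up function name post_action and echo standing in for the real work:

```shell
# Illustration of the RPM scriptlet argument convention only;
# post_action is hypothetical and nothing here runs xcatconfig.
post_action() {
    if [ "$1" = "1" ]; then
        echo "first install: xcatconfig -i would run"
    else
        echo "upgrade: xcatconfig -i is skipped"
    fi
}

post_action 1
post_action 2
```

The scriptlets a package carries can be listed with `rpm -q --scripts <package>`, which is one way to see what an install does without reading the spec file.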



On 26 Sep 2019, at 13:38, Kevin Keane <kke...@sandiego.edu> wrote:

Just a thought - you could get the best of both worlds by removing this 
behavior from the RPMs, and creating a separate "setup" RPM that does all these 
things.

This behavior should really be removed from the main RPMs because otherwise, 
these actions are repeated on updates.

___
Kevin Keane | Systems Architect | University of San Diego ITS | 
kke...@sandiego.edu
Maher Hall, 192 |5998 Alcalá Park | San Diego, CA 92110-2492 | 619.260.6859 | 
Text: 760-721-8339




Re: [xcat-user] [External] xCAT forcibly disabling SELinux and firewalld

2019-09-26 Thread Vinícius Ferrão via xCAT-user
Hello Jarrod, so you’re the guy who can help out :)

The message was originally about firewalld and SELinux, but it can be extended 
to a lot of other things, like the automatic names that xCAT creates for 
networks and things like that.

So I think a package without automatic setup would be really welcome. But is 
there a place where everything that the xCAT package install does is 
documented?

If not, is there a way to get this from the package? I think the commands would 
be sufficient.

In a complex or custom environment, manual installation, with other tools 
automating things, is a good idea.

Thanks.

Sent from my iPhone

___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] Removing BIND from xCAT

2019-09-22 Thread Vinícius Ferrão via xCAT-user
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


[xcat-user] Timezone settings: how it works?

2019-12-13 Thread Vinícius Ferrão via xCAT-user
Hello,

I’m having an issue with the timezone settings in xCAT.

I do have the configuration in place on xCAT:
[root@headnode etc]# lsdef -t site clustersite | grep timezone
timezone=America/Sao_Paulo

But when I run packimage, it picks up a timezone file from a source I can’t 
identify.

Here’s the md5 of /etc/localtime inside the image:
md5sum /install/netboot/centos7.6/x86_64/compute/rootimg/etc/localtime 
346a7b10fbc17af944896bad4d093568

It should be this one:
md5sum /etc/localtime 
339ff8dbd88e39f59b2fd4ee57b1fdb1  /etc/localtime

I even tried searching for this specific md5 under /usr/share/zoneinfo, but I 
was unable to find it:
find . -type f -exec md5sum {} + | grep 346a7b10fbc17af944896bad4d093568

So the question is: where does xCAT get the zone info from when running 
packimage?
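
An alternative to hashing every file is a byte-for-byte comparison with cmp, which prints any zone file identical to the one inside the image. The helper name find_identical is made up for this sketch; the paths are the ones from the message above, and the guard simply skips the search on machines where they do not exist.

```shell
# Hypothetical helper: print every regular file under directory $2
# whose contents are identical to file $1.
find_identical() {
    find "$2" -type f -exec cmp -s "$1" {} \; -print
}

# Paths taken from the message; nothing is searched if they are missing
img=/install/netboot/centos7.6/x86_64/compute/rootimg/etc/localtime
if [ -r "$img" ] && [ -d /usr/share/zoneinfo ]; then
    find_identical "$img" /usr/share/zoneinfo
fi
```

If this prints nothing either, the file in the image probably did not come from the management node's /usr/share/zoneinfo at all. The zoneinfo tree inside the rootimg itself would be the next place to run the same search, though that is a guess rather than something confirmed in this thread.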

Thanks,


___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] What is the purpose of moncfg, monadd, monstart commands?

2019-12-12 Thread Vinícius Ferrão via xCAT-user
Hello Casandra,

Is there a list of the supported features of xCAT 2.15? We hit this issue 
and were scratching our heads for days. As far as I know there’s a lot of 
legacy code on GitHub, and even some leftovers on 
xcat-docs.readthedocs.io; the xCAT mon commands are documented there, for 
instance.

We tried to use them because there are still man pages for them, so we thought 
they were functional.

Thank you.

Vinícius.


On 12 Dec 2019, at 11:52, Casandra H Qiu <cxh...@us.ibm.com> wrote:


If the documentation wasn't moved from SourceForge to readthedocs, most likely 
xCAT no longer supports it.
The functions are still in xCAT, but I don't think they are up to date.
Please feel free to open a pull request if you want to make some 
modifications.

I saw some of commands are in the readthedocs
https://xcat-docs.readthedocs.io/en/stable/guides/admin-guides/references/man1/monstart.1.html
https://xcat-docs.readthedocs.io/en/stable/guides/admin-guides/references/man1/monstop.1.html


Thanks,
Casandra
...
Casandra Hong Qiu
Phone: (845) 433-9291, t/l 293-9291
Office: Building 8, 3-B-04
cxh...@us.ibm.com




From: Daniel Hilst via xCAT-user <xcat-user@lists.sourceforge.net>
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Cc: Daniel Hilst <dan...@versatushpc.com.br>
Date: 12/10/2019 03:43 PM
Subject: [EXTERNAL] Re: [xcat-user] What is the purpose of moncfg, monadd, monstart commands?





I followed this doc: 
https://sourceforge.net/p/xcat/wiki/Monitoring_an_xCAT_Cluster/#ganglia-monitoring

Is there any up to date documentation? I couldn't find it on readthedocs

Regards


From: Daniel Hilst
Sent: Tuesday, 10 December 2019 17:11
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Subject: What is the purpose of moncfg, monadd, monstart commands?

I'm configuring Ganglia on a cluster that runs xCAT, and I came across this page.

I followed the instructions for Ganglia, but I'm using the OpenHPC Ganglia, so I 
made some modifications here and there. What I can't understand is the purpose 
of these commands. Is it to configure the compute nodes in the Ganglia 
configuration files?

And what do monstart/monstop do?

Best regards,

___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


[xcat-user] Support for ARM booting

2019-10-24 Thread Vinícius Ferrão via xCAT-user
Hello,

I’m adding some ARM compute nodes to an xCAT cluster and I got the bad news: 
xCAT does not support aarch64.

Looking around, there’s some information about prototypes:
https://sourceforge.net/p/xcat/mailman/message/35161961/

There’s a ticket about this too:
https://github.com/xcat2/xcat-core/issues/2946

So the question is: what’s the state of aarch64 today? Is there any workaround, 
even an unofficial one, to at least boot aarch64 nodes from a standard x86_64 
management node?

Thanks,

___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] What is the difference of pkglist and otherpkglist

2019-11-01 Thread Vinícius Ferrão via xCAT-user
Hello Arif, that’s an interesting way to do these things. I use a completely 
unapproved method of chrooting and yum installroot, which I know is broken.

But I had a different understanding of pkglist and otherpkglist. My 
understanding was that otherpkgs is for things that are standalone, for 
instance some commercial scientific software that ships as a single RPM.

Everything that has a web repository, like packages from EPEL, OpenHPC, 
Mellanox OFED, etc., should go in pkglist.

One thing we can agree on: this part of xCAT is extremely confusing and not 
well explained.

Sent from my iPhone

On 1 Nov 2019, at 09:51, Arif Ali  wrote:


I need to install some software from the OpenHPC repository and I'm in doubt 
about which one to use, pkglist/dir or otherpkglist/dir?

What is the difference between these two?

Hi Daniel,

This is by no means the way the xCAT guys document it, but this is how we do 
it and how we see things.

pkglist and pkgdir are for packages that come from the distribution operating 
system, so you should be able to just list these packages without naming any 
folders. These would be installed as part of anaconda (if you're a Red Hat 
based person). For example, we normally have 2 directories in the pkgdir, one 
for the base OS and one for the updates (i.e. using CentOS base and updates), 
that have been sync'd to the xCAT master. Then, while the machine is 
installing, it will install the updated packages directly, rather than having 
to install the base first and then apply the updates. This gives us the latest 
kernel by default, with no extra reboots afterwards.

The otherpkgdir and the otherpkglist are used for any packages that are not 
provided by the distribution operating system, and for packages from the web. 
I would normally add extras, opstools, slurm and cuda in this case.

In your scenario, we would go with adding OpenHPC into the otherpkglist

I hope that makes sense

regards,
Arif

___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] Removing BIND from xCAT

2019-09-24 Thread Vinícius Ferrão via xCAT-user
Jon, I’ve figured out, at least for my case, all the issues:

Issue 1: was solved by removing the public IP address and hostname from the 
/etc/hosts file. makedns no longer complains about the reverse and forward 
public zones.

Issues 2 and 3: the ridiculous hpclab.iq.ufrj.br.cluster.iq.ufrj.br. name was 
fixed automatically after solving 1.

So what I think is happening: xCAT does not read from its database to run 
makedns. It depends on /etc/hosts… the public network still exists in the 
output of the lsdef -t network command:

[root@hpclab ~]# lsdef -t network
10_0_0_0-255_255_255_0  (network)
146_164_29_0-255_255_255_0  (network)
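
A rough way to spot which /etc/hosts entries fall outside the internal network, and so might trigger update attempts against zones you do not control, is sketched below. This only illustrates the idea and is not how makedns itself works. The sample hosts lines, the 146.164.29.10 address and the helper name external_entries are made up for the example; only the 10.0.0.0/24 and 146.164.29.0/24 networks come from the lsdef output above.

```python
import ipaddress


def external_entries(hosts_text, internal_nets):
    """Return (ip, names) pairs from hosts-file text whose IP is outside internal_nets."""
    nets = [ipaddress.ip_network(n) for n in internal_nets]
    found = []
    for line in hosts_text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        ip_text, *names = line.split()
        ip = ipaddress.ip_address(ip_text)
        if ip.is_loopback:
            continue
        if not any(ip in net for net in nets):
            found.append((ip_text, names))
    return found


# Hypothetical /etc/hosts content; the addresses are examples only.
SAMPLE_HOSTS = """\
127.0.0.1      localhost
10.0.0.1       hpclab.cluster.iq.ufrj.br hpclab
146.164.29.10  hpclab.iq.ufrj.br
"""

for ip, names in external_entries(SAMPLE_HOSTS, ["10.0.0.0/24"]):
    # reverse_pointer is the PTR name; dropping its first label gives the /24 reverse zone
    zone = ipaddress.ip_address(ip).reverse_pointer.split(".", 1)[1]
    print(ip, names, zone)
```

Any entry this reports is a candidate for removal from /etc/hosts, or for being declared properly in the xCAT networks table.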

What do I have now? I think I will try a redeploy with those modified settings 
to see what happens. I expect more issues will come up when deploying FreeIPA 
in a manner that xCAT’s makedns likes.

That’s it for the server part.

Let me ask another thing: how do the clients authenticate? I’m using stateless 
images and this is an issue, since I can’t run ipa-client-install on the chroot 
image.

I was thinking of running ipa-client-install unattended every time a node 
boots. But this isn’t really well documented or tested. Some solution may come 
from forced re-enrollment:

https://access.redhat.com/documentation/en-us/red_hat_enterprise_linux/6/html/identity_management_guide/linux-manual
https://www.freeipa.org/page/V3/Forced_client_re-enrollment

But it’s not tested yet…

Thanks,



On 23 Sep 2019, at 09:00, Jon Diprose <j...@well.ox.ac.uk> wrote:

Hi Vinícius,

As I said, still some oddities!

Yes, I too failed with what I thought should be the correct explicit zone 
specification but found that zonesub probably did what I was expecting. We need 
a FreeIPA DNS expert to answer that one.

I think some of issue 2 & 3 comes down to having the xCAT networks and nics 
tables set up appropriately. For me, I think I see that nics whose associated 
networks record points to a different nameserver don’t result in an update 
attempt. I get a message of the form:

Ignoring host xxx-lan, it does not belong to any nets defined in networks table 
or the net it belongs to is configured to use an external nameserver.

where xxx is a node that xCAT knows about, with a secondary nic with suffix 
‘-lan’ and  associated with a networks record in turn configured with a 
different nameserver, that nameserver not being the IPA instance in 
/etc/resolv.conf that xCAT is trying to update. I also think this extends to 
any aliases on such nics, which is where I configure the “real” external 
hostname.

But yes, the non-local /etc/hosts entries that I haven’t told xCAT about end up 
with the internal domain appended to their fq-external-dn hostname. I think 
this is a fail-down from what you saw with your issue 1 forward zone failures - 
xCAT fails with just the external name and then tries again with the internal 
domain appended. In practice, it makes no difference until such time as the A 
record IPA returns has the wrong IP and I’ve dealt with it by manually purging 
such entries from IPA. The behaviour is predictable, so fixing it is 
scriptable, not that I’ve done it.

At some point I’ll tell xCAT about everything I’m interested in and see if that 
helps. It may in fact be that I only see such internal-domain-appended updates 
going to IPA for domains not declared in the networks table. The search domains 
declared in /etc/resolv.conf may also have an effect.

As per the above, I think issue 1 is a further symptom of the remaining 2 & 3 
issues - so will disappear to the extent that you can prevent the inappropriate 
DNS update attempts. On the plus side, these updates both should fail and do 
fail, which means that, other than the log message, they aren’t a problem.

We’d need someone who knows about how makedns actually works to know exactly 
how we should be doing it!

Jon

--
Dr. Jonathan Diprose <j...@well.ox.ac.uk>
 Tel: 01865 287837
Research Computing Manager
Henry Wellcome Building for Genomic Medicine Roosevelt Drive, Headington, 
Oxford OX3 7BN

From: Vinícius Ferrão via xCAT-user [mailto:xcat-user@lists.sourceforge.net]
Sent: 22 September 2019 19:23
To: xCAT Users Mailing list
Cc: Vinícius Ferrão
Subject: Re: [xcat-user] Removing BIND from xCAT

Jon, I was an amateur.

You had already given the exact string that I needed to make DNS updates work 
with the grant policy. Sorry for the mess. I was able to understand exactly 
what’s going on from this website: http://www.zytrax.com/books/dns/ch7/xfer.html

zonesub: The RR name being updated must match anything containing the zone name 
(as it appears in the zone clause containing this update-policy), including 
subdomains (any labels on the left) of this zone name. The optional tname field 
must be omitted when using this form.

LOL.

Anyway, issue 1 is still valid and happening.

After solving issues 2 and 3, I’ve run into another one: the external 
DNS name is added to the internal DNS with

Re: [xcat-user] Removing BIND from xCAT

2019-09-22 Thread Vinícius Ferrão via xCAT-user
Jon, I was an amateur.

You had already given the exact string that I needed to make DNS updates work 
with the grant policy. Sorry for the mess. I was able to understand exactly 
what’s going on from this website: http://www.zytrax.com/books/dns/ch7/xfer.html

zonesub: The RR name being updated must match anything containing the zone name 
(as it appears in the zone clause containing this update-policy), including 
subdomains (any labels on the left) of this zone name. The optional tname field 
must be omitted when using this form.

LOL.

Anyway, issue 1 is still valid and happening.

After solving issues 2 and 3, I’ve run into another one: the external DNS name 
is added to the internal DNS with the domain appended, so I got the following 
record:

hpclab.iq.ufrj.br.cluster.iq.ufrj.br.
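
The zonesub rule quoted above also explains why a name like this one is accepted by the internal zone. Here is a minimal sketch of that matching rule as I read it from the quoted description. It is a simplified model, not BIND's implementation; the function names and the node01 host are made up for illustration.

```python
def normalize(name):
    """Lower-case a DNS name and drop the trailing dot so names compare cleanly."""
    return name.rstrip(".").lower()


def zonesub_matches(record_name, zone_name):
    """True if record_name is the zone itself or any subdomain of it."""
    record, zone = normalize(record_name), normalize(zone_name)
    return record == zone or record.endswith("." + zone)


ZONE = "cluster.iq.ufrj.br"

# A hypothetical internal node: inside the zone, so the update is allowed.
print(zonesub_matches("node01.cluster.iq.ufrj.br.", ZONE))

# The external name on its own is outside the zone.
print(zonesub_matches("hpclab.iq.ufrj.br.", ZONE))

# With the internal domain appended it becomes a subdomain of the zone.
print(zonesub_matches("hpclab.iq.ufrj.br.cluster.iq.ufrj.br.", ZONE))
```

Under this model the appended name is a perfectly valid subdomain of the internal zone, so the grant lets it through even though the record is not what anyone wanted.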

Thanks,


> On 22 Sep 2019, at 14:52, Vinícius Ferrão  wrote:
> 
> Hello Jon, I’m having issues with the setup.
> 
> First, an enhancement that may be welcome for you:
> # chdef -t site externaldns=1
> 
> With this in place you can use plain makedns instead of makedns -e. I think 
> it’s a good idea to set it, to avoid messing with the local named daemon in 
> case you forget to pass -e to the makedns command.
> 
> Now the bad things.
> 1. makedns insists on updating zones that I do not control, for instance the 
> external domain name and the reverse zone of the external address:
> 
> Error: [hpclab]: Failure encountered updating 29.164.146.IN-ADDR.ARPA. with 
> entry '', error was REFUSED. See more details in system log.
> Error: [hpclab]: Failure encountered updating 29.164.146.IN-ADDR.ARPA. with 
> entry '', error was REFUSED. See more details in system log.
> Error: [hpclab]: Failure encountered updating 29.164.146.IN-ADDR.ARPA. with 
> entry '', error was REFUSED. See more details in system log.
> Error: [hpclab]: Failure encountered updating 29.164.146.IN-ADDR.ARPA. with 
> entry '', error was REFUSED. See more details in system log.
> Error: [hpclab]: Failure encountered updating iq.ufrj.br. with entry '', 
> error was REFUSED. See more details in system log.
> Error: [hpclab]: Failure encountered updating iq.ufrj.br. with entry '', 
> error was REFUSED. See more details in system log.
> Error: [hpclab]: Failure encountered updating iq.ufrj.br. with entry '', 
> error was REFUSED. See more details in system log.
> Error: [hpclab]: Failure encountered updating iq.ufrj.br. with entry '', 
> error was REFUSED. See more details in system log.
> 
> Is there a way to skip these zones? This happens because of my external 
> addresses.
> 
> 2. For reasons unknown I can’t make the grants for the internal zone:
> 
> Sep 22 14:48:04 hpclab named-pkcs11[21244]: 'CLUSTER.IQ.UFRJ.BR' unexpected
> Sep 22 14:48:04 hpclab named-pkcs11[21244]: zone cluster.iq.ufrj.br/IN: 
> failed to parse policy string
> Sep 22 14:48:04 hpclab named-pkcs11[21244]: zone cluster.iq.ufrj.br/IN: 
> disabling all updates because of error in update policy configuration: 
> unexpected token
> 
> I’m using this policy in cluster.iq.ufrj.br:
> 
> grant CLUSTER.IQ.UFRJ.BR krb5-self * A; grant CLUSTER.IQ.UFRJ.BR krb5-self * 
> ; grant CLUSTER.IQ.UFRJ.BR krb5-self * SSHFP; grant xcat_key 
> CLUSTER.IQ.UFRJ.BR A CNAME;
> 
> 3. The same thing happens for the reverse zone:
> 
> Sep 22 14:50:35 hpclab named-pkcs11[21244]: '0.0.10.in-addr.arpa.' unexpected
> Sep 22 14:50:35 hpclab named-pkcs11[21244]: zone 0.0.10.in-addr.arpa/IN: 
> failed to parse policy string
> Sep 22 14:50:35 hpclab named-pkcs11[21244]: zone 0.0.10.in-addr.arpa/IN: 
> disabling all updates because of error in update policy configuration: 
> unexpected token
> Sep 22 14:50:35 hpclab named-pkcs11[21244]: update_zone (syncrepl) failed for 
> master zone DN 
> 'idnsname=0.0.10.in-addr.arpa.,cn=dns,dc=cluster,dc=iq,dc=ufrj,dc=br'. Zones 
> can be outdated, run `rndc reload`: unexpected token
> 
> Using the policy in the reverse zone:
> grant CLUSTER.IQ.UFRJ.BR krb5-subdomain 0.0.10.in-addr.arpa. PTR; grant 
> xcat_key 0.0.10.in-addr.arpa. * PTR;
> 
> -x-x-x-
> 
> Regarding 2 and 3: there is probably something wrong with the grant policy, 
> but I’ve followed your instructions and the instructions in the link that 
> you attached to the original message.
> 
> What am I missing?
> 
> Thanks,
> 
> 
>> On 12 Sep 2019, at 06:11, Jon Diprose <j...@well.ox.ac.uk> wrote:
>> 
>> Hi Vinícius,
>>  
>> I am looking at exa

Re: [xcat-user] Removing BIND from xCAT

2019-09-25 Thread Vinícius Ferrão via xCAT-user
Hello again Jon and folks.

I was able to join the stateless nodes with a pretty bad postboot script, 
something like this:
ipa-client-install --domain=cluster.example.com -p admin -w adminpassword --force-join -U

I don’t like it, since the password is in plaintext in the script file. But 
that’s what’s working right now. I was playing with Service Principals from 
FreeIPA to avoid this, but nothing was really successful.

There’s a thread on FreeIPA users list that I’ve opened about this issue, some 
guys are helping out, definitely worth checking it out: 
https://lists.fedoraproject.org/archives/list/freeipa-us...@lists.fedorahosted.org/thread/HMMKHFABL4R4UBC62XWBMBH3MHYDPJGR/

Well, that’s it. Things that still need improvement:
* The DNS mechanism as a whole.
* A service principal for xCAT, to make all the required changes, including 
node enrollment? If supported…
* If the service principal does not work at all, at least some “service 
account” for xCAT to do the required things instead of the admin account.
* Client authentication without a plaintext password in postboot scripts.
* Perhaps some patches to the xCAT code… makedns still depends on things like 
/etc/hosts instead of depending only on the xCAT DB, and GSSAPI?

Thank you so much for your help. I would not have been able to solve this as 
quickly without it.

PS: Sorry for some writing mistakes in my last message, I was in a hurry in the 
airport and wasn’t able to review the message before sending.

On 24 Sep 2019, at 14:03, Vinícius Ferrão via xCAT-user 
<xcat-user@lists.sourceforge.net> wrote:

Jon, I’ve figured out, at least for my case, all the issues:

Issue 1: was solved by removing the public IP address and hostname from the 
/etc/hosts file. makedns no longer complains about the reverse and forward 
public zones.

Issues 2 and 3: the ridiculous hpclab.iq.ufrj.br.cluster.iq.ufrj.br. name was 
fixed automatically after solving 1.

So what I think is happening: xCAT does not read from its database to run 
makedns. It depends on /etc/hosts… the public network still exists in the 
output of the lsdef -t network command:

[root@hpclab ~]# lsdef -t network
10_0_0_0-255_255_255_0  (network)
146_164_29_0-255_255_255_0  (network)

What do I have now? I think I will try a redeploy with those modified settings 
to see what happens. I expect more issues will come up when deploying FreeIPA 
in a manner that xCAT’s makedns likes.

That’s it for the server part.

Let me ask another thing: how do the clients authenticate? I’m using stateless 
images and this is an issue, since I can’t run ipa-client-install on the chroot 
image.

I was thinking of running ipa-client-install unattended every time a node 
boots. But this isn’t really well documented or tested. Some solution may come 
from forced re-enrollment:

https://access.redhat.com/documentation/en-us/red_hat_enterprise_linux/6/html/identity_management_guide/linux-manual
https://www.freeipa.org/page/V3/Forced_client_re-enrollment

But it’s not tested yet…

Thanks,



On 23 Sep 2019, at 09:00, Jon Diprose <j...@well.ox.ac.uk> wrote:

Hi Vinícius,

As I said, still some oddities!

Yes, I too failed with what I thought should be the correct explicit zone 
specification but found that zonesub probably did what I was expecting. We need 
a FreeIPA DNS expert to answer that one.

I think some of issue 2 & 3 comes down to having the xCAT networks and nics 
tables set up appropriately. For me, I think I see that nics whose associated 
networks record points to a different nameserver don’t result in an update 
attempt. I get a message of the form:

Ignoring host xxx-lan, it does not belong to any nets defined in networks table 
or the net it belongs to is configured to use an external nameserver.

where xxx is a node that xCAT knows about, with a secondary nic with suffix 
‘-lan’ and  associated with a networks record in turn configured with a 
different nameserver, that nameserver not being the IPA instance in 
/etc/resolv.conf that xCAT is trying to update. I also think this extends to 
any aliases on such nics, which is where I configure the “real” external 
hostname.

But yes, the non-local /etc/hosts entries that I haven’t told xCAT about end up 
with the internal domain appended to their fq-external-dn hostname. I think 
this is a fail-down from what you saw with your issue 1 forward zone failures - 
xCAT fails with just the external name and then tries again with the internal 
domain appended. In practice, it makes no difference until such time as the A 
record IPA returns has the wrong IP and I’ve dealt with it by manually purging 
such entries from IPA. The behaviour is predictable, so fixing it is 
scriptable, not that I’ve done it.

At some point I’ll tell xCAT about everything I’m interested in and see if that 
helps. It may in fact be that I only see such internal-domain-appended updates 
going to IPA for domains not declared in the networks table. The search dom

Re: [xcat-user] Stateless nodes hostkeys

2019-10-11 Thread Vinícius Ferrão via xCAT-user
Thomas, take a look at Jarrod’s message. It’s from two days ago.

All those questions are answered there.

And yes, Russ was talking about host keys. This is an issue with modern 
authentication. Everything is explained there.



Sent from my iPhone

> On 11 Oct 2019, at 12:38, Thomas HUMMEL  wrote:
> 
> On 10/11/19 4:20 PM, Russ Auld wrote:
>> The postscript you want is 'remoteshell'. It will install the _same_ host 
>> keys on all nodes.
> 
> You confirm you are talking about host key, not root user ssh key correct ?
> 
>> If you bake host keys into the image, the sshd daemon will not create new 
>> keys when it starts. Since the host keys are fixed, you can create a 
>> 'ssh_known_hosts' file with entries for each node and distribute it to your 
>> login/submit/bastion hosts.
> 
> I thought about something like this.
> 
> Anyway, playing for the first time with updatenode -F / xdcp -F
> 
> I'm experiencing the following :
> 
> Simple test :
> 
> /opt/test/foobar.txt
> /opt/test/synclists/list.synclist which content is
> 
>/opt/test/foobar.txt -> /root/foobar.txt
> 
> # xdcp maestro-300 -F /opt/test/synclists/list.synclist
> Error: [maestro-xcat]: Noderange missing in command input.
> Error: [maestro-xcat]: Failed to dispatch command to any of the following 
> service nodes: ,maestro-xcat.maestro.pasteur.fr
> 
> Where maestro-xcat is my MN (I don't use SN).
> 
> What am I missing ?
> 
> Thanks
> 
> --
> TH
> 
> 
> 
> 
> ___
> xCAT-user mailing list
> xCAT-user@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/xcat-user

___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] Stateless nodes hostkeys

2019-10-11 Thread Vinícius Ferrão via xCAT-user


Sent from my iPhone

> On 11 Oct 2019, at 13:10, Thomas HUMMEL  wrote:
> 
> On 10/11/19 6:02 PM, Vinícius Ferrão via xCAT-user wrote:
>> Thomas take a look at Jarrod’s message. It’s from two days ago.
> 
> Hello,
> 
> I did. Thanks
>> All those questions are answered there.
>> And yes, Russ was talking about host keys. This is an issue with modern 
>> authentication. Everything is explained there.
> 
> My understanding is that for now, as secureshell is not released yet, I 
> indeed want remoteshell.

Yes. But since you’re retailoring your system, you should consider the 
security implications of that.

If you want to keep per-host keys without any additional infrastructure, a 
mechanism should be implemented to keep the keys sane.

If you don’t care about security, which is fine depending on the case, 
remoteshell does in fact do what you want, but be aware of the consequences. 
It’s an old script and can become a problem: for instance, the ed25519 host 
keys are always regenerated because remoteshell ignores ed25519 keys.

There are a lot of ideas from Jarrod for reimplementing something better. My 
path is to disable remoteshell completely and use SSSD to handle the host keys.

Whatever you choose might work, but keep the issues in mind.

Is there anything else left uncovered?



> 
> Thanks
> 
> --
> TH
> 
> 
> 
> ___
> xCAT-user mailing list
> xCAT-user@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/xcat-user

___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] [External] xCAT forcibly disabling SELinux and firewalld

2019-10-12 Thread Vinícius Ferrão via xCAT-user
Jarrod, do you think it’s okay to raise an issue on 
https://github.com/xcat2/xcat-core/issues to request this new meta package?

Thanks,

On 26 Sep 2019, at 03:54, Jarrod Johnson <jjohns...@lenovo.com> wrote:

I've been considering removing all of that from executing on rpm install (also 
enabling services to start on boot just by installing rpm)

It was added for convenience of not asking to run a setup after install but it 
is inconsistent with general rpm behavior and limits ability to use flags to 
customize behavior.

On the flip side, this would be a change that people would have to learn and 
would surprise new installs.

I might make variant of the xCAT meta package with no auto setup so that people 
won't be surprised unless they opt into the other package.

Looking for thoughts.

For wider information, it doesn't yet have os deployment, but confluent has 
been developing and designing specifically with firewall and selinux in mind, 
as well as trying to mitigate the initial setup complexity that drove us to 
create xcatconfig in the first place.  For example no more tls certs required 
for local access and os import will no longer loop mount isos (one of the 
biggest selinux problems) and avoid rewriting other service etc files in daemon 
context.  More straightforward network usage and a documented set of firewalld 
commands.

From: Vinícius Ferrão via xCAT-user <xcat-user@lists.sourceforge.net>
Sent: Thursday, September 26, 2019 2:27:10 AM
To: xCAT Users Mailing list
Cc: Vinícius Ferrão
Subject: [External] [xcat-user] xCAT forcibly disabling SELinux and firewalld

Hello,

When installing xCAT on EL7 with yum install xCAT, it just puts SELinux in 
permissive mode and disables firewalld.

It does not even ask about it. It just does it.

[root@headnode ~]# getenforce
Permissive
[root@headnode ~]# systemctl status firewalld
● firewalld.service - firewalld - dynamic firewall daemon
   Loaded: loaded (/usr/lib/systemd/system/firewalld.service; disabled; vendor preset: enabled)
   Active: inactive (dead)
     Docs: man:firewalld(1)

Sep 26 02:55:55 headnode.cluster.iq.ufrj.br systemd[1]: Starting firewalld - dynamic firewall daemon...
Sep 26 02:55:56 headnode.cluster.iq.ufrj.br systemd[1]: Started firewalld - dynamic firewall daemon.
Sep 26 03:09:18 headnode.cluster.iq.ufrj.br systemd[1]: Stopping firewalld - dynamic firewall daemon...
Sep 26 03:09:21 headnode.cluster.iq.ufrj.br systemd[1]: Stopped firewalld - dynamic firewall daemon.

Is there a way to avoid this behaviour?

Thanks,

PS: I’m aware of the consequences of firewalld and SELinux in xCAT environments.

___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] [External] xCAT forcibly disabling SELinux and firewalld

2019-10-14 Thread Vinícius Ferrão via xCAT-user
Thanks Jarrod.

Opened the issue: https://github.com/xcat2/xcat-core/issues/6445

Just for the sake of completeness: what’s the difference between the upstream 
and the Lenovo build? There’s nothing explaining it on hpc.lenovo.com.

It appears to be tied to Confluent. I heard that Confluent would eventually 
replace xcatd and become the xCAT 3.0 release. Is this still true?

Thanks.

On 14 Oct 2019, at 09:38, Jarrod Johnson <jjohns...@lenovo.com> wrote:

I think it is fine, but on the other hand, I can only personally provide such a 
meta package in the lenovo branches.  I could open a pull request but I can't 
guarantee that it would be accepted.


____
From: Vinícius Ferrão <fer...@versatushpc.com.br>
Sent: Saturday, October 12, 2019 11:13 PM
To: Jarrod Johnson
Cc: xCAT Users Mailing list
Subject: Re: [External] [xcat-user] xCAT forcibly disabling SELinux and 
firewalld

Jarrod, do you think it’s okay to raise an issue on 
https://github.com/xcat2/xcat-core/issues to request this new meta package?

Thanks,

On 26 Sep 2019, at 03:54, Jarrod Johnson <jjohns...@lenovo.com> wrote:

I've been considering removing all of that from executing on rpm install (also 
enabling services to start on boot just by installing rpm)

It was added for convenience of not asking to run a setup after install but it 
is inconsistent with general rpm behavior and limits ability to use flags to 
customize behavior.

On the flip side, this would be a change that people would have to learn and 
would surprise new installs.

I might make variant of the xCAT meta package with no auto setup so that people 
won't be surprised unless they opt into the other package.

Looking for thoughts.

For wider information, it doesn't yet have os deployment, but confluent has 
been developing and designing specifically with firewall and selinux in mind, 
as well as trying to mitigate the initial setup complexity that drove us to 
create xcatconfig in the first place.  For example no more tls certs required 
for local access and os import will no longer loop mount isos (one of the 
biggest selinux problems) and avoid rewriting other service etc files in daemon 
context.  More straightforward network usage and a documented set of firewalld 
commands.
____
From: Vinícius Ferrão via xCAT-user <xcat-user@lists.sourceforge.net>
Sent: Thursday, September 26, 2019 2:27:10 AM
To: xCAT Users Mailing list
Cc: Vinícius Ferrão
Subject: [External] [xcat-user] xCAT forcibly disabling SELinux and firewalld

Hello,

When installing xCAT on EL7 with yum install xCAT, it just puts SELinux in 
permissive mode and disables firewalld.

It does not even ask about it. It just does it.

[root@headnode ~]# getenforce
Permissive
[root@headnode ~]# systemctl status firewalld
● firewalld.service - firewalld - dynamic firewall daemon
   Loaded: loaded (/usr/lib/systemd/system/firewalld.service; disabled; vendor preset: enabled)
   Active: inactive (dead)
     Docs: man:firewalld(1)

Sep 26 02:55:55 headnode.cluster.iq.ufrj.br systemd[1]: Starting firewalld - dynamic firewall daemon...
Sep 26 02:55:56 headnode.cluster.iq.ufrj.br systemd[1]: Started firewalld - dynamic firewall daemon.
Sep 26 03:09:18 headnode.cluster.iq.ufrj.br systemd[1]: Stopping firewalld - dynamic firewall daemon...
Sep 26 03:09:21 headnode.cluster.iq.ufrj.br systemd[1]: Stopped firewalld - dynamic firewall daemon.

Is there a way to avoid this behaviour?

Thanks,

PS: I’m aware of the consequences of firewalld and SELinux in xCAT environments.



___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] xdcp -F / updatenode -F "Noderange missing"

2019-10-14 Thread Vinícius Ferrão via xCAT-user
Thomas, 

Do you have all the entries both in /etc/hosts and in DNS? They are redundant, 
I know, but sometimes xCAT picks values from /etc/hosts and sometimes from DNS. 
This is really a problem, but you get used to it.

If you’re unable to create the entries with makehosts, since you said that the 
machine isn’t an xCAT object, you can add the entries manually. xCAT will not 
overwrite them.

I’m assuming this from your feedback; I’m not sure it’s the actual problem.
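
To check quickly whether both forms of the management node name are present in /etc/hosts, something like the sketch below could help. It only parses hosts-file text and says nothing about what DNS returns. The 10.0.0.1 address and the helper names are made up; the host names are the ones from your messages.

```python
def parse_hosts(hosts_text):
    """Map every name and alias in hosts-file text to its IP address."""
    table = {}
    for line in hosts_text.splitlines():
        fields = line.split("#", 1)[0].split()  # ignore comments
        if len(fields) >= 2:
            ip, *names = fields
            for name in names:
                table.setdefault(name.lower(), ip)
    return table


def missing_names(hosts_text, fqdn):
    """Return which of the FQDN and its short name have no hosts entry."""
    short = fqdn.split(".", 1)[0]
    table = parse_hosts(hosts_text)
    return [name for name in (fqdn, short) if name.lower() not in table]


# Hypothetical hosts content where only the short name is defined.
SAMPLE = "10.0.0.1  maestro-xcat\n"

print(missing_names(SAMPLE, "maestro-xcat.maestro.pasteur.fr"))
```

If the FQDN shows up as missing here, that would at least be consistent with pping failing on the FQDN while working with the short name or the IP.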

Thanks,


> On 14 Oct 2019, at 07:14, Thomas HUMMEL  wrote:
> 
> On 10/14/19 11:22 AM, Thomas HUMMEL wrote:
>> On 10/14/19 10:59 AM, Thomas HUMMEL wrote:
>>> following service nodes: ,maestro-xcat.maestro.pasteur.fr
>> Sorry, I just noticed I had the above typo in my site table :
>> "master",",maestro-xcat.maestro.pasteur.fr",,
>> Now I changed it to
>> "master","maestro-xcat.maestro.pasteur.fr",,
>> I get
>> # xdcp maestro-300 -F /opt/test/synclists/list.synclist
>> Error: [maestro-xcat]: Error from pping
>> But I can pping the node :
>> [root@maestro-xcat opt]# pping maestro-300
>> maestro-300: ping
> 
> I can fix it now but can't quite explain what happens :
> 
> The problem was that pping was not able to pping the master 
> (maestro-xcat.maestro.pasteur.fr) itself.
> 
> I can make it work by
> 
> - either using the ip as the "master" attribute value in the site table
> - or using the non fqdn (maestro-xcat) value in site table
> 
> This is quite confusing as man site mention "The hostname of the xCAT 
> management node, as known by the nodes"
> 
> Note : in any case my maestro-xcat management node is not an xCAT node object 
> itself, which may itself be a bad practice (but it has always worked for me 
> this way)
> 
> Thanks
> 
> --
> TH
> 
> 
> 
> ___
> xCAT-user mailing list
> xCAT-user@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/xcat-user


___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] Timezone settings: how it works?

2019-12-19 Thread Vinícius Ferrão via xCAT-user
Nathan you’re right.

We’ve hit another problem now: genimage does not update the tzdata package.

We are debugging this issue.

Thank you!

Sent from my iPhone

On 13 Dec 2019, at 18:39, Nathan A Besaw  wrote:



The timezone files for the diskless image come from within the rootimg 
directory, not /usr/share/zoneinfo on the management node.

For your situation below, I would expect that the correct file is:
/install/netboot/centos7.6/x86_64/compute/rootimg/usr/share/zoneinfo/America/Sao_Paulo

Check to see if this checksum matches:
md5sum 
/install/netboot/centos7.6/x86_64/compute/rootimg/usr/share/zoneinfo/America/Sao_Paulo
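
The same comparison can be scripted. Below is a minimal sketch, assuming the rootimg path used in this thread; the helper names md5_of and same_file_content are made up for the example.

```python
import hashlib
from pathlib import Path


def md5_of(path):
    """Return the hex MD5 digest of a file's contents."""
    return hashlib.md5(Path(path).read_bytes()).hexdigest()


def same_file_content(path_a, path_b):
    """True if both files hash to the same MD5 digest."""
    return md5_of(path_a) == md5_of(path_b)


if __name__ == "__main__":
    # Paths taken from the thread; adjust to your own osimage layout.
    rootimg = Path("/install/netboot/centos7.6/x86_64/compute/rootimg")
    localtime = rootimg / "etc/localtime"
    zonefile = rootimg / "usr/share/zoneinfo/America/Sao_Paulo"
    if localtime.exists() and zonefile.exists():
        print(same_file_content(localtime, zonefile))
```

If this prints True, /etc/localtime in the image was copied from the zoneinfo inside the rootimg, not from the management node.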


"Vinícius Ferrão via xCAT-user" ---12/13/2019 02:31:42 PM---Hello, I’m with an 
issue with timezone settings of xCAT.

From: "Vinícius Ferrão via xCAT-user" 
To: "xcat-user@lists.sourceforge.net" 
Cc: "Vinícius Ferrão" 
Date: 12/13/2019 02:31 PM
Subject: [EXTERNAL] [xcat-user] Timezone settings: how it works?





Hello,

I’m having an issue with the timezone settings in xCAT.

I do have the configuration in place on xCAT:
[root@headnode etc]# lsdef -t site clustersite | grep timezone
   timezone=America/Sao_Paulo

But when I run packimage, it gets a timezone file from a location that I 
can’t identify.

Here’s the md5 of /etc/localtime inside the image:
md5sum /install/netboot/centos7.6/x86_64/compute/rootimg/etc/localtime
346a7b10fbc17af944896bad4d093568

It should be this one:
md5sum /etc/localtime
339ff8dbd88e39f59b2fd4ee57b1fdb1  /etc/localtime

I even tried searching for this specific md5 in /usr/share/zoneinfo, but I was 
unable to find it:
find . -type f -exec md5sum {} + | grep 346a7b10fbc17af944896bad4d093568

So the question is: where does xCAT get the zone info from when running 
packimage?

Thanks,


___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user




___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] External DNS/makedns/update server

2019-12-21 Thread Vinícius Ferrão via xCAT-user
Hello, Thomas. I’m not from the xCAT development team but I’m curious about 
your question.

In your example, you mean that the name server listed in the SOA record is not 
present in the NS records of the given zone, right?

If this is correct, I think xCAT is right to send updates only to the name 
servers in the NS records, because the name server in the SOA record may be 
unreachable from the client sending the updates, depending on the architecture 
of the domain. A truly hidden master is only reachable from its slaves.

Or am I missing something?
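
To make the two strategies concrete, here is a small sketch of how the update targets differ. It is only a model of the reasoning above, not xCAT or nsupdate code. The server names under example.com and the function names are made up.

```python
def _norm(name):
    """Lower-case a DNS name and strip the trailing dot."""
    return name.rstrip(".").lower()


def is_hidden_master(soa_mname, ns_names):
    """True if the SOA MNAME server is not among the zone's NS records."""
    return _norm(soa_mname) not in {_norm(n) for n in ns_names}


def update_targets(soa_mname, ns_names, use_mname):
    """Pick the servers an updater would contact under each strategy."""
    return [soa_mname] if use_mname else list(ns_names)


# Hypothetical zone: one hidden master and two published slaves.
MNAME = "master.example.com."
NS = ["ns1.example.com.", "ns2.example.com."]

print(is_hidden_master(MNAME, NS))
print(update_targets(MNAME, NS, use_mname=True))   # MNAME strategy
print(update_targets(MNAME, NS, use_mname=False))  # NS strategy
```

With a hidden master, the MNAME strategy only works if the updating client can actually reach that server, which is the concern raised above.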

Sent from my iPhone

> On 25 Nov 2019, at 11:15, Thomas HUMMEL  wrote:
> 
> Hello,
> 
> From what I've seen, when using an external DNS server, xCAT's 'makedns' 
> command looks for the NS records for the zone and sends them the update.
> 
> What's the reasoning behind this instead of, let's say, sending the update to 
> the MNAME field (primary master) of the SOA record, as for instance the 
> 'nsupdate' command does when no 'server' directive is used ?
> 
> As a matter of fact, in a setup consisting of (with bind/named):
> 
> - a (hidden) master
> - 2 slaves (only they have NS records for the zone)
> 
> it leads to unnecessary transfers:
> 
> - xCAT sends the update to the slave(s)
> - slaves forward the update to the master
> - master notifies the slaves
> - records get transferred from master to slaves
> 
> Thanks
> 
> --
> TH
> 
> 
> 


Re: [xcat-user] [External] Support for ARM booting

2020-02-10 Thread Vinícius Ferrão via xCAT-user
Being a gravedigger here.

I finally had an opportunity to test an ARM machine, but the results were 
simply bad. I’ve documented what happened here: 
https://github.com/xcat2/xcat-core/issues/2946#issuecomment-584172647

Any help is appreciated.

Thanks,


On 24 Oct 2019, at 17:03, Jarrod Johnson <jjohns...@lenovo.com> wrote:


Here is a commit:
https://github.com/jjohnson42/xcat-core/commit/926e026236c4f942e775a8ebde1df2d9b186557f

Unfortunately, the platform that drove that work did not turn out to be a 
commercial success.

That plugin/patch did result in successful install to disk at least, just that 
it used more tftp than I would like.

It would be possible to use iPXE as that builds aarch64 and have xnba.pm 
support that as well, but I am at a loss for applicable aarch64 systems for me 
to test/develop on.

From: Vinícius Ferrão via xCAT-user <xcat-user@lists.sourceforge.net>
Sent: Thursday, October 24, 2019 1:46 PM
To: Vinícius Ferrão via xCAT-user <xcat-user@lists.sourceforge.net>
Cc: Vinícius Ferrão <fer...@versatushpc.com.br>
Subject: [External] [xcat-user] Support for ARM booting

Hello,

I’m adding some ARM compute nodes to an xCAT cluster and I got the bad news: 
xCAT does not support aarch64.

Looking around, there’s some information about prototypes:
https://sourceforge.net/p/xcat/mailman/message/35161961/

There’s a ticket about this too:
https://github.com/xcat2/xcat-core/issues/2946

So the question is: what’s the state of aarch64 today? Is there any workaround, 
even an unofficial one, to at least boot aarch64 nodes from a standard x86_64 
management node?

Thanks,



Re: [xcat-user] Delete old osimages

2020-02-15 Thread Vinícius Ferrão via xCAT-user
Hi Casandra, issuing rmdef will remove all the definitions of the image from 
xCAT, right?

But the files will stay on the disk, and they must be removed manually.

Is there anything more than /install/*images* to be deleted, or is this 
sufficient?

Thanks,


On 14 Feb 2020, at 16:35, Casandra H Qiu <cxh...@us.ibm.com> wrote:


We don't use `rmosdistro`; we only need the `rmdef` command. You may need to 
issue it multiple times because there are multiple osimage names with the same 
DISTRO.


...
Casandra Hong Qiu
Phone: (845) 433-9291, t/l 293-9291
Office: Building 8, 3-B-04
cxh...@us.ibm.com




From: Andrew Loftus <alof...@illinois.edu>
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Date: 02/14/2020 02:14 PM
Subject: [EXTERNAL] Re: [xcat-user] Delete old osimages





> ... it seems like rmosdistro with the -f flag did what I expected.
I'll have to take that back. After running rmosdistro, lsdef -t osimage still 
lists the images.  I had to also run "rmdef" (as pointed out by Aaronhcarr and 
Casandra H Qiu).

So, in summary, what is the proper way to delete an old osdistro that was 
created from copycds?

I was hoping there was a single command to do this, but it seems it requires:

1. lsdef -t osimage | grep  | cut -d' ' -f1 | xargs -n1 rmdef -t osimage
2. rmosdistro 

Please add to this list if it is missing anything.
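
The two steps above could be previewed as a dry run before anything is deleted. This is only a sketch: plan_osimage_cleanup is a made-up helper, "centos7.6-x86_64" is a hypothetical distro name, and it assumes that every osimage created by copycds has a name starting with the osdistro name.

```shell
#!/bin/sh
# plan_osimage_cleanup is a hypothetical helper, not an xCAT command.
# Reads `lsdef -t osimage` output on stdin and prints (does not run) the
# rmdef commands for every osimage whose name starts with DISTRO, followed
# by the final rmosdistro command.
# Usage: lsdef -t osimage | plan_osimage_cleanup DISTRO
plan_osimage_cleanup() {
    distro=$1
    awk -v d="$distro" 'index($1, d) == 1 { print "rmdef -t osimage " $1 }'
    echo "rmosdistro $distro"
}

# Example (not run here):
#   lsdef -t osimage | plan_osimage_cleanup centos7.6-x86_64
```

Once the printed list looks right, the same output can be piped to sh to actually run it.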

On Fri, Feb 14, 2020 at 1:06 PM Casandra H Qiu <cxh...@us.ibm.com> wrote:

`rmimage` only removes the files under the `rootimgdir=` attribute for the 
diskless/netboot image.

`rmdef -t osimage imagename` should remove the image created by `copycds`.

What kind of error message did you see when you ran the `rmdef` command?


Casandra Qiu


...
Casandra Hong Qiu
Phone: (845) 433-9291, t/l 293-9291
Office: Building 8, 3-B-04
cxh...@us.ibm.com




From: Andrew Loftus <alof...@illinois.edu>
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Date: 02/14/2020 01:50 PM
Subject: [EXTERNAL] Re: [xcat-user] Delete old osimages




No, I didn't try the -f flag because it didn't seem like the right approach.

If copycds always creates an "install" image and rmdef doesn't support removing 
the "install" image, then "rmosdistro" will never run without the -f flag. That 
seems broken to me.

That aside, it seems like rmosdistro with the -f flag did what I expected.

On Fri, Feb 14, 2020 at 11:07 AM Mark Gurevich <gurev...@us.ibm.com> wrote:
Have you tried "rmosdistro -f" to force removal?

Mark Gurevich
Poughkeepsie Development Lab
HPC Software Development - xCAT

"If we knew what it was we were doing, it would not be called research, would 
it?"
--Albert Einstein




From: Andrew Loftus <alof...@illinois.edu>
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Date: 02/14/2020 12:00 PM
Subject: [EXTERNAL] [xcat-user] Delete old osimages




Is there a way to clean up old osimages (created from copycds)?

I tried "rmosdistro" but that fails with the message that it is referenced by 
osimages.

I tried "rmimage" to remove the osimages but that fails with the message that 
it will only delete certain image files.

Cheers,
--Andy


Re: [xcat-user] osimage profile attribute

2020-01-19 Thread Vinícius Ferrão via xCAT-user
I wasn’t aware of this. I thought it was a “mandatory” group.

On 16 Jan 2020, at 17:49, Mark Gurevich <gurev...@us.ibm.com> wrote:


In these two examples: "xdsh compute command" and "rpower compute off", the 
target nodes for the command will be from nodegroup "compute", not necessarily 
just the nodes that have "profile=compute" in their definitions.

Mark Gurevich
Poughkeepsie Development Lab
HPC Software Development - xCAT

"If we knew what it was we were doing, it would not be called research, would 
it?"
--Albert Einstein



From: "Vinícius Ferrão via xCAT-user" <xcat-user@lists.sourceforge.net>
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Cc: "Vinícius Ferrão" <fer...@versatushpc.com.br>
Date: 01/16/2020 02:28 PM
Subject: [EXTERNAL] Re: [xcat-user] osimage profile attribute





You can use it as a keyword for other commands, for instance:

xdsh compute command
rpower compute off

On 16 Jan 2020, at 14:37, Bruce Carpenter <bcarpen...@uga.edu> wrote:

What does the “profile” attribute in osimage (osimage.profile) affect? The 
description just states: “The node usage category. For example compute, 
service.”

Does any of xcat use that attribute for anything, or is it just an extra 
“description”?

--
Bruce Carpenter
System Administrator Spec. EITS - Research Computing
#108 Computing Services
101 Cedar Street
Athens, GA 30602-0001
bcarpen...@uga.edu


Re: [xcat-user] host based authentication

2020-01-21 Thread Vinícius Ferrão via xCAT-user
I have it working.

The process is a little cumbersome, but it’s working. I’m using FreeIPA too, 
so the host keys are stored in LDAP and everything is integrated.

The only issue I have is that I needed to disable xCAT's remoteshell script 
and use one that we wrote ourselves, and unfortunately all the nodes share the 
same host keys.

I can describe exactly what we have done if you want, but it’s tied to FreeIPA.

> On 21 Jan 2020, at 17:51, Imam Toufique  wrote:
> 
> Hi, 
> 
> Quick question, before I jump in finding my own solution.
> 
> Is there anything in xcat that would allow setting up host based 
> authentication?  I know root can ssh from the mgmt. node to all the nodes in 
> the cluster.  I am referring to user authentication , based on 
> /etc/ssh/known_hosts file, where there is a list of hosts and their 
> respective keys. 
> 
> thanks.


Re: [xcat-user] osimage profile attribute

2020-01-16 Thread Vinícius Ferrão via xCAT-user
You can use it as a keyword for other commands, for instance:

xdsh compute command
rpower compute off

On 16 Jan 2020, at 14:37, Bruce Carpenter <bcarpen...@uga.edu> wrote:

What does the “profile” attribute in osimage (osimage.profile) affect?  The 
description just states: “The node usage category. For example compute, 
service.”

Does any of xcat use that attribute for anything, or is it just an extra 
“description”?

--
Bruce Carpenter
System Administrator Spec. EITS - Research Computing
#108 Computing Services
101 Cedar Street
Athens, GA 30602-0001
bcarpen...@uga.edu



Re: [xcat-user] xCAT 2.15.1 released

2020-03-06 Thread Vinícius Ferrão via xCAT-user
Hi Mark, thanks for the release.

May I ask a question?

Regarding this: • Limited support for statelite

Did this happen because Red Hat deprecated stateless machines with root over 
NFS? That seems to be how the xCAT statelite method works.

How does xCAT handle the stateless method? Couldn’t something similar be done, 
since it does not appear to rely on anything standard or default in Enterprise 
Linux?

Thanks,

On 6 Mar 2020, at 14:37, Mark Gurevich <gurev...@us.ibm.com> wrote:


xCAT version 2.15.1 has been released today. Thank you very much for your 
support!

Please see the release notes for details:
https://github.com/xcat2/xcat-core/wiki/XCAT_2.15.1_Release_Notes

Best Regards!

xCAT Team





[xcat-user] Issues with net-snmp on new CentOS 8 system and xCAT 2.16 install

2020-09-06 Thread Vinícius Ferrão via xCAT-user
Hello,

I don’t know if I’m missing something or if things got broken with xCAT 2.16; 
after installing CentOS 8.2, the dnf install xCAT command fails:

[root@cluster ~]# dnf install xCAT
CentOS-8 - AppStream                                    2.0 MB/s | 5.8 MB 00:02
CentOS-8 - Base                                         2.6 MB/s | 2.2 MB 00:00
CentOS-8 - Extras                                        14 kB/s | 7.3 kB 00:00
CentOS-8 - PowerTools                                   872 kB/s | 1.9 MB 00:02
OpenHPC-2 - Base                                        731 kB/s | 2.5 MB 00:03
OpenHPC-2 - Updates                                     430  B/s | 257  B 00:00
Extra Packages for Enterprise Linux Modular 8 - x86_64   43 kB/s | 117 kB 00:02
Extra Packages for Enterprise Linux 8 - x86_64           96 kB/s | 7.9 MB 01:24
xCAT 2 Core packages                                     27 kB/s |  30 kB 00:01
xCAT 2 depedencies                                       94 kB/s | 104 kB 00:01
ZFS on Linux for EL8 - dkms                              30 kB/s |  48 kB 00:01
Error: 
 Problem: package xCAT-2.16-snap202006161607.x86_64 requires 
perl(xCAT::MsgUtils), but none of the providers can be installed
  - package xCAT-2.16-snap202006161607.x86_64 requires perl(xCAT::Utils), but 
none of the providers can be installed
  - package perl-xCAT-4:2.16-snap202006161607.noarch requires perl(SNMP), but 
none of the providers can be installed
  - cannot install the best candidate for the job
  - nothing provides net-snmp-agent-libs(x86-64) = 1:5.8-10.el8 needed by 
net-snmp-perl-1:5.8-10.el8.x86_64
  - nothing provides net-snmp-libs(x86-64) = 1:5.8-10.el8 needed by 
net-snmp-perl-1:5.8-10.el8.x86_64
  - nothing provides net-snmp-agent-libs(x86-64) = 1:5.8-7.el8 needed by 
net-snmp-perl-1:5.8-7.el8.x86_64
  - nothing provides net-snmp-libs(x86-64) = 1:5.8-7.el8 needed by 
net-snmp-perl-1:5.8-7.el8.x86_64
(try to add '--skip-broken' to skip uninstallable packages or '--nobest' to use 
not only best candidate packages)

xCAT appears to require a specific version (=) of net-snmp* when it should be 
greater than or equal (>=).

Those packages are available but with higher versions:
Installing:
 net-snmp-agent-libs   x86_64   1:5.8-14.el8_2.1   AppStream   747 k
 net-snmp-libs         x86_64   1:5.8-14.el8_2.1   BaseOS      821 k

So, it may be a bug, right?

Thanks,


Re: [xcat-user] Issues with net-snmp on new CentOS 8 system and xCAT 2.16 install

2020-09-07 Thread Vinícius Ferrão via xCAT-user
Thanks Jean, this appears to be a 4-month-old bug. Woot.

I did what you suggested but ended up with other missing packages:
[root@headnode ~]# dnf install xCAT
Last metadata expiration check: 0:06:55 ago on Mon 07 Sep 2020 08:18:32 PM -03.
Error: 
 Problem: package xCAT-2.16-snap202006161607.x86_64 requires perl(CGI), but 
none of the providers can be installed
  - package perl-CGI-4.38-2.el8.noarch requires perl(HTML::Entities) >= 3.69, 
but none of the providers can be installed
  - cannot install the best candidate for the job
  - package perl-HTML-Parser-3.72-14.el8.x86_64 is filtered out by modular 
filtering
  - package perl-CGI-4.38-2.el8.noarch is filtered out by exclude filtering
  - package perl-HTML-Parser-3.72-15.module_el8.3.0+416+b5fcb464.x86_64 is 
filtered out by exclude filtering
  - package perl-HTML-Parser-3.72-15.module_el8.3.0+416+dee7bcef.x86_64 is 
filtered out by exclude filtering
(try to add '--skip-broken' to skip uninstallable packages or '--nobest' to use 
not only best candidate packages)


Using the Continuous Release repository leads to things like this…

What is strange in the original issue is that the requirement is pinned to an 
exact version, and in CentOS 8.2 the package is available but differs in the 
release number:
nothing provides net-snmp-agent-libs(x86-64) = 1:5.8-10.el8 needed by 
net-snmp-perl-1:5.8-10.el8.x86_64
net-snmp-agent-libs   x86_64   1:5.8-14.el8_2.1   AppStream   747 k

So xCAT mandates 1:5.8-10 when 1:5.8-14 is available.
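
The difference between an exact (=) and a minimum (>=) requirement can be illustrated with the version strings from the error above. This is a rough sketch: the two function names are made up, and GNU sort -V only approximates RPM's real version comparison. Note also that the "=" in the dnf output is attached to net-snmp-perl ("needed by net-snmp-perl-1:5.8-10.el8.x86_64").

```shell
#!/bin/sh
# Hypothetical helpers for illustration only; dnf/rpm do this internally.
# Arguments: AVAILABLE REQUIRED

# True when the available version equals the required one ("=").
satisfies_exact() {
    [ "$1" = "$2" ]
}

# True when the available version sorts at or above the required one (">=").
# Relies on GNU `sort -V`, an approximation of RPM's comparison rules.
satisfies_min() {
    lowest=$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)
    [ "$lowest" = "$2" ]
}

# Example (not run here):
#   satisfies_exact 1:5.8-14.el8_2.1 1:5.8-10.el8 || echo "exact match fails"
#   satisfies_min   1:5.8-14.el8_2.1 1:5.8-10.el8 && echo "minimum is met"
```

With an exact match, every update of net-snmp-libs in the distribution needs a matching net-snmp-perl build before the install can resolve.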

Thanks,

> On 7 Sep 2020, at 06:14, Jean-Baptiste Denis  wrote:
> 
> You can see the discussion here: 
> https://github.com/xcat2/xcat-core/issues/6715
> 
> Right now, I'm using the suggestion I made (using net-snmp-perl from 
> centos8-stream/Appstream): 
> https://github.com/xcat2/xcat-core/issues/6715#issuecomment-660460514
> 
> Jean-Baptiste
> 
> On 9/6/20 1:50 AM, Vinícius Ferrão via xCAT-user wrote:
>> Hello,
>> I don’t know I’m probably missing something, or if things got broken with 
>> xCAT 2.16; after installing CentOS 8.2, the dnf install xCAT command fails:
>> [root@cluster ~]# dnf install xCAT
>> CentOS-8 - AppStream 
>>   2.0 MB/s | 5.8 MB 00:02
>> CentOS-8 - Base  
>>   2.6 MB/s | 2.2 MB 00:00
>> CentOS-8 - Extras
>>14 kB/s | 7.3 kB 00:00
>> CentOS-8 - PowerTools
>>   872 kB/s | 1.9 MB 00:02
>> OpenHPC-2 - Base 
>>   731 kB/s | 2.5 MB 00:03
>> OpenHPC-2 - Updates  
>>   430  B/s | 257  B 00:00
>> Extra Packages for Enterprise Linux Modular 8 - x86_64   
>>43 kB/s | 117 kB 00:02
>> Extra Packages for Enterprise Linux 8 - x86_64   
>>96 kB/s | 7.9 MB 01:24
>> xCAT 2 Core packages 
>>27 kB/s |  30 kB 00:01
>> xCAT 2 depedencies   
>>94 kB/s | 104 kB 00:01
>> ZFS on Linux for EL8 - dkms  
>>30 kB/s |  48 kB 00:01
>> Error:
>>  Problem: package xCAT-2.16-snap202006161607.x86_64 requires 
>> perl(xCAT::MsgUtils), but none of the providers can be installed
>>   - package xCAT-2.16-snap202006161607.x86_64 requires perl(xCAT::Utils), 
>> but none of the providers can be installed
>>   - package perl-xCAT-4:2.16-snap202006161607.noarch requires perl(SNMP), 
>> but none of the providers can be installed
>>   - cannot install the best candidate for the job
>>   - nothing provides net-snmp-agent-libs(x86-64) = 1:5.8-10.el8 needed by 
>> net-snmp-perl-1:5.8-10.el8.x86_64
>>   - nothing provides net-snmp-libs(x86-64) = 1:5.8-10.el8 needed by 
>> net-snmp-perl-1:5.8-10.el8.x86_64
>>   - nothing provides net-snmp-agent-libs(x86-64) = 1:5.8-7.el8 needed by 
>> net-snmp-perl-1:5.8-7.el8.x86_64
>>   - nothing provides net-snmp-libs(x86-64) = 1:5.8-7.el8 needed by 
>> net-snmp-perl-1:5.8-7.el8.x86_64
>> (try to add '--skip-broken' to skip uninstallable packages or '--nobest' to 
>> use not only best candidate packages)
>> xCAT appears to require specific version (=) of net-snmp* when it should be 
>> major or equal (>=).
>> Those packages are availabl

Re: [xcat-user] confluent 3.0.1 release

2020-09-12 Thread Vinícius Ferrão via xCAT-user
Hi Jarrod,

Indeed, Confluent has a lot of cool features while still using xCAT as a 
basis. The SSH infrastructure and security features are the best additions 
IMO. A lot of folks, including myself, are adapting xCAT to be more secure 
(SELinux/Firewall) and more compliant, changing or removing entire postscripts 
for example, because they are now dated, especially the SSH ones.

With this in mind, would the xCAT devs be willing to incorporate those changes 
so everybody can benefit from them?

Thanks all.

On 10 Sep 2020, at 16:05, Jarrod Johnson <jjohns...@lenovo.com> wrote:

For those that may be interested, confluent 3 is out now:
https://hpc.lenovo.com/users/hpc/update/2020/09/10/20brelease.html

This marks the first time that confluent may be used for OS deployment for 
those that are interested.


Re: [xcat-user] [External] Re: Issues with net-snmp on new CentOS 8 system and xCAT 2.16 install

2020-09-12 Thread Vinícius Ferrão via xCAT-user
Hi Jarrod, so what you’re saying is that net-snmp-perl should now be shipped by 
the xCAT team instead of relying on the distribution repositories? This would 
be a permanent solution, right?

On 8 Sep 2020, at 09:21, Jarrod Johnson <jjohns...@lenovo.com> wrote:

So the general problem is that CentOS/RedHat discontinued packaging 
net-snmp-perl

However, net-snmp-perl always carries an *exact-match* dependency on other 
packages. So net-snmp-libs cannot be upgraded if net-snmp-perl is installed and 
you don't also have an upgrade for net-snmp-perl.

So it requires an effort to continuously stay on top of net-snmp and build all 
the rpms so we are at or ahead of the updates.

For Lenovo, our approach has been to remove the hard requirement on 
net-snmp-perl, so that all of xCAT (with the exception of switch based 
discovery) will work if skipping the net-snmp mess.

Of course switch based discovery is popular, for those people the choices are:
-Blocking updates to net-snmp*/installing net-snmp without the update 
repositories enabled
-Waiting for xCAT repository to catch up
-Building net-snmp* rpms from source to force include the perl support 
(https://git.centos.org/rpms/net-snmp/tree/c8)
-Using confluent for switch based discovery and then feed the macs to xCAT

From: Jean-Baptiste Denis <jbde...@pasteur.fr>
Sent: Tuesday, September 8, 2020 3:02 AM
To: xcat-user@lists.sourceforge.net
Subject: [External] Re: [xcat-user] Issues with net-snmp on new CentOS 8 system 
and xCAT 2.16 install

> Using the Continous Release repository lead to things like this…

Your answer makes me think I was lucky at the time I've installed it, which is 
not a good situation.

I'll try to go back to this problem in the next days.

Jean-Baptiste

On 9/8/20 1:31 AM, Vinícius Ferrão via xCAT-user wrote:
> Thanks Jean, this appears to be a 4 months old bug. Woot.
>
> I did what you’ve said but ended up other missing packages:
> [root@headnode ~]# dnf install xCAT
> Last metadata expiration check: 0:06:55 ago on Mon 07 Sep 2020 08:18:32 PM 
> -03.
> Error:
>  Problem: package xCAT-2.16-snap202006161607.x86_64 requires perl(CGI), but 
> none of the providers can be installed
>   - package perl-CGI-4.38-2.el8.noarch requires perl(HTML::Entities) >= 3.69, 
> but none of the providers can be installed
>   - cannot install the best candidate for the job
>   - package perl-HTML-Parser-3.72-14.el8.x86_64 is filtered out by modular 
> filtering
>   - package perl-CGI-4.38-2.el8.noarch is filtered out by exclude filtering
>   - package perl-HTML-Parser-3.72-15.module_el8.3.0+416+b5fcb464.x86_64 is 
> filtered out by exclude filtering
>   - package perl-HTML-Parser-3.72-15.module_el8.3.0+416+dee7bcef.x86_64 is 
> filtered out by exclude filtering
> (try to add '--skip-broken' to skip uninstallable packages or '--nobest' to 
> use not only best candidate packages)
>
>
> Using the Continous Release repository lead to things like this…
>
> What is strange in the original issue, is that the specific requirement is 
> tiered as an equal version, and in CentOS 8.2 the package is available but 
> it’s different in the last number:
> nothing provides net-snmp-agent-libs(x86-64) = 1:5.8-10.el8 needed by 
> net-snmp-perl-1:5.8-10.el8.x86_64
> net-snmp-agent-libs x86_64  1:5.8-14.el8_2.1  
>AppStream  747 k
>
> So xCAT mandates for 1.5.8-10; when 1.5.8-14 is available.
>
> Thanks,
>
>> On 7 Sep 2020, at 06:14, Jean-Baptiste Denis <jbde...@pasteur.fr> wrote:
>>
>> You can see the discussion here: 
>> https://github.com/xcat2/xcat-core/issues/6715
>>
>> Right now, I'm using the suggestion I made (using net-snmp-perl from 
>> centos8-stream/Appstream): 
>> https://github.com/xcat2/xcat-core/issues/6715#issuecomment-660460514
>>
>> Jean-Baptiste
>>
>> On 9/6/20 1:50 AM, Vinícius Ferrão via xCAT-user wrote:
>>> Hello,
>>> I don’t know I’m probably missing something, or if things got broken with 
>>> xCAT 2.16; after installing CentOS 8.2, the dnf install xCAT command fails:
>>> [root@cluster ~]# dnf install xCAT
>>> CentOS-8 - AppStream
>>>2.0 MB/s | 5.8 MB 00:02
>>> CentOS-8 - Base 
>>>2.6 MB/s | 2.2 MB 00:00
>>> CentOS-8 - Extras   
>>> 14 kB/s | 7.3 kB 00:00
>>> CentOS-8 - PowerTools 

Re: [xcat-user] xCAT On DELL Power Edge C6420 servers - any current users

2020-10-26 Thread Vinícius Ferrão via xCAT-user
You should make sure that IPMI over LAN is enabled on the iDRAC. It comes 
disabled by default.


Sent from my iPhone

On 26 Oct 2020, at 11:53, Chiu, Peter (STFC,RAL,RALSP) wrote:


Dear All,

We are thinking of adding a DELL Power Edge C6400 server
(a 6400 chassis with 4 x C6420 PowerEdge Units each with
2 Intel Xeon Gold 6230 CPUs, 128GB memory and iDRAC Enterprise)
to our xCAT cluster in diskless mode.

Before investing on this hardware, I wonder if anyone has any
experience with this set up and willing to share any watch-outs.

Many thanks.

Regards,
Peter Chiu
STFC RAL Space



Re: [xcat-user] xCAT On DELL Power Edge C6420 servers - any current users

2020-10-26 Thread Vinícius Ferrão via xCAT-user
After delivery. There’s an iDRAC option for that:

[inline screenshot not preserved]

On 26 Oct 2020, at 14:25, Chiu, Peter (STFC,RAL,RALSP) <peter.c...@stfc.ac.uk> wrote:

Thanks a lot, Vinicius,

“Enabling IPMI over LAN interfaces on IDRAC”, is that something
needs to be done by DELL during manufacturing, or something that
can be enabled after delivery?

Regards,
Peter
From: Vinícius Ferrão via xCAT-user <xcat-user@lists.sourceforge.net>
Sent: 26 October 2020 15:41
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Cc: Vinícius Ferrão <fer...@versatushpc.com.br>
Subject: Re: [xcat-user] xCAT On DELL Power Edge C6420 servers - any current 
users

You should guarantee that IPMI over LAN interfaces are enabled on IDRAC. It 
came disabled by default.

Sent from my iPhone


On 26 Oct 2020, at 11:53, Chiu, Peter (STFC,RAL,RALSP) <peter.c...@stfc.ac.uk> wrote:

Dear All,

We are thinking of adding a DELL Power Edge C6400 server
(a 6400 chassis with 4 x C6420 PowerEdge Units each with
2 Intel Xeon Gold 6230 CPUs, 128GB memory and iDRAC Enterprise)
to our xCAT cluster in diskless mode.

Before investing on this hardware, I wonder if anyone has any
experience with this set up and willing to share any watch-outs.

Many thanks.

Regards,
Peter Chiu
STFC RAL Space



Re: [xcat-user] UEFI boot configuration for xCat?

2021-01-04 Thread Vinícius Ferrão via xCAT-user
xCAT already supports UEFI out of the box when netboot=xnba is used.

If you have issues during boot, you should manually try the drop-in 
replacement files mentioned in this bug: https://github.com/xcat2/xNBA/issues/2

If it works after the changes, please report back in the bug so the updated 
version can be merged into the stable xCAT tree.

Sent from my iPhone

On 4 Jan 2021, at 02:35, Sandra Maksimovic wrote:


Hi,

I was wondering if there is a guide on configuring xCat to successfully PXE 
boot UEFI enabled systems?

Thanks,
Sandra





Re: [xcat-user] /etc/resolv.conf missing nameserver on install nodes

2021-06-15 Thread Vinícius Ferrão via xCAT-user
I was able to at least stop /etc/resolv.conf from being overwritten at every 
reboot with the following file:

# cat /etc/NetworkManager/conf.d/90-dns-none.conf
[main]
dns=none

I added this to the synclists and we are good regarding the /etc/resolv.conf 
issue. The conclusion is that NetworkManager was doing something wrong with 
/etc/resolv.conf.
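
For nodes where a synclist is not convenient, the same drop-in could be written by a small script. This is a sketch only: write_dns_none is a made-up function name, and the optional directory argument exists just so it can be tried outside /etc.

```shell
#!/bin/sh
# write_dns_none is a hypothetical helper, not an xCAT postscript.
# Writes the NetworkManager drop-in shown above, which tells NetworkManager
# not to manage /etc/resolv.conf.
# Usage: write_dns_none [CONF_DIR]
write_dns_none() {
    conf_dir=${1:-/etc/NetworkManager/conf.d}
    mkdir -p "$conf_dir" &&
        printf '[main]\ndns=none\n' > "$conf_dir/90-dns-none.conf"
}

# Example (not run here):
#   write_dns_none
```

NetworkManager only picks the setting up when it rereads its configuration, for instance at the next boot.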

Although that was fixed with a hack, it has consequences: the hostname of the 
machine is set to localhost.localdomain, and I don't know how to fix it. Is 
there any option in the node table to set the default hostname, so that 
confignetwork can do its job?
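
Whether a node-table attribute covers this is not settled in this thread. As a stopgap, a custom postscript could set the hostname itself. The sketch below is an assumption-laden example: pick_hostname is a made-up helper, and it relies on the NODE environment variable that xCAT exports to postscripts.

```shell
#!/bin/sh
# pick_hostname is a hypothetical helper for a custom postscript.
# Keeps the current hostname unless it is empty or still a localhost
# placeholder, in which case it falls back to the xCAT node name.
# Usage: pick_hostname CURRENT NODE
pick_hostname() {
    current=$1
    node=$2
    case $current in
        ''|localhost|localhost.*) echo "$node" ;;
        *) echo "$current" ;;
    esac
}

# Inside a postscript this might be applied as (not run here):
#   hostnamectl set-hostname "$(pick_hostname "$(hostname)" "$NODE")"
```

Running it after confignetwork in the postscripts list would avoid fighting with whatever that script sets.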

# lsdef login
Object name: login
arch=x86_64
bmc=172.25.255.253
bmcpassword=calvin
bmcusername=root
cons=ipmi
consoleenabled=1
currchain=boot
currstate=install ol8.4.0-x86_64-compute
groups=login,all
ip=172.26.255.253
mac=2c:ea:7f:92:aa:d9
mgt=ipmi
netboot=xnba
nicdevices.bond0=ens1f0np0|ens1f1np1
nicdevices.bond0.1010=bond0
nichostnamesuffixes.bond0.1010=-ceph
nicips.ib0=172.27.255.253
nicips.eno1=XXX.XXX.XXX.XXX
nicips.bond0=172.26.255.253
nicips.bond0.1010=10.0.255.253
nicnetworks.ib0=application
nicnetworks.eno1=site
nicnetworks.bond0=management
nicnetworks.bond0.1010=ceph
nictypes.ens1f1np1=ethernet
nictypes.bond0=bond
nictypes.eno1=ethernet
nictypes.ib0=Infiniband
nictypes.bond0.1010=vlan
nictypes.ens1f0np0=ethernet
os=ol8.4.0
postbootscripts=otherpkgs,versatushpc/openpbs-login,versatushpc/fix-ohpc-login
postscripts=syslog,remoteshell,syncfiles,confignetwork,versatushpc/postinstall-login
profile=compute
provmethod=ol8.4.0-x86_64-install-login
serialport=0
serialspeed=115200
status=powering-on
statustime=06-15-2021 16:29:52
updatestatus=failed
updatestatustime=06-15-2021 16:27:27

Thanks,
Vinícius.

On 14 Jun 2021, at 13:48, Vinícius Ferrão via xCAT-user 
<xcat-user@lists.sourceforge.net> wrote:

Hi Thomas,

There's a pattern that I've found. When the compute node is simple enough it 
works, probably because the data for resolv.conf is fetched directly from 
DHCP, which should be configured correctly.

The issue is around the nodes that have custom network schemes, like bonds and 
VLANs; something goes wrong during the confignetwork postscript. It is probably 
due to a configuration mistake that I've made, but I don't know which one.

Regarding your questions:

1) It does not exist

[root@ceph01-ib0 ~]# systemctl status systemd-networkd
Unit systemd-networkd.service could not be found.

2) It's running

[root@ceph01-ib0 ~]# systemctl status NetworkManager
● NetworkManager.service - Network Manager
   Loaded: loaded (/usr/lib/systemd/system/NetworkManager.service; enabled; vendor preset: enabled)
   Active: active (running) since Mon 2021-06-14 13:37:20 -03; 8min ago
     Docs: man:NetworkManager(8)
 Main PID: 2028 (NetworkManager)
    Tasks: 3 (limit: 2464038)
   Memory: 11.4M
   CGroup: /system.slice/NetworkManager.service
           └─2028 /usr/sbin/NetworkManager --no-daemon

3) It does not exist:

[root@ceph01-ib0 ~]# ls -l /etc/resolv.conf
-rw-r--r-- 1 root root 65 Jun 14 13:37 /etc/resolv.conf
[root@ceph01-ib0 ~]# ls -l /run/systemd/resolv/resolv.conf
ls: cannot access '/run/systemd/resolv/resolv.conf': No such file or directory

Cannot find anything related to rc-manager, is this a systemd thing?

4) No it's not.

[root@ceph01-ib0 ~]# ls -l /etc/resolv.conf
-rw-r--r-- 1 root root 65 Jun 14 13:37 /etc/resolv.conf

5) Seems default to me

[root@ceph01-ib0 ~]# grep host /etc/nsswitch.conf
# Valid databases are: aliases, ethers, group, gshadow, hosts,
# myhostname Use systemd host names
hosts:  files dns myhostname

That's it.

It's probably something messy in the confignetwork script, but I'm not sure what.

Thanks,

On 14 Jun 2021, at 07:57, Thomas HUMMEL 
<thomas.hum...@pasteur.fr> wrote:



On 14/06/2021 07:41, Vinícius Ferrão via xCAT-user wrote:
Hello,
For unknown reasons nodes that I've installed with rinstall (using stateful 
method) didn't get the nameserver section in resolv.conf, basically leaving the 
node without any name resolution.

Hello,

assuming it is not an xCAT bug, I would look at

1) if systemd-networkd is enabled
2) if NetworkManager is enabled
3) if 2), whether it handles /etc/resolv.conf, by looking at its conf:
   a) is dns= stated ?
   b) is /etc/resolv.conf a symlink to /run/systemd/resolv/resolv.conf ?
   c) is rc-manager stated ?
4) is /etc/resolv.conf a symlink to ../run/resolvconf/resolv.conf ?
5) the host line of /etc/nsswitch.conf

to figure out who manages /etc/resolv.conf
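
The checks above can be sketched as a small shell helper. This is only an 
illustration under assumptions: the function names are made up here, and the 
default paths are the usual locations on an EL8-style system.

```shell
#!/bin/sh
# Sketch only: helper names are hypothetical, not part of xCAT or NetworkManager.

# Checks 3b and 4: is the given resolv.conf a regular file, a symlink, or missing?
describe_resolv_conf() {
    path="${1:-/etc/resolv.conf}"
    if [ -L "$path" ]; then
        printf 'symlink -> %s\n' "$(readlink "$path")"
    elif [ -e "$path" ]; then
        printf 'regular file\n'
    else
        printf 'missing\n'
    fi
}

# Checks 3a and 3c: print any dns= or rc-manager= lines found in the
# NetworkManager configuration directory (default /etc/NetworkManager).
nm_dns_settings() {
    confdir="${1:-/etc/NetworkManager}"
    grep -hE '^[[:space:]]*(dns|rc-manager)[[:space:]]*=' \
        "$confdir"/NetworkManager.conf "$confdir"/conf.d/*.conf 2>/dev/null
}
```

On a node, checks 1 and 2 would be "systemctl is-enabled systemd-networkd 
NetworkManager", followed by describe_resolv_conf and nm_dns_settings with no 
arguments, and "grep '^hosts' /etc/nsswitch.conf" for check 5. NetworkManager 
can also read drop-ins from other directories, which this sketch does not scan.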

Hope it helps.

--
Thomas HUMMEL



Re: [xcat-user] [ext] /etc/resolv.conf missing nameserver on install nodes

2021-06-14 Thread Vinícius Ferrão via xCAT-user
Hi Georgios,

Thanks for the tip. I've never heard of mkresolvconf in more than 5 years using 
xCAT. But yes, it does generate the /etc/resolv.conf correctly when issuing the 
updatenode command, but not while the machine is booting.

Something may be overwriting the /etc/resolv.conf file.

But there's still a tricky behaviour. The compute node is selecting its name 
from the Infiniband interface, which is incorrect. It should use its name from 
the management interface. That was happening before the mkresolvconf trick 
anyway, and again, I don't have any idea why this is happening.

Thanks,
Vinícius.


On 14 Jun 2021, at 07:10, Nikolis, Georgios 
<georgios.niko...@charite.de> wrote:

I think you need to add the mkresolvconf postscript into the node's 
postbootscripts.
Does "updatenode  -P mkresolvconf" repair nameserver configuration on an 
affected node?
Cheers,
Georgios


--

Georgios Nikolis
Charité – Universitätsmedizin Berlin
Geschäftsbereich IT | Scientific Computing

Campus Charité Mitte
Rahel-Hirsch-Weg 5 | Ebene 02 | Raum 073
Charitéplatz 1 | 10117 Berlin

georgios.niko...@charite.de
https://www.charite.de


On Mon, 2021-06-14 at 05:41 +, Vinícius Ferrão via xCAT-user wrote:
Hello,

For unknown reasons nodes that I've installed with rinstall (using stateful 
method) didn't get the nameserver section in resolv.conf, basically leaving the 
node without any name resolution.

As specified in the documentation 
https://xcat-docs.readthedocs.io/en/stable/advanced/domain_name_resolution/domain_name_resolution.html 
it should be generated if nameservers and domain are provided in the site 
table: The resolv.conf files for the compute nodes will be created 
automatically using the domain and nameservers values set in the xCAT network 
or site definition.

Both are defined but it still didn't generate it correctly.

[root@headnode ~]# lsdef -t site clustersite | egrep "nameserver|forward|domain"
domain=cluster.domain.tld
forwarders=1.1.1.1
nameservers=172.26.255.254
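
With these site values, one would expect the generated /etc/resolv.conf on a 
node to look roughly like the fragment below. This is a sketch in standard 
resolv.conf syntax, not captured output; whether xCAT writes a "search" or a 
"domain" line is an assumption here.

```
# Expected shape only, not actual output from the affected nodes
search cluster.domain.tld
nameserver 172.26.255.254
```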

I even tried adding the nameservers to the network definition, but it was a no 
go:

[root@headnode ~]# lsdef -t network management
Object name: management
gateway=
mask=255.255.0.0
mgtifname=bond0
mtu=1500
nameservers=172.26.255.254
net=172.26.0.0
tftpserver=

Is there anything that I can do to debug this?

Thanks,
Vinícius.

PS: Here's full data from a given node and the networks.

[root@headnode ~]# lsdef ceph01
Object name: ceph01
arch=x86_64
bmc=172.25.254.1
bmcpassword=calvin
bmcusername=root
cons=ipmi
consoleenabled=1
currchain=boot
currstate=install ol8.4.0-x86_64-compute
groups=ceph,all
ip=172.26.254.1
mac=bc:97:e1:ea:08:b0
mgt=ipmi
netboot=xnba
nicdevices.bond0.123=bond0
nicdevices.bond0.1010=bond0
nicdevices.bond0=ens1f0np0|ens1f1np1
nichostnamesuffixes.bond0.1010=-ceph
nichostnamesuffixes.bond0.123=-cephsync
nicips.ib0=172.27.254.1
nicips.bond0=172.26.254.1
nicips.bond0.1010=10.0.10.21
nicips.bond0.123=192.168.168.21
nicnetworks.bond0.123=ceph-sync
nicnetworks.ib0=application
nicnetworks.bond0.1010=ceph
nicnetworks.bond0=management
nictypes.ib0=Infiniband
nictypes.ens1f0np0=ethernet
nictypes.bond0.1010=vlan
nictypes.bond0=bond
nictypes.ens1f1np1=ethernet
nictypes.bond0.123=vlan
os=ol8.4.0
postbootscripts=otherpkgs,confignics
postscripts=syslog,remoteshell,syncfiles,confignetwork,versatushpc/postinstall-ceph
profile=compute
provmethod=ol8.4.0-x86_64-install-ceph
serialport=0
serialspeed=115200
status=booted
statustime=06-14-2021 02:37:04
updatestatus=synced
updatestatustime=06-14-2021 02:01:55

[root@headnode ~]# lsdef -t network
application  (network)
ceph  (network)
ceph-sync  (network)
libvirt  (network)
management  (network)
service  (network)
site  (network)



___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] [ext] /etc/resolv.conf missing nameserver on install nodes

2021-06-14 Thread Vinícius Ferrão via xCAT-user
.254.1"
[I]: >> NETMASK="255.255.0.0"
[I]: >> NAME="bond0"

configure nic and its device : bond0.1010 bond0
[I]: create_vlan_interface ifname=bond0 vlanid=1010
[I]: Pickup xcatnet, "ceph", from NICNETWORKS for interface "bond0".
[I]: ip link add link bond0 name bond0.1010 type vlan id 1010
RTNETLINK answers: Numerical result out of range
[I]: ip link set bond0.1010 up
[I]: create_persistent_ifcfg ifname=bond0.1010 xcatnet=ceph 
inattrs=ONBOOT=yes,USERCTL=no,VLAN=yes,MTU=9216
['ifcfg-bond0.1010']
[I]: >> ONBOOT="yes"
[I]: >> USERCTL="no"
[I]: >> VLAN="yes"
[I]: >> MTU="9216"
[I]: >> DEVICE="bond0.1010"
[I]: >> BOOTPROTO="static"
[I]: >> IPADDR="10.0.254.1"
[I]: >> NETMASK="255.255.255.0"
[I]: >> NAME="bond0.1010"

configure nic and its device : bond0.123 bond0
[I]: create_vlan_interface ifname=bond0 vlanid=123
[I]: Pickup xcatnet, "ceph-sync", from NICNETWORKS for interface "bond0".
[I]: ip link add link bond0 name bond0.123 type vlan id 123
RTNETLINK answers: Numerical result out of range
[I]: ip link set bond0.123 up
[I]: State of "bond0.123" was "UNKNOWN" instead of expected "UP". Wait 0 of 200 
with interval 1.
[I]: create_persistent_ifcfg ifname=bond0.123 xcatnet=ceph-sync 
inattrs=ONBOOT=yes,USERCTL=no,VLAN=yes,MTU=9216
['ifcfg-bond0.123']
[I]: >> ONBOOT="yes"
[I]: >> USERCTL="no"
[I]: >> VLAN="yes"
[I]: >> MTU="9216"
[I]: >> DEVICE="bond0.123"
[I]: >> BOOTPROTO="static"
[I]: >> IPADDR="192.168.168.21"
[I]: >> NETMASK="255.255.255.0"
[I]: >> NAME="bond0.123"
Mon Jun 14 14:39:54 -03 2021 [info]: xcat.deployment.postscript: postscript 
confignetwork return with 1

The mkresolvconf trick worked only on the first install. After a reboot, 
resolv.conf was overwritten, probably by NetworkManager?

Thanks,
Vinícius.

On 14 Jun 2021, at 14:10, Vinícius Ferrão via xCAT-user 
<xcat-user@lists.sourceforge.net> wrote:

Hi Georgios,

Thanks for the tip. I've never heard of mkresolvconf in more than 5 years using 
xCAT. But yes, it does generate the /etc/resolv.conf correctly when issuing the 
updatenode command, but not while the machine is booting.

Something may be overwriting the /etc/resolv.conf file.

But there's still a tricky behaviour. The compute node is selecting its name 
from the Infiniband interface, which is incorrect. It should use its name from 
the management interface. That was happening before the mkresolvconf trick 
anyway, and again, I don't have any idea why this is happening.

Thanks,
Vinícius.


On 14 Jun 2021, at 07:10, Nikolis, Georgios 
<georgios.niko...@charite.de> wrote:

I think you need to add the mkresolvconf postscript into the node's 
postbootscripts.
Does "updatenode  -P mkresolvconf" repair nameserver configuration on an 
affected node?
Cheers,
Georgios


--

Georgios Nikolis
Charité – Universitätsmedizin Berlin
Geschäftsbereich IT | Scientific Computing

Campus Charité Mitte
Rahel-Hirsch-Weg 5 | Ebene 02 | Raum 073
Charitéplatz 1 | 10117 Berlin

georgios.niko...@charite.de
https://www.charite.de


On Mon, 2021-06-14 at 05:41 +, Vinícius Ferrão via xCAT-user wrote:
Hello,

For unknown reasons nodes that I've installed with rinstall (using stateful 
method) didn't get the nameserver section in resolv.conf, basically leaving the 
node without any name resolution.

As specified in the documentation 
https://xcat-docs.readthedocs.io/en/stable/advanced/domain_name_resolution/domain_name_resolution.html 
it should be generated if nameservers and domain are provided in the site 
table: The resolv.conf files for the compute nodes will be created 
automatically using the domain and nameservers values set in the xCAT network 
or site definition.

Both are defined but it still didn't generate it correctly.

[root@headnode ~]# lsdef -t site clustersite | egrep "nameserver|forward|domain"
domain=cluster.domain.tld
forwarders=1.1.1.1
nameservers=172.26.255.254

I even tried adding the nameservers to the network definition, but it was a no 
go:

[root@headnode ~]# lsdef -t network management
Object name: management
gateway=
mask=255.255.0.0
mgtifname=bond0
mtu=1500
nameservers=172.26.255.254
net=172.26.0.0
tftpserver=

Is there anything that I can do to debug this?

Thanks,
Vinícius.

PS: Here's full data from a given node and the networks.

[root@headnode ~]# l

Re: [xcat-user] /etc/resolv.conf missing nameserver on install nodes

2021-06-14 Thread Vinícius Ferrão via xCAT-user
Hi Thomas,

There's a pattern that I've found. When the compute node is simple enough it 
works, probably because the data for resolv.conf is fetched directly from 
DHCP, which should be configured correctly.

The issue is around the nodes that have custom network schemes, like bonds and 
VLANs; something goes wrong during the confignetwork postscript. It is probably 
due to a configuration mistake that I've made, but I don't know which one.

Regarding your questions:

1) It does not exist

[root@ceph01-ib0 ~]# systemctl status systemd-networkd
Unit systemd-networkd.service could not be found.

2) It's running

[root@ceph01-ib0 ~]# systemctl status NetworkManager
● NetworkManager.service - Network Manager
   Loaded: loaded (/usr/lib/systemd/system/NetworkManager.service; enabled; vendor preset: enabled)
   Active: active (running) since Mon 2021-06-14 13:37:20 -03; 8min ago
     Docs: man:NetworkManager(8)
 Main PID: 2028 (NetworkManager)
    Tasks: 3 (limit: 2464038)
   Memory: 11.4M
   CGroup: /system.slice/NetworkManager.service
           └─2028 /usr/sbin/NetworkManager --no-daemon

3) It does not exist:

[root@ceph01-ib0 ~]# ls -l /etc/resolv.conf
-rw-r--r-- 1 root root 65 Jun 14 13:37 /etc/resolv.conf
[root@ceph01-ib0 ~]# ls -l /run/systemd/resolv/resolv.conf
ls: cannot access '/run/systemd/resolv/resolv.conf': No such file or directory

Cannot find anything related to rc-manager, is this a systemd thing?

4) No it's not.

[root@ceph01-ib0 ~]# ls -l /etc/resolv.conf
-rw-r--r-- 1 root root 65 Jun 14 13:37 /etc/resolv.conf

5) Seems default to me

[root@ceph01-ib0 ~]# grep host /etc/nsswitch.conf
# Valid databases are: aliases, ethers, group, gshadow, hosts,
# myhostname Use systemd host names
hosts:  files dns myhostname

That's it.

It's probably something messy in the confignetwork script, but I'm not sure what.

Thanks,

On 14 Jun 2021, at 07:57, Thomas HUMMEL 
<thomas.hum...@pasteur.fr> wrote:



On 14/06/2021 07:41, Vinícius Ferrão via xCAT-user wrote:
Hello,
For unknown reasons nodes that I've installed with rinstall (using stateful 
method) didn't get the nameserver section in resolv.conf, basically leaving the 
node without any name resolution.

Hello,

assuming it is not an xCAT bug, I would look at

1) if systemd-networkd is enabled
2) if NetworkManager is enabled
3) if 2), whether it handles /etc/resolv.conf, by looking at its conf:
   a) is dns= stated ?
   b) is /etc/resolv.conf a symlink to /run/systemd/resolv/resolv.conf ?
   c) is rc-manager stated ?
4) is /etc/resolv.conf a symlink to ../run/resolvconf/resolv.conf ?
5) the host line of /etc/nsswitch.conf

to figure out who manages /etc/resolv.conf

Hope it helps.

--
Thomas HUMMEL


rc-manager=
As specified in the documentation 
https://xcat-docs.readthedocs.io/en/stable/advanced/domain_name_resolution/domain_name_resolution.html 
it should be generated if nameservers and domain are provided in the site 
table: The resolv.conf files for the compute nodes will be created 
automatically using the domain and nameservers values set in the xCAT network 
or site definition.
Both are defined but it still didn't generate it correctly.
[root@headnode ~]# lsdef -t site clustersite | egrep "nameserver|forward|domain"
domain=cluster.domain.tld
forwarders=1.1.1.1
nameservers=172.26.255.254
I even tried adding the nameservers to the network definition, but it was a no 
go:
[root@headnode ~]# lsdef -t network management
Object name: management
gateway=
mask=255.255.0.0
mgtifname=bond0
mtu=1500
nameservers=172.26.255.254
net=172.26.0.0
tftpserver=
Is there anything that I can do to debug this?
Thanks,
Vinícius.
PS: Here's full data from a given node and the networks.
[root@headnode ~]# lsdef ceph01
Object name: ceph01
arch=x86_64
bmc=172.25.254.1
bmcpassword=calvin
bmcusername=root
cons=ipmi
consoleenabled=1
currchain=boot
currstate=install ol8.4.0-x86_64-compute
groups=ceph,all
ip=172.26.254.1
mac=bc:97:e1:ea:08:b0
mgt=ipmi
netboot=xnba
nicdevices.bond0.123=bond0
nicdevices.bond0.1010=bond0
nicdevices.bond0=ens1f0np0|ens1f1np1
nichostnamesuffixes.bond0.1010=-ceph
nichostnamesuffixes.bond0.123=-cephsync
nicips.ib0=172.27.254.1
nicips.bond0=172.26.254.1
nicips.bond0.1010=10.0.10.21
nicips.bond0.123=192.168.168.21
nicnetworks.bond0.123=ceph-sync
nicnetworks.ib0=application
nicnetworks.bond0.1010=ceph
nicnetworks.bond0=management
nictypes.ib0=Infiniband
nictypes.ens1f0np0=ethernet
nictypes.bond0.1010=vlan
nictypes.bond0=bond
nictypes.ens1f1np1=ethernet
nictypes.bond0.123=vlan
os=ol8.4.0
postbootscripts=otherpkgs,confignics
 
postscripts=syslog

Re: [xcat-user] /etc/resolv.conf missing nameserver on install nodes

2021-06-17 Thread Vinícius Ferrão via xCAT-user
Jon thank you man.

I'm looking at the code right now and trying to understand it.

On 17 Jun 2021, at 13:27, Jon Diprose 
<j...@well.ox.ac.uk> wrote:

Hi Vinícius,
I've dumped the various parts I think are necessary in the following gist:

https://gist.github.com/JDiprose/f377a97998a971bbe1e591e5afd4e1c6

See particularly the 'installnic' and 'addkcmdline' node settings, and the 
referenced 'confignetwork-jon' postbootscript (small but significant addition 
to the original 'confignetwork') and the locally-written 
rescomp_persistinstallbond postscript. Modifications to the kickstart template 
are also required - see 
compute.centos74.no-ignoredisk.net-link.tmpl, referenced by the osimage.
If I remember correctly, all the important bits for installing on a bond are 
above (except the switch config, for which forcing up the PXE link is 
required). The 'addkcmdline' setting configures the bond for anaconda to use, 
the 'network --device=link' in the template tells anaconda to use it, the 
'rescomp_persistinstallbond' replaces the junk anaconda produces with a good 
approximation of what xCAT ought to do and the 'installnic' setting and 
modification to 'confignetwork' stops the 'confignetwork' script from 
destroying the config when run as a postbootscript to configure the secondary 
nics (it otherwise doesn't accept that a nic called bondX can be the install 
nic).
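
As a rough illustration of the two install-time pieces described above, the 
fragment below shows what a dracut-style bond definition on the kernel command 
line and the matching kickstart network line can look like. None of this is 
taken from the gist: the node name cn01, the interface names and the bonding 
options are placeholders, and the right values depend on the hardware and the 
switch setup.

```
# Hypothetical example, not Jon's actual settings.
# 1) Extra kernel parameters for the installer, set on the node definition
#    (dracut syntax: bond=<name>:<slaves>:<options> and ip=<interface>:dhcp):
chdef cn01 addkcmdline="bond=bond0:ens1f0np0,ens1f1np1:mode=802.3ad,miimon=100 ip=bond0:dhcp"

# 2) In the kickstart template, let anaconda use the device that has link:
network --bootproto=dhcp --device=link
```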
I haven't tested that the modified 'confignetwork' is capable of configuring 
the install nic as a bond in its own right. I don't expect that to work and I 
don't think xCAT has anywhere to hold the relevant config without a significant 
rewrite of 'confignetwork'.
This is obviously for a stateful install. I couldn't tell you anything about 
stateless or statelite.
Finally, apologies to all those whose work I have reused without reference in 
the above - I wasn't expecting to be sharing it. I am happy to answer questions 
on it, and especially to be told better ways to do it!
Jon

--
Dr. Jonathan Diprose <j...@well.ox.ac.uk>
 Tel: 01865 287873
Research Computing Manager
Henry Wellcome Building for Genomic Medicine
Roosevelt Drive, Headington, Oxford OX3 7BN
____
From: Vinícius Ferrão [fer...@versatushpc.com.br]
Sent: 17 June 2021 07:13
To: Jon Diprose
Cc: xCAT Users Mailing list
Subject: Re: [xcat-user] /etc/resolv.conf missing nameserver on install nodes

Hi Jon, thanks for your time.

You're absolutely correct regarding the network configuration. The node boots 
up with PXE from a single network card and then it creates the LACP bond. On 
the switch side that was achieved with LACP fallback. So yes, ip and 
nicips.bond0 are the same. It's the single interface that is later on bonded.

Regarding NetworkManager, I thought xCAT was using it to configure its 
network. So is there a way to disable it?

Finally, the confignetwork script works if I add it to the postbootscripts 
section instead of postscripts. In the latter I end up with the server without 
any network. Console login is unavailable because the root password does not 
work either, so I can't check what happened. I don't know why.

Anyway, here's the files:

Bondies first:
[root@login ~]# cat /etc/sysconfig/network-scripts/ifcfg-xcat-bond-bond0
BONDING_OPTS="mode=802.3ad miimon=100"
TYPE=Bond
BONDING_MASTER=yes
HWADDR=
MTU=1500
PROXY_METHOD=none
BROWSER_ONLY=no
BOOTPROTO=none
IPADDR=172.26.255.253
PREFIX=16
DEFROUTE=yes
IPV4_FAILURE_FATAL=no
IPV6INIT=yes
IPV6_AUTOCONF=yes
IPV6_DEFROUTE=yes
IPV6_FAILURE_FATAL=no
IPV6_ADDR_GEN_MODE=stable-privacy
NAME=xcat-bond-bond0
UUID=c7b9cbeb-2966-4e41-88a5-be212ca4b675
DEVICE=bond0
ONBOOT=yes
AUTOCONNECT_PRIORITY=9
AUTOCONNECT_RETRIES=0
AUTOCONNECT_SLAVES=yes

[root@login ~]# cat 
/etc/sysconfig/network-scripts/ifcfg-xcat-bond-slave-ens1f0np0
MTU=1500
TYPE=Ethernet
NAME=xcat-bond-slave-ens1f0np0
UUID=b9738cbf-908e-4c6f-8dad-29f422f75d11
DEVICE=ens1f0np0
ONBOOT=yes
AUTOCONNECT_PRIORITY=9
AUTOCONNECT_RETRIES=0
MASTER_UUID=c7b9cbeb-2966-4e41-88a5-be212ca4b675
MASTER=bond0
SLAVE=yes

[root@login ~]# cat 
/etc/sysconfig/network-scripts/ifcfg-xcat-bond-slave-ens1f1np1
MTU=1500
TYPE=Ethernet
NAME=xcat-bond-slave-ens1f1np1
UUID=667dc3c7-f2d9-44ea-9342-5e20fe54d8a8
DEVICE=ens1f1np1
ONBOOT=yes
AUTOCONNECT_PRIORITY=9
AUTOCONNECT_RETRIES=0
MASTER_UUID=c7b9cbeb-2966-4e41-88a5-be212ca4b675
MASTER=bond0
SLAVE=yes

This is not supposed to be here:
[root@login ~]# cat /etc/sysconfig/network-scripts/ifcfg-xcat-ens1f0np0
TYPE=Ethernet
PROXY_METHOD=none
BROWSER_ONLY=no
BOOTPROTO=none
IPADDR=172.26.255.253
PREFIX=16
DEFROUTE=yes
IPV4_FAILURE_FATAL=no
IPV6INIT=yes
IPV6_AUTOCONF=yes
IPV6_DEFROUTE=yes
IPV6_FAILURE_FATAL=no
IPV6_ADDR_GEN_MODE=stable-privacy
NAME=xcat-ens1f0np0
UUID=dd90bcec-1520-403e-a384-7edb2a53756b
DEVICE=ens1f0np0
ONBOOT=no
AUTOCONNECT_PRIORITY=9
MTU=1500
GATEWAY=172.26.255.254

That's the WAN i

[xcat-user] /etc/resolv.conf missing nameserver on install nodes

2021-06-13 Thread Vinícius Ferrão via xCAT-user
Hello,

For unknown reasons nodes that I've installed with rinstall (using stateful 
method) didn't get the nameserver section in resolv.conf, basically leaving the 
node without any name resolution.

As specified in the documentation 
https://xcat-docs.readthedocs.io/en/stable/advanced/domain_name_resolution/domain_name_resolution.html 
it should be generated if nameservers and domain are provided in the site 
table: The resolv.conf files for the compute nodes will be created 
automatically using the domain and nameservers values set in the xCAT network 
or site definition.

Both are defined but it still didn't generate it correctly.

[root@headnode ~]# lsdef -t site clustersite | egrep "nameserver|forward|domain"
domain=cluster.domain.tld
forwarders=1.1.1.1
nameservers=172.26.255.254

I even tried adding the nameservers to the network definition, but it was a no 
go:

[root@headnode ~]# lsdef -t network management
Object name: management
gateway=
mask=255.255.0.0
mgtifname=bond0
mtu=1500
nameservers=172.26.255.254
net=172.26.0.0
tftpserver=

Is there anything that I can do to debug this?

Thanks,
Vinícius.

PS: Here's full data from a given node and the networks.

[root@headnode ~]# lsdef ceph01
Object name: ceph01
arch=x86_64
bmc=172.25.254.1
bmcpassword=calvin
bmcusername=root
cons=ipmi
consoleenabled=1
currchain=boot
currstate=install ol8.4.0-x86_64-compute
groups=ceph,all
ip=172.26.254.1
mac=bc:97:e1:ea:08:b0
mgt=ipmi
netboot=xnba
nicdevices.bond0.123=bond0
nicdevices.bond0.1010=bond0
nicdevices.bond0=ens1f0np0|ens1f1np1
nichostnamesuffixes.bond0.1010=-ceph
nichostnamesuffixes.bond0.123=-cephsync
nicips.ib0=172.27.254.1
nicips.bond0=172.26.254.1
nicips.bond0.1010=10.0.10.21
nicips.bond0.123=192.168.168.21
nicnetworks.bond0.123=ceph-sync
nicnetworks.ib0=application
nicnetworks.bond0.1010=ceph
nicnetworks.bond0=management
nictypes.ib0=Infiniband
nictypes.ens1f0np0=ethernet
nictypes.bond0.1010=vlan
nictypes.bond0=bond
nictypes.ens1f1np1=ethernet
nictypes.bond0.123=vlan
os=ol8.4.0
postbootscripts=otherpkgs,confignics
postscripts=syslog,remoteshell,syncfiles,confignetwork,versatushpc/postinstall-ceph
profile=compute
provmethod=ol8.4.0-x86_64-install-ceph
serialport=0
serialspeed=115200
status=booted
statustime=06-14-2021 02:37:04
updatestatus=synced
updatestatustime=06-14-2021 02:01:55

[root@headnode ~]# lsdef -t network
application  (network)
ceph  (network)
ceph-sync  (network)
libvirt  (network)
management  (network)
service  (network)
site  (network)


___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] /etc/resolv.conf missing nameserver on install nodes

2021-06-18 Thread Vinícius Ferrão via xCAT-user
It won't hurt to sync /etc/resolv.conf. Great idea. Will do it.

On 18 Jun 2021, at 05:26, Jon Diprose 
<j...@well.ox.ac.uk> wrote:

Sorry, I should have also said that I have /etc/resolv.conf on my syncfiles 
list, so all that setup was to try to get xcat to give me ifcfg files that 
looked as I wanted, and particularly not to be left with an unexpected dhcp-ing 
interface that caused my resolv.conf to be eaten every time the lease renewed. 
I didn't want to meddle with xCAT-provided code more than necessary and I 
wanted to try to keep as much config as possible in the node definition. I 
started off down the track of maybe if I can get anaconda to use the bond as 
the install nic it will all be OK, and that's how I ended up where I did.
Someone else suggested you use the mkresolvconf postscript to provide the 
resolv.conf, which should also work.
Jon

--
Dr. Jonathan Diprose <j...@well.ox.ac.uk>
 Tel: 01865 287873
Research Computing Manager
Henry Wellcome Building for Genomic Medicine
Roosevelt Drive, Headington, Oxford OX3 7BN
____
From: Vinícius Ferrão [fer...@versatushpc.com.br]
Sent: 18 June 2021 06:35
To: Jon Diprose
Cc: xCAT Users Mailing list
Subject: Re: [xcat-user] /etc/resolv.conf missing nameserver on install nodes

Jon thank you man.

I'm looking at the code right now and trying to understand it.

On 17 Jun 2021, at 13:27, Jon Diprose 
<j...@well.ox.ac.uk> wrote:

Hi Vinícius,
I've dumped the various parts I think are necessary in the following gist:

https://gist.github.com/JDiprose/f377a97998a971bbe1e591e5afd4e1c6

See particularly the 'installnic' and 'addkcmdline' node settings, and the 
referenced 'confignetwork-jon' postbootscript (small but significant addition 
to the original 'confignetwork') and the locally-written 
rescomp_persistinstallbond postscript. Modifications to the kickstart template 
are also required - see 
compute.centos74.no-ignoredisk.net-link.tmpl, referenced by the osimage.
If I remember correctly, all the important bits for installing on a bond are 
above (except the switch config, for which forcing up the PXE link is 
required). The 'addkcmdline' setting configures the bond for anaconda to use, 
the 'network --device=link' in the template tells anaconda to use it, the 
'rescomp_persistinstallbond' replaces the junk anaconda produces with a good 
approximation of what xCAT ought to do and the 'installnic' setting and 
modification to 'confignetwork' stops the 'confignetwork' script from 
destroying the config when run as a postbootscript to configure the secondary 
nics (it otherwise doesn't accept that a nic called bondX can be the install 
nic).
I haven't tested that the modified 'confignetwork' is capable of configuring 
the install nic as a bond in its own right. I don't expect that to work and I 
don't think xCAT has anywhere to hold the relevant config without a significant 
rewrite of 'confignetwork'.
This is obviously for a stateful install. I couldn't tell you anything about 
stateless or statelite.
Finally, apologies to all those whose work I have reused without reference in 
the above - I wasn't expecting to be sharing it. I am happy to answer questions 
on it, and especially to be told better ways to do it!
Jon

--
Dr. Jonathan Diprose <j...@well.ox.ac.uk>
 Tel: 01865 287873
Research Computing Manager
Henry Wellcome Building for Genomic Medicine
Roosevelt Drive, Headington, Oxford OX3 7BN
________
From: Vinícius Ferrão [fer...@versatushpc.com.br]
Sent: 17 June 2021 07:13
To: Jon Diprose
Cc: xCAT Users Mailing list
Subject: Re: [xcat-user] /etc/resolv.conf missing nameserver on install nodes

Hi Jon, thanks for your time.

You're absolutely correct regarding the network configuration. The node boots 
up with PXE from a single network card and then it creates the LACP bond. On 
the switch side that was achieved with LACP fallback. So yes, ip and 
nicips.bond0 are the same. It's the single interface that is later on bonded.

Regarding NetworkManager, I thought xCAT was using it to configure its 
network. So is there a way to disable it?

Finally, the confignetwork script works if I add it to the postbootscripts 
section instead of postscripts. In the latter I end up with the server without 
any network. Console login is unavailable because the root password does not 
work either, so I can't check what happened. I don't know why.

Anyway, here's the files:

Bondies first:
[root@login ~]# cat /etc/sysconfig/network-scripts/ifcfg-xcat-bond-bond0
BONDING_OPTS="mode=802.3ad miimon=100"
TYPE=Bond
BONDING_MASTER=yes
HWADDR=
MTU=1500
PROXY_METHOD=none
BROWSER_ONLY=no
BOOTPROTO=none
IPADDR=172.26.255.253
PREFIX=16
DEFROUTE=yes
IPV4_FAILURE_FATAL=no
IPV6INIT=yes
IPV6_AUTOCONF=yes
IPV6_DEFROUTE=yes
IPV6_FAI

Re: [xcat-user] /etc/resolv.conf missing nameserver on install nodes

2021-06-17 Thread Vinícius Ferrão via xCAT-user
E fallback interface).

[root@login ~]# cat /etc/sysconfig/network-scripts/ifcfg-ens1f1np1
TYPE=Ethernet
PROXY_METHOD=none
BROWSER_ONLY=no
BOOTPROTO=dhcp
DEFROUTE=yes
IPV4_FAILURE_FATAL=no
IPV6INIT=yes
IPV6_AUTOCONF=yes
IPV6_DEFROUTE=yes
IPV6_FAILURE_FATAL=no
NAME=ens1f1np1
UUID=dac4a797-99fd-4d7f-92e4-8c0c257e9128
DEVICE=ens1f1np1
ONBOOT=no

[root@login ~]# cat /etc/sysconfig/network-scripts/ifcfg-ens1f0np0
# Generated by parse-kickstart
TYPE=Ethernet
DEVICE=ens1f0np0
UUID=beba2419-7903-4f09-8e61-16b51b66d835
ONBOOT=yes
BOOTPROTO=dhcp
IPV6INIT=yes
PROXY_METHOD=none
BROWSER_ONLY=no
DEFROUTE=yes
IPV4_FAILURE_FATAL=no
IPV6_AUTOCONF=yes
IPV6_DEFROUTE=yes
IPV6_FAILURE_FATAL=no
NAME="System ens1f0np0"

So Jon, what's your suggestion here? Would you mind sharing how you fixed this?

Thank you so much.
Vinícius

PS: routes are messed up too:

[root@login ~]# ip r
default via 172.26.255.254 dev eno1 proto static metric 103 <=== this should 
not be here, this is the WAN interface.
default via 172.26.255.254 dev bond0 proto static metric 300
default via 172.26.255.254 dev bond0.1010 proto static metric 400 <=== this 
should not be here, this is a VLAN...
10.0.255.0/24 dev bond0.1010 proto kernel scope link src 10.0.255.253 metric 400
XXX.XXX.XXX.192/26 dev eno1 proto kernel scope link src 143.106.42.244 metric 
103 <=== this should be the default route.
172.26.0.0/16 dev bond0 proto kernel scope link src 172.26.255.253 metric 300
172.26.255.254 dev eno1 proto static scope link metric 103
172.26.255.254 dev bond0.1010 proto static scope link metric 400
172.27.0.0/16 dev ib0 proto kernel scope link src 172.27.255.253 metric 150
192.168.122.0/24 dev virbr0 proto kernel scope link src 192.168.122.1 linkdown

On 16 Jun 2021, at 05:11, Jon Diprose 
<j...@well.ox.ac.uk> wrote:

Hi Vinícius,
I don't have masses to contribute on this other than that we generally disable 
NetworkManager (why on earth would I want my server dynamically reconfiguring 
its network?) but I am curious about your bonding set-up. Am I right in 
thinking that you are setting up a bond for the primary nic (the one xCAT talks 
to)? It looks like the values for 'ip' and 'nicips.bond0' are the same. I never 
got that to work with the xCAT-supplied postscripts and had to write my own to 
do it, plus specifying some additional install-time kernel params and the 
relevant switch config. So does your way actually generate the correct ifcfg 
files?
To me, your symptoms are consistent with an interface that is still dhcp-ing 
but receiving an empty dns config, or an ifcfg file with an empty "DNS=" param. 
And even with NetworkManager disabled I routinely add "PEERDNS=no" (and 
"DEFROUTE=no") to the nicextraparams.* setting for all secondary nics, though 
it is the route that usually bites me there.
Could you share `ip -o a` and the ifcfg files?
Jon
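
As a rough illustration of the nicextraparams suggestion above: the sketch below only prints a chdef command for review, it does not run it. It assumes the space-separated key=value form for nicextraparams values; the helper name secondary_nic_chdef and the choice of NICs in the example call are illustrative, not taken from Jon's setup.

```shell
# secondary_nic_chdef: print (do not run) a chdef command that adds
# PEERDNS=no and DEFROUTE=no to each NIC named after the node.
# Hypothetical helper; the value format is an assumption to verify
# against the xCAT version in use.
secondary_nic_chdef() {
  node=$1; shift
  printf 'chdef %s' "$node"
  for nic in "$@"; do
    printf ' nicextraparams.%s="PEERDNS=no DEFROUTE=no"' "$nic"
  done
  printf '\n'
}

secondary_nic_chdef login bond0.1010 ib0
```

Printing the command first makes it easy to check which NICs are treated as secondary before anything in the node definition changes.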


--
Dr. Jonathan Diprose <j...@well.ox.ac.uk>
 Tel: 01865 287873
Research Computing Manager
Henry Wellcome Building for Genomic Medicine
Roosevelt Drive, Headington, Oxford OX3 7BN

From: Vinícius Ferrão via xCAT-user [xcat-user@lists.sourceforge.net]
Sent: 16 June 2021 04:15
To: xCAT Users Mailing list
Cc: Vinícius Ferrão
Subject: Re: [xcat-user] /etc/resolv.conf missing nameserver on install nodes

I was able to at least stop /etc/resolv.conf from being overwritten at every 
reboot with the following file:

# cat /etc/NetworkManager/conf.d/90-dns-none.conf
[main]
dns=none

I added this to the synclists and we are good regarding the /etc/resolv.conf 
issue. The conclusion is that NetworkManager was doing something wrong with 
/etc/resolv.conf.

Although that was fixed with a hack, it has a consequence: the hostname of 
the machine is set to localhost.localdomain, and I don't know how to fix it. Is 
there any option in the node table to set the default hostname, so that 
confignetwork can do its job?

# lsdef login
Object name: login
arch=x86_64
bmc=172.25.255.253
bmcpassword=calvin
bmcusername=root
cons=ipmi
consoleenabled=1
currchain=boot
currstate=install ol8.4.0-x86_64-compute
groups=login,all
ip=172.26.255.253
mac=2c:ea:7f:92:aa:d9
mgt=ipmi
netboot=xnba
nicdevices.bond0=ens1f0np0|ens1f1np1
nicdevices.bond0.1010=bond0
nichostnamesuffixes.bond0.1010=-ceph
nicips.ib0=172.27.255.253
nicips.eno1=XXX.XXX.XXX.XXX
nicips.bond0=172.26.255.253
nicips.bond0.1010=10.0.255.253
nicnetworks.ib0=application
nicnetworks.eno1=site
nicnetworks.bond0=management
nicnetworks.bond0.1010=ceph
nictypes.ens1f1np1=ethernet
nictypes.bond0=bond
nictypes.eno1=ethernet
nictypes.ib0=Infiniband
nictypes.bond0.1010=vlan
nictypes.ens1f0np0=ethernet
os=ol8.4.0

postbootscripts=otherpkgs,versatushpc/openpbs-login,versatushpc/fix-ohpc-login

postscripts=syslog,remotesh

[xcat-user] From where next-server is fetched from?

2021-05-21 Thread Vinícius Ferrão via xCAT-user
Hello, I'm running an xCAT development build and makedhcp is failing to properly 
populate next-server in /var/lib/dhcpd/dhcpd.leases:

As an example:

host adano43 {
  dynamic;
  hardware ethernet 00:11:22:33:44:55;
  uid 00:11:22:33:44:55;
  fixed-address 172.26.0.43;
  supersede server.ddns-hostname = "adano43";
  supersede host-name = "adano43";
  if option user-class-identifier = "xNBA" and option client-architecture = 00:00 {
    supersede server.always-broadcast = 01;
    supersede server.filename = "http://${next-server}:80/tftpboot/xcat/xnba/nodes/adano43";
  } elsif option user-class-identifier = "xNBA" and option client-architecture = 00:09 {
    supersede server.filename = "http://${next-server}:80/tftpboot/xcat/xnba/nodes/adano43.uefi";
  } elsif option client-architecture = 00:07 {
    supersede server.filename = "xcat/xnba.efi";
  } elsif option client-architecture = 00:00 {
    supersede server.filename = "xcat/xnba.kpxe";
  } else {
    supersede server.filename = "";
  }
}

So, to figure it out: does anyone know where xCAT gets the value it uses to 
replace next-server? I know that this should be the management node or 
maybe a service node, if I were using one.

Thanks,
V.

___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] Stateful provisioning customization

2021-06-06 Thread Vinícius Ferrão via xCAT-user
Alright there's some gotchas.

* pkgdir can in fact only be used for packages available from the distro itself 
(copycds); perhaps updates would be viable?
* I've changed to a scheme like the one Brian suggested: otherpkgs, but it's 
still confusing.

I wasn't able to use my "dnf reposync" mirrors with otherpkgdir. Instead I 
added them to pkgdir, just like in the last message, and I was able to install 
the packages listed in otherpkglist. So here are my definitions 
right now:

# lsdef -t osimage ol8.4.0-x86_64-install-compute
Object name: ol8.4.0-x86_64-install-compute
imagetype=linux
osarch=x86_64
osdistroname=ol8.4.0-x86_64
osname=Linux
osvers=ol8.4.0
otherpkgdir=/install/post/otherpkgs/ol8.4.0/x86_64
otherpkglist=/install/custom/install/compute.otherpkgs.pkglist

pkgdir=/install/ol8.4.0/x86_64,/install/openhpc-2.2/CentOS_8,/install/epel-8/x86_64
pkglist=/install/custom/install/compute.pkglist
profile=compute
provmethod=install
synclists=/install/custom/install/compute.synclist
template=/opt/xcat/share/xcat/install/ol/compute.ol8.tmpl

/install/post/otherpkgs/ol8.4.0/x86_64 is just an empty folder on which I ran 
createrepo, so that dnf/yum does not break on the first boot.

That's the content of: /install/custom/install/compute.otherpkgs.pkglist:
fping
libconfuse
libunwind
ohpc-base-compute
lmod-ohpc

So it seems OK, but again, I'm not sure if this is correct or not. I was trying 
to keep files on otherpkgdir but xCAT can't create proper repositories because 
I probably messed up with the otherpkgs.pkglist. In the first try I created the 
file with the following contents:
epel-8/x86_64/fping
epel-8/x86_64/libconfuse
epel-8/x86_64/libunwind
openhpc-2.2/CentOS_8/updates/x86_64/ohpc-base-compute
openhpc-2.2/CentOS_8/x86_64/lmod-ohpc

But that was a no go. It just didn't work: xCAT only created a broken 
repo file that merged the EPEL and OpenHPC paths into a single URL.

I still have the issue regarding the online repos, but I just put an rm -f in a 
postscript to "fix" the issue. Definitely not "The Right Way (tm)" to do it.

Thanks again,
Vinícius.

> On 6 Jun 2021, at 00:52, Vinícius Ferrão  wrote:
> 
> Thanks Mark and Brian.
> 
> I'm trying to find my way around, right now I've modified the following:
> 
> ===> observe: pkgdir; pkglist and synclists.
> 
> # lsdef -t osimage ol8.4.0-x86_64-install-compute
> Object name: ol8.4.0-x86_64-install-compute
> imagetype=linux
> osarch=x86_64
> osdistroname=ol8.4.0-x86_64
> osname=Linux
> osvers=ol8.4.0
> otherpkgdir=/install/post/otherpkgs/ol8.4.0/x86_64
> 
> pkgdir=/install/ol8.4.0/x86_64,/install/openhpc-2.2/CentOS_8,/install/epel-8/x86_64
> 
> pkglist=/opt/xcat/share/xcat/install/ol/compute.ol8.pkglist,/install/custom/install/compute.pkglist
> profile=compute
> provmethod=install
> synclists=/install/custom/install/compute.synclist
> template=/opt/xcat/share/xcat/install/ol/compute.ol8.tmpl
> 
> # cat /install/custom/install/compute.pkglist 
> yum-utils
> perl
> fping
> libconfuse
> libunwind
> ohpc-base-compute
> kernel-uek
> lmod-ohpc
> @infiniband
> 
> # cat /install/custom/install/compute.synclist 
> MERGE:
> /etc/passwd -> /etc/passwd
> /etc/group -> /etc/group
> /etc/shadow -> /etc/shadow
> 
> The issue now is that pkglist seems to have been ignored. I think I should 
> have added the extra packages to otherpkgs instead. Right?
> 
> 
> 
> 
> 
> The postscript seems to be ignored too:
> 
> ===> observe: postscripts
> 
> [root@headnode epel-8]# lsdef -t node node01
> Object name: node01
> arch=x86_64
> bmc=172.25.0.1
> bmcpassword=calvin
> bmcusername=root
> cons=ipmi
> consoleenabled=1
> currchain=boot
> currstate=boot
> groups=compute,all
> ip=172.26.0.1
> mac=bc:97:e1:ca:35:10
> mgt=ipmi
> netboot=xnba
> nicips.ib0=172.27.0.1
> nicnetworks.ib0=ib0
> nictypes.ib0=Infiniband
> os=ol8.4.0
> postbootscripts=otherpkgs,confignics
> postscripts=syslog,remoteshell,syncfiles,versatushpc/postinstall
> profile=compute
> provmethod=ol8.4.0-x86_64-install-compute
> serialport=0
> serialspeed=115200
> status=failed
> statustime=06-06-2021 00:11:15
> 
> # cat /install/postscripts/versatushpc/postinstall 
> exec 1> >(logger -s -t xCAT -p local4.info) 2>&1
> 
> # Create directories
> mkdir -p /opt/spack
> mkdir -p /opt/intel
> 
> # Configure limits
> perl -pi -e 's/# End of file/\* soft memlock unlimited\n$&/s' 
> /etc/security/limits.conf
> perl -pi -e 's/# End of file/\* hard memlock u

Re: [xcat-user] Stateful provisioning customization

2021-06-06 Thread Vinícius Ferrão via xCAT-user
nd, you can create any script you want, for 
> example to install extra rpm's after the main os deploys but before the 
> reboot (like say mellanox drivers, that may require a reboot)
> 
> 4.  I don't think so, but again if you need other packages just create a 
> script with a yum command and attach it to the group/node def postbootscript 
> or postscript line (making sure your order is what you want).
> 
> 
> Thanks,
> 
> Brian Joiner
> 
> On 6/4/21 00:30, Vinícius Ferrão via xCAT-user wrote:
> 
> Hello, 
> 
> I'm doing a stateful install right now, and I have some questions for those 
> who use the stateful method. Since I'm already used to stateless provisioning, 
> I'm trying to adapt its concepts to stateful.
> 
> So here we go:
> 
> 1. Can I use syncfiles to issue "updatenode all -F" when needed?
> 
> The idea is to have a custom file with the synclist and run a command 
> similar to: chdef -t osimage -o ol8.4.0-x86_64-install-compute 
> synclists="/install/custom/install/compute.synclist"
> 
> 2. Does otherpkgs work with a stateful profile?
> 
> Can I add otherpkgdir and otherpkglist to -install images? Will it install 
> the packages during the provision phase? Is there any use case for it?
> 
> 3. Where should I do the customization inside the image?
> 
> On stateless I just chroot after "genimage", do whatever I need to do, change 
> confs, enable/disable services, etc., and then "packimage". How can I achieve 
> something similar with stateful nodes?
> 
> 4. Can I have multiple pkglist and otherpkglist files?
> 
> The idea here is to keep the default ones from xCAT untouched and just add 
> additional ones separated by commas in the osimage definition.
> 
> Thanks all.
> 
> PS: I did some reading before, but the only precise information I was able to 
> find covers hierarchical clusters and specific cases like CUDA, so I'm not 
> sure which is "The Right Way (tm)" to achieve the functionality mentioned.
> 
> https://xcat-docs.readthedocs.io/en/stable/advanced/hierarchy/provision/diskful_sn.html
> https://xcat-docs.readthedocs.io/en/stable/advanced/gpu/nvidia/osimage/rhels.html#diskless-images
> https://myxcat.readthedocs.io/en/latest/advanced/networks/infiniband/mlnxofed_ib_install_v2_diskful.html?highlight=Infiniband%20Support
> 
> 
> 





[xcat-user] Stateful provisioning customization

2021-06-04 Thread Vinícius Ferrão via xCAT-user
Hello,

I'm doing a stateful install right now, and I have some questions for those who 
use the stateful method. Since I'm already used to stateless provisioning, I'm 
trying to adapt its concepts to stateful.

So here we go:

1. Can I use syncfiles to issue "updatenode all -F" when needed?

The idea is to have a custom file with the synclist and run a command 
similar to: chdef -t osimage -o ol8.4.0-x86_64-install-compute 
synclists="/install/custom/install/compute.synclist"

2. Does otherpkgs work with a stateful profile?

Can I add otherpkgdir and otherpkglist to -install images? Will it 
install the packages during the provision phase? Is there any use case for it?

3. Where should I do the customization inside the image?

On stateless I just chroot after "genimage", do whatever I need to do, 
change confs, enable/disable services, etc., and then "packimage". How can I 
achieve something similar with stateful nodes?

4. Can I have multiple pkglist and otherpkglist files?

The idea here is to keep the default ones from xCAT untouched and just 
add additional ones separated by commas in the osimage definition.
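
One possible way to keep the stock list untouched, sketched under assumptions: xCAT pkglist files accept an #INCLUDE:<full path># line, as far as I recall from the xCAT documentation, so a custom list can pull in the default one by reference. The helper name write_custom_pkglist and the package names are only examples; please verify the directive against the xCAT version in use.

```shell
# write_custom_pkglist: create a pkglist at the given path that pulls in the
# stock xCAT list via #INCLUDE and then names the extra packages.
# Hypothetical helper; the package names below are examples only.
write_custom_pkglist() {
  target=$1
  cat > "$target" <<'EOF'
#INCLUDE:/opt/xcat/share/xcat/install/ol/compute.ol8.pkglist#
yum-utils
fping
libconfuse
EOF
}

# Example: write_custom_pkglist /install/custom/install/compute.pkglist
```

With this layout the osimage's pkglist attribute would point at the single custom file, and upgrades to the stock list are picked up without copying it.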

Thanks all.

PS: I did some reading before, but the only precise information I was able to 
find covers hierarchical clusters and specific cases like CUDA, so I'm not 
sure which is "The Right Way (tm)" to achieve the functionality 
mentioned.

https://xcat-docs.readthedocs.io/en/stable/advanced/hierarchy/provision/diskful_sn.html
 

https://xcat-docs.readthedocs.io/en/stable/advanced/gpu/nvidia/osimage/rhels.html#diskless-images
 

https://myxcat.readthedocs.io/en/latest/advanced/networks/infiniband/mlnxofed_ib_install_v2_diskful.html?highlight=Infiniband%20Support
 






Re: [xcat-user] From where next-server is fetched from?

2021-05-21 Thread Vinícius Ferrão via xCAT-user
LOL. I feel like an idiot right now.

Hmmm thanks Kilian, you're indeed correct. I now need to understand why it 
wasn't working and started working when I manually overrode the next-server 
variable.
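
To make the point in Kilian's quoted reply concrete: the lease file keeps the literal ${next-server} text, and the substitution happens on the booting client. The sketch below merely imitates that substitution with sed; expand_next_server is a made-up name and the address in the example call is a placeholder, not a value confirmed in the thread.

```shell
# expand_next_server: imitate what the boot client does with the filename it
# receives, replacing the literal ${next-server} with the server address.
# Hypothetical helper for illustration; dhcpd itself does not do this.
expand_next_server() {
  filename=$1
  next_server=$2
  printf '%s\n' "$filename" | sed 's/[$]{next-server}/'"$next_server"'/g'
}

# The address below is a placeholder for whatever next-server the subnet defines.
expand_next_server 'http://${next-server}:80/tftpboot/xcat/xnba/nodes/adano43' 172.26.255.254
```

So an unexpanded ${next-server} in dhcpd.leases is expected and is not by itself a sign that makedhcp failed.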

> On 21 May 2021, at 21:22, Kilian Cavalotti  
> wrote:
> 
> I don't think it needs to be expanded by xCAT in the DHCP lease file.
> dhcpd will literally send the
> "http://${next-server}:80/tftpboot/xcat/xnba/nodes/adano43" string in
> the DHCP reply, and the TFTP/PXE client will interpret this using the
> "next-server" value that is defined at the subnet level in dhcpd.conf.
> 
> Some examples here: https://wiki.alpinelinux.org/wiki/PXE_boot#HOWTO
> 
> Cheers,
> --
> Kilian
> 
> On Fri, May 21, 2021 at 4:51 PM Vinícius Ferrão via xCAT-user
>  wrote:
>> 
>> Hello, I'm running an xCAT development build and makedhcp is failing to properly 
>> populate next-server in /var/lib/dhcpd/dhcpd.leases:
>> 
>> As an example:
>> 
>> host adano43 {
>>   dynamic;
>>   hardware ethernet 00:11:22:33:44:55;
>>   uid 00:11:22:33:44:55;
>>   fixed-address 172.26.0.43;
>>   supersede server.ddns-hostname = "adano43";
>>   supersede host-name = "adano43";
>>   if option user-class-identifier = "xNBA" and option client-architecture = 00:00 {
>>     supersede server.always-broadcast = 01;
>>     supersede server.filename = "http://${next-server}:80/tftpboot/xcat/xnba/nodes/adano43";
>>   } elsif option user-class-identifier = "xNBA" and option client-architecture = 00:09 {
>>     supersede server.filename = "http://${next-server}:80/tftpboot/xcat/xnba/nodes/adano43.uefi";
>>   } elsif option client-architecture = 00:07 {
>>     supersede server.filename = "xcat/xnba.efi";
>>   } elsif option client-architecture = 00:00 {
>>     supersede server.filename = "xcat/xnba.kpxe";
>>   } else {
>>     supersede server.filename = "";
>>   }
>> }
>> 
>> So, to figure it out: does anyone know where xCAT gets the value it uses to 
>> replace next-server? I know that this should be the management node 
>> or maybe a service node, if I were using one.
>> 
>> Thanks,
>> V.
>> 
> 
> 
> 
> -- 
> Kilian
> 
> 


Re: [xcat-user] Problem deploying Dell C6420 from Ubuntu 18.04.05

2021-03-12 Thread Vinícius Ferrão via xCAT-user
Are you using UEFI or Legacy BIOS?

netboot=pxe or netboot=xnba?



> On 12 Mar 2021, at 20:42, Calvin Dodge  wrote:
> 
> We are trying to deploy C6420 nodes with a diskful image.  The
> deployment hangs about 9.5 seconds after the kernel recognizes
> the network and USB devices.
> 
> Installment details:  Head node was deployed using the live server ISO
> ubuntu-18.04.5-live-server-amd64.iso.
> The xcat-go install process created the /install/OS folder using that
> ISO.  But we could not nodeset a node with that osimage name until we
> ran copycds manually with the regular server ISO
> ubuntu-18.04.5-server-amd64.iso.
> 
> We've seen other strangenesses, like xCAT looking for a folder named
> "ubuntu-" when we tried to run genimage to create a diskless image
> 
> Meanwhile, the nodes are using PERC controllers for storage, which
> uses the megaraid_sas kernel module.  When we unpack the initrd used
> in the initial PXE load, we don't find the kernel module there.  Could
> that be the source of our deployment hang?  We see instructions on
> adding modules, but they don't appear to be relevant, because (1) Dell
> doesn't seem to provide a driver disk for the initial kernel
> (5.4.0-22), and (2) the "use RPM" approach is unlikely to work with a
> distribution that doesn't use RPMs.
> 
> We CAN install Ubuntu on the node using the server ISO, so the
> necessary drivers are present there, at least.  So it seems to be an
> xCAT issue.
> 
> Has anyone else encountered this issue?  If not, how can we diagnose
> it, beyond adding (an as yet unfound) megaraid_sas kernel module to
> the initial PXE initrd?
> 
> Calvin Dodge
> 


Re: [xcat-user] Running Copycds on Oracle Linux ISO fails

2021-03-21 Thread Vinícius Ferrão via xCAT-user
Daniel,

I've made a PR to xcat-core so xCAT can properly support Oracle Linux: 
https://github.com/xcat2/xcat-core/pull/6919

You can check it out.

With the demise of CentOS many users don't want CentOS anymore and cannot 
afford RHEL. So as of today, March 2021, the only viable option is Oracle Linux; 
there's no Rocky nor Alma Linux yet.

I can confirm that the patch works and I can even boot with Oracle's notorious 
"Unbreakable Enterprise Kernel", which is nice in an HPC environment.

Regards,


On 1 Mar 2021, at 16:29, Daniel Hilst via xCAT-user <xcat-user@lists.sourceforge.net> wrote:

In fact² this seems to be the right thing to do

copycds -p /install/ol8.3/x86_64 -n ol8.3 OracleLinux8.3.iso

I'm not sure if everything will just work; I'm still testing. Can anyone 
confirm whether this approach is the right way to do it?

From: Daniel Hilst <dan...@versatushpc.com.br>
Sent: Monday, 1 March 2021 14:40
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Subject: RE: Running Copycds on Oracle Linux ISO fails

In fact using -n=centos8.3 works

From: Daniel Hilst
Sent: Monday, 1 March 2021 13:42
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Subject: Running Copycds on Oracle Linux ISO fails

I'm trying to run copycds on OracleLinux ISO and getting this error



Any ideas? Oracle Linux seems to be supported at 
https://xcat-docs.readthedocs.io/en/stable/overview/features.html

Regards,
Daniel


[xcat-user] Docs on how to build xCAT

2021-03-02 Thread Vinícius Ferrão via xCAT-user
Hello,

I’m trying to build xCAT by myself following the instructions on: 
https://xcat-docs.readthedocs.io/en/latest/developers/guides/code/builds.html

But they are so simple that I figured it out before reading it, just run 
./buildcore.sh

The problem is: that’s not working.

Error:  build of the following RPMs failed:  perl-xCAT xCAT-client xCAT-server 
xCAT-test xCAT-buildkit xCAT-SoftLayer xCAT-vlan xCAT-probe xCAT-openbmc-py

Does anyone have the *requirements* to build xCAT? And after the build, how do I 
consume the built packages? Does the build process create a repo? And what about 
xCAT-deps? Is it the same thing?

Thank you all.


Re: [xcat-user] Docs on how to build xCAT

2021-03-02 Thread Vinícius Ferrão via xCAT-user
I figured it out.

In fact there were system (mainly perl) dependencies missing. I had to install 
them:

# dnf -y install perl-Sys-Syslog perl-Pod-Html perl-Time-HiRes perl-Math-BigInt 
perl-Text-Balanced perl-DBI perl-JSON createrepo

The installation was a default EL 8.3 install with Standard tools and 
“Development Tools” package.
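
A quick way to check a build host before starting, offered as a sketch rather than an official prerequisite list: the module names below are simply the Perl modules behind the perl-* packages in the dnf command above (createrepo is a separate tool and is not checked), and missing_perl_modules is a made-up helper name.

```shell
# missing_perl_modules: print each named module that `perl` cannot load.
# Hypothetical helper; it also reports every module if perl itself is absent.
missing_perl_modules() {
  for module in "$@"; do
    perl -M"$module" -e 1 >/dev/null 2>&1 || printf '%s\n' "$module"
  done
}

missing_perl_modules Sys::Syslog Pod::Html Time::HiRes Math::BigInt Text::Balanced DBI JSON
```

An empty result suggests the Perl side is covered; anything printed maps back to the corresponding perl-* package.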

Regards,


On 3 Mar 2021, at 01:09, Vinícius Ferrão <fer...@versatushpc.com.br> wrote:

Hi Mark, thanks for the RPMSIGN=0 tip.

It helped but definitely something is still missing, probably dependencies to 
do a proper compilation.

Here are some issues:

1.
Can't locate Sys/Syslog.pm in @INC (you may need to install the Sys::Syslog 
module) (@INC contains: /usr/lib/perl . /usr/local/lib64/perl5 
/usr/local/share/perl5 /usr/lib64/perl5/vendor_perl 
/usr/share/perl5/vendor_perl /usr/lib64/perl5 /usr/share/perl5) at 
xCAT/MsgUtils.pm line 13.
BEGIN failed--compilation aborted at xCAT/MsgUtils.pm line 13.

2.
Can't locate Pod/Html.pm in @INC (you may need to install the Pod::Html module) 
(@INC contains: /usr/local/lib64/perl5 /usr/local/share/perl5 
/usr/lib64/perl5/vendor_perl /usr/share/perl5/vendor_perl /usr/lib64/perl5 
/usr/share/perl5) at ./xpod2man line 13.
BEGIN failed--compilation aborted at ./xpod2man line 13.

3.
Can't locate Time/HiRes.pm in @INC (you may need to install the Time::HiRes 
module) (@INC contains: /usr/local/lib64/perl5 /usr/local/share/perl5 
/usr/lib64/perl5/vendor_perl /usr/share/perl5/vendor_perl /usr/lib64/perl5 
/usr/share/perl5) at ./share/xcat/tools/detect_dhcpd line 3.

Are those dependencies that should have been built by xcat-deps, or should I 
add them through the package manager (yum/dnf in my case)?

Thanks.


On 2 Mar 2021, at 13:42, Mark Gurevich <gurev...@us.ibm.com> wrote:


Once you clone xcat-core git repository, try running:

./buildcore RPMSIGN=0

It should turn off rpm signing which uses a local directory to find keys.

A core-rpms-snap.tar.bz2 file should be generated somewhere under 
$HOME/xcatbuild


Dependencies are built by the builddep.sh script, but it uses a hardcoded local 
directory to find source RPMs. You will need to download those RPMs from 
xcat.org and point to that location in the script.

Mark Gurevich
Poughkeepsie Development Lab
HPC Software Development - xCAT

"If we knew what it was we were doing, it would not be called research, would 
it?"
--Albert Einstein



From:  "Vinícius Ferrão via xCAT-user" <xcat-user@lists.sourceforge.net>
To:  xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Cc:  "Vinícius Ferrão" <fer...@versatushpc.com.br>
Date:  03/02/2021 10:38 AM
Subject:  [EXTERNAL] [xcat-user] Docs on how to build xCAT





Hello,

I’m trying to build xCAT by myself following the instructions on: 
https://xcat-docs.readthedocs.io/en/latest/developers/guides/code/builds.html

But they are so simple that I figured it out before reading it, just run 
./buildcore.sh

The problem is: that’s not working.

Error: build of the following RPMs failed: perl-xCAT xCAT-client xCAT-server 
xCAT-test xCAT-buildkit xCAT-SoftLayer xCAT-vlan xCAT-probe xCAT-openbmc-py

Does anyone have the *requirements* to build xCAT? And after the build, how do I 
consume the built packages? Does the build process create a repo? And what about 
xCAT-deps? Is it the same thing?

Thank you all.





Re: [xcat-user] Docs on how to build xCAT

2021-03-02 Thread Vinícius Ferrão via xCAT-user
Hi Mark, thanks for the RPMSIGN=0 tip.

It helped but definitely something is still missing, probably dependencies to 
do a proper compilation.

Here are some issues:

1.
Can't locate Sys/Syslog.pm in @INC (you may need to install the Sys::Syslog 
module) (@INC contains: /usr/lib/perl . /usr/local/lib64/perl5 
/usr/local/share/perl5 /usr/lib64/perl5/vendor_perl 
/usr/share/perl5/vendor_perl /usr/lib64/perl5 /usr/share/perl5) at 
xCAT/MsgUtils.pm line 13.
BEGIN failed--compilation aborted at xCAT/MsgUtils.pm line 13.

2.
Can't locate Pod/Html.pm in @INC (you may need to install the Pod::Html module) 
(@INC contains: /usr/local/lib64/perl5 /usr/local/share/perl5 
/usr/lib64/perl5/vendor_perl /usr/share/perl5/vendor_perl /usr/lib64/perl5 
/usr/share/perl5) at ./xpod2man line 13.
BEGIN failed--compilation aborted at ./xpod2man line 13.

3.
Can't locate Time/HiRes.pm in @INC (you may need to install the Time::HiRes 
module) (@INC contains: /usr/local/lib64/perl5 /usr/local/share/perl5 
/usr/lib64/perl5/vendor_perl /usr/share/perl5/vendor_perl /usr/lib64/perl5 
/usr/share/perl5) at ./share/xcat/tools/detect_dhcpd line 3.

Are those dependencies that should have been built by xcat-deps, or should I 
add them through the package manager (yum/dnf in my case)?

Thanks.


On 2 Mar 2021, at 13:42, Mark Gurevich <gurev...@us.ibm.com> wrote:


Once you clone xcat-core git repository, try running:

./buildcore RPMSIGN=0

It should turn off rpm signing which uses a local directory to find keys.

A core-rpms-snap.tar.bz2 file should be generated somewhere under 
$HOME/xcatbuild


Dependencies are built by the builddep.sh script, but it uses a hardcoded local 
directory to find source RPMs. You will need to download those RPMs from 
xcat.org and point to that location in the script.

Mark Gurevich
Poughkeepsie Development Lab
HPC Software Development - xCAT

"If we knew what it was we were doing, it would not be called research, would 
it?"
--Albert Einstein



From:  "Vinícius Ferrão via xCAT-user" <xcat-user@lists.sourceforge.net>
To:  xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Cc:  "Vinícius Ferrão" <fer...@versatushpc.com.br>
Date:  03/02/2021 10:38 AM
Subject:  [EXTERNAL] [xcat-user] Docs on how to build xCAT





Hello,

I’m trying to build xCAT by myself following the instructions on: 
https://xcat-docs.readthedocs.io/en/latest/developers/guides/code/builds.html

But they are so simple that I figured it out before reading it, just run 
./buildcore.sh

The problem is: that’s not working.

Error: build of the following RPMs failed: perl-xCAT xCAT-client xCAT-server 
xCAT-test xCAT-buildkit xCAT-SoftLayer xCAT-vlan xCAT-probe xCAT-openbmc-py

Does anyone have the *requirements* to build xCAT? And after the build, how do I 
consume the built packages? Does the build process create a repo? And what about 
xCAT-deps? Is it the same thing?

Thank you all.





[xcat-user] Service Nodes requirements and ratios

2021-04-16 Thread Vinícius Ferrão via xCAT-user
Hello,

I'm sorry to ask this repeated question in the mailing list, but I wasn't able 
to find a proper answer with the DuckDuckGo-fu and looking in the mail list 
archive.

What is the recommended ratio of compute nodes per service node, and when 
should I start considering service nodes instead of leaving everything on the headnode?

Thanks,
V.



Re: [xcat-user] vmware server as XCAT management node?

2021-08-06 Thread Vinícius Ferrão via xCAT-user
Yes, it is totally possible, and if done right there are actually no drawbacks.

For computing nodes the scenario is different.

On 6 Aug 2021, at 07:11, Molina, Luis <luis.molina.p...@dxc.com> wrote:

Hello all,

My question is regarding virtual nodes. I have read here  
https://github.com/xcat2/xcat-core/issues/5673:

“…All supported scenario are listed here and all development environments 
based on physical machine, NOT support virtual machine"

So, is it possible that a VMware virtual server acts as xCAT management server, 
managing physical servers (clients) ?

Our systems are RHEL 7.8 (management node and clients)

Thanks

Regards

Luis Molina









Re: [xcat-user] ib.rhels8.x86_64.pkglist

2022-01-05 Thread Vinícius Ferrão via xCAT-user
Thanks all for the answers.

Thomas and Jon, very nice answers indeed. My question was unrelated to the MLNX 
OFED repository, but your answers cleared up some doubts I had about that repo. 
I never fully understood why things ended up broken when using it. The repo 
should be the way to go, but Mellanox is unable to cope with it.

Thanks for the --kmp tip too. I will figure out the best way.

Best regards,
Vinícius.

On 5 Jan 2022, at 10:17, Nathan A Besaw <bes...@us.ibm.com> wrote:


Here is the package list we are using for MOFED on RHEL8 ppc64le:

# cat /install/xcat_clusters/osimage/rhels/common/pkglist/rhels84/ib.pkglist
# taken from /opt/xcat/share/xcat/ib/netboot/rh/ib.rhels7.ppc64le.pkglist
# Added to by installing MOFED 4.7 on RH8.1
atk
autofs
cairo
crash
createrepo
dracut
ed
emacs
ethtool
gcc
gcc-gfortran
gdb
gdb-headless
git
gtk2
kernel-devel
kernel-rpm-macros
kernel-tools
kernel-tools-libs
kexec-tools
ksh
libcgroup
libcgroup-tools
libhugetlbfs
libnl3
libtool
libtool-ltdl
lsof
m4
make
mutt
numactl
numactl-devel
numactl-libs
pciutils
pciutils-libs
perf
psmisc
python2
python36-devel
redhat-rpm-config
rpm-build
sendmail
sudo
tcl
tcsh
tk
vim-minimal
yum

I think it should work as-is on x86_64, but I have not tried to verify it.

Happy New Year!



[xcat-user] ib.rhels8.x86_64.pkglist

2022-01-04 Thread Vinícius Ferrão via xCAT-user
Hello,

I'm trying to use the OFED install script from xCAT, but some template files 
seem to be missing. The first one is the one in the title:

/opt/xcat/share/xcat/ib/netboot/rh/ib.rhels8.x86_64.pkglist 

Does anyone have a list of the required packages for EL8 on x86_64?

Thanks all and happy new year.

PS: Is there anything else I should be aware of with the mlnxofed_ib_install 
script?





[xcat-user] genimage on old CentOS 7.4

2021-10-28 Thread Vinícius Ferrão via xCAT-user
Hello, I'm trying to update an old CentOS 7.4 cluster, but genimage is failing 
during the dracut phase:

# chroot /install/netboot/centos7.4/x86_64/compute/rootimg dracut  -f /tmp/initrd.2501.gz 3.10.0-1160.45.1.el7.x86_64
No '/dev/log' or 'logger' included for syslog logging
Turning off host-only mode: '/sys' is not mounted!
Turning off host-only mode: '/proc' is not mounted!
Turning off host-only mode: '/run' is not mounted!
Turning off host-only mode: '/dev' is not mounted!
dracut module 'xcat' cannot be found or installed.

I've looked around the web and found a lot of similar issues but no fixes at 
all. Is there anything that I can do instead of running copycds again and 
customizing everything from the ground up?

Thanks all,
V.
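
For context on the error above: dracut reports a module as missing when it finds no matching directory under usr/lib/dracut/modules.d, where module directories are named with a two-digit prefix followed by the module name. A minimal sketch for checking that inside the image root, assuming (not confirmed in this thread) that the xCAT module is installed there like any other dracut module; the function name is made up for illustration:

```shell
#!/bin/sh
# Hypothetical helper: report whether a dracut module directory named
# "<two digits><module>" exists inside an image root.
# Assumption: the module is expected under usr/lib/dracut/modules.d,
# like any other dracut module.
has_dracut_module() {
    rootimg=$1
    module=$2
    for dir in "$rootimg"/usr/lib/dracut/modules.d/[0-9][0-9]"$module"; do
        # If the glob matched nothing, $dir is the literal pattern and -d fails.
        if [ -d "$dir" ]; then
            echo "found: $dir"
            return 0
        fi
    done
    echo "missing: no dracut module '$module' under $rootimg"
    return 1
}
```

It could be called as `has_dracut_module /install/netboot/centos7.4/x86_64/compute/rootimg xcat`. A "missing" result would only narrow the problem down to the module never having been copied into the image; it does not say why.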


Re: [xcat-user] genimage on old CentOS 7.4

2021-10-28 Thread Vinícius Ferrão via xCAT-user
Hi Mark, management is running CentOS 7.9 (upgraded from 7.4). I also upgraded 
xCAT to the latest release.

[root@headnode ~]# cat /etc/redhat-release
CentOS Linux release 7.9.2009 (Core)

[root@headnode ~]# rpm -qa xCAT
xCAT-2.16.2-snap202105201708.x86_64

Thanks.

On 28 Oct 2021, at 11:45, Mark Gurevich <gurev...@us.ibm.com> wrote:


What version of xCAT are you running?
What is the OS of the management node?

Mark Gurevich
Poughkeepsie Development Lab
HPC Software Development - xCAT

"If we knew what it was we were doing, it would not be called research, would 
it?"
--Albert Einstein





Re: [xcat-user] genimage on old CentOS 7.4

2021-10-28 Thread Vinícius Ferrão via xCAT-user
Unfortunately not.

Well, so the “right thing” would be to run copycds again. I’m not sure if 
downgrading now would be an option since the headnode is already running 7.9.

Inside the images I went “YOLO” and ran “yum -u update” against public 
repositories.

Thanks, Mark. If there’s any other idea, please let me know.

Sent from my iPhone

On 28 Oct 2021, at 16:17, Mark Gurevich wrote:



Have you tried genimage on the earlier version of xCAT, from before you 
upgraded?
Support for RHEL 7.4 started with xCAT 2.13; perhaps that version will work 
better for you.
The current version, xCAT 2.16, has not been tested with 7.4.

Mark Gurevich
Poughkeepsie Development Lab
HPC Software Development - xCAT

"If we knew what it was we were doing, it would not be called research, would 
it?"
--Albert Einstein





Re: [xcat-user] Confluent 3.3 is released, new stateless implementation

2022-01-05 Thread Vinícius Ferrão via xCAT-user
Jarrod, this seems to bridge the gap between xCAT and Confluent.

Stateless has been a long-missing feature in Confluent. I'll definitely give it 
a try when I have some spare time.

Is there any hot migration path available (without reinstalling the entire 
cluster)?

Thanks.

On 5 Jan 2022, at 15:38, Jarrod Johnson <jjohns...@lenovo.com> wrote:

See https://hpc.lenovo.com/users/hpc/update/2021/12/15/21crelease.html for 
details. The big new thing is a new stateless implementation. Of some relevance 
to recent discussions on this list, there’s no ‘geninitrd’, instead the natural 
Dracut in the image builds the initramfs, making for easier customization, or 
at least customization that’s the same for diskless as it would be for diskful.




[xcat-user] Using https pkgdir with rhels8.5.0

2022-01-13 Thread Vinícius Ferrão via xCAT-user
Hello, I'm trying to run genimage with rhels8.5.0 and I've added some https 
repos to the pkgdir attribute of the image definition:

[root@headnode repos]# lsdef -t osimage rhels8.5.0-x86_64-netboot-compute
Object name: rhels8.5.0-x86_64-netboot-compute
exlist=/opt/xcat/share/xcat/netboot/rh/compute.rhels8.x86_64.exlist
imagetype=linux
osarch=x86_64
osdistroname=rhels8.5.0-x86_64
osname=Linux
osvers=rhels8.5.0
otherpkgdir=/install/post/otherpkgs/rhels8.5.0/x86_64
otherpkglist=/install/custom/netboot/compute.otherpkglist
permission=755

pkgdir=/install/rhels8.5.0/x86_64,https://cdn.redhat.com/content/dist/rhel8/8/x86_64/baseos/os,https://cdn.redhat.com/content/dist/rhel8/8/x86_64/appstream/os,https://cdn.redhat.com/content/dist/rhel8/8/x86_64/codeready-builder/os,https://download.fedoraproject.org/pub/epel/8/Everything/x86_64,https://download.fedoraproject.org/pub/epel/8/Modular/x86_64,http://repos.openhpc.community/OpenHPC/2/CentOS_8,http://repos.openhpc.community/OpenHPC/2/updates/CentOS_8
pkglist=/opt/xcat/share/xcat/netboot/rh/compute.rhels8.x86_64.pkglist

postinstall=/opt/xcat/share/xcat/netboot/rh/compute.rhels8.x86_64.postinstall,/install/custom/netboot/compute.postinstall
profile=compute
provmethod=netboot
rootimgdir=/install/netboot/rhels8.5.0/x86_64/compute
synclists=/install/custom/netboot/compute.synclists

But when I run genimage there are some issues with curl. It seems something is 
broken with the certificate chain:

Errors during downloading metadata for repository 'rhels8.5.0-x86_64-2':
  - Curl error (60): Peer certificate cannot be authenticated with given CA certificates for https://cdn.redhat.com/content/dist/rhel8/8/x86_64/baseos/os/repodata/repomd.xml [SSL certificate problem: self signed certificate in certificate chain]
Error: Failed to download metadata for repo 'rhels8.5.0-x86_64-2': Cannot download repomd.xml: Cannot download repodata/repomd.xml: All mirrors were tried
rhels8.5.0-x86_64-3 0.0  B/s |   0  B 00:00
Errors during downloading metadata for repository 'rhels8.5.0-x86_64-3':
  - Curl error (60): Peer certificate cannot be authenticated with given CA certificates for https://cdn.redhat.com/content/dist/rhel8/8/x86_64/appstream/os/repodata/repomd.xml [SSL certificate problem: self signed certificate in certificate chain]
Error: Failed to download metadata for repo 'rhels8.5.0-x86_64-3': Cannot download repomd.xml: Cannot download repodata/repomd.xml: All mirrors were tried
rhels8.5.0-x86_64-4 0.0  B/s |   0  B 00:00
Errors during downloading metadata for repository 'rhels8.5.0-x86_64-4':
  - Curl error (60): Peer certificate cannot be authenticated with given CA certificates for https://cdn.redhat.com/content/dist/rhel8/8/x86_64/codeready-builder/os/repodata/repomd.xml [SSL certificate problem: self signed certificate in certificate chain]
Error: Failed to download metadata for repo 'rhels8.5.0-x86_64-4': Cannot download repomd.xml: Cannot download repodata/repomd.xml: All mirrors were tried
rhels8.5.0-x86_64-5 7.5 MB/s |  11 MB 00:01
rhels8.5.0-x86_64-6 1.1 MB/s | 979 kB 00:00
rhels8.5.0-x86_64-7 1.7 MB/s | 3.1 MB 00:01
rhels8.5.0-x86_64-8 1.5 MB/s | 2.6 MB 00:01
otherpkgs1  0.0  B/s |   0  B 00:00
Errors during downloading metadata for repository 'otherpkgs1':
  - Curl error (37): Couldn't read a file:// file for file:///install/post/otherpkgs/rhels8.5.0/x86_64/repodata/repomd.xml [Couldn't open file /install/post/otherpkgs/rhels8.5.0/x86_64/repodata/repomd.xml]
Error: Failed to download metadata for repo 'otherpkgs1': Cannot download repomd.xml: Cannot download repodata/repomd.xml: All mirrors were tried

Is there any fix for these curl certificate chain errors?

Thank you.
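
For reference, cdn.redhat.com presents a chain signed by Red Hat's own CA rather than a public one, and it expects an entitlement client certificate. On a subscribed RHEL host the generated redhat.repo sets sslcacert, sslclientcert and sslclientkey for that reason, while a bare https URL in pkgdir carries none of them. A hedged sketch of the repo stanza such a URL would need; whether genimage can be made to use it is not confirmed here, and the function name and the certificate and key paths passed in are placeholders:

```shell
#!/bin/sh
# Hypothetical helper: print a dnf repo stanza for a Red Hat CDN URL.
# /etc/rhsm/ca/redhat-uep.pem is where subscription-manager keeps the Red Hat
# CA on a registered host; the client certificate and key are caller-supplied.
cdn_repo_stanza() {
    repo_id=$1
    base_url=$2
    client_cert=$3
    client_key=$4
    cat <<EOF
[$repo_id]
name=$repo_id
baseurl=$base_url
enabled=1
sslverify=1
sslcacert=/etc/rhsm/ca/redhat-uep.pem
sslclientcert=$client_cert
sslclientkey=$client_key
EOF
}
```

An alternative that sidesteps the question would be to mirror the CDN repositories onto the headnode from a subscribed machine and list the local directories in pkgdir instead.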



Re: [xcat-user] Versionlock a given package in stateless compute

2022-02-01 Thread Vinícius Ferrão via xCAT-user
Hi guys, still on this matter. As of today, MLNX OFED is becoming a pain to 
maintain with xCAT, since perftest from the OS needs libefa.so.1, which is 
unavailable with MLNX OFED.

If we have a stateless image, we cannot run updatenode -S to deploy 
packages, since dnf will be in a broken state due to the perftest requirements.

A solution would be to blacklist perftest from the OS, which is basically the 
same question as in this thread.

So if there’s no way to version lock, is there at least a way to blacklist 
packages?

Thanks.
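
One generic dnf mechanism for this is the `exclude=` option in the [main] section of dnf.conf. A minimal sketch that writes it into the image root, for example from a postinstall script; the function name is invented, and it is an assumption that dnf.conf in the image contains only a [main] section and that updatenode -S honors the image's dnf.conf:

```shell
#!/bin/sh
# Hypothetical helper: add "exclude=<pkg>" to dnf.conf inside an image root
# unless that exact line is already present.
# Assumption: dnf.conf holds only a [main] section, so appending lands there.
exclude_in_image() {
    conf=$1/etc/dnf/dnf.conf
    pkg=$2
    mkdir -p "$(dirname "$conf")"
    # Create a minimal dnf.conf if the image does not have one yet.
    [ -f "$conf" ] || printf '[main]\n' > "$conf"
    if ! grep -qxF "exclude=$pkg" "$conf"; then
        printf 'exclude=%s\n' "$pkg" >> "$conf"
    fi
}
```

Usage would look like `exclude_in_image /install/netboot/rhels8.5.0/x86_64/compute/rootimg perftest`. The check before appending keeps repeated genimage runs from stacking duplicate lines.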


[xcat-user] Versionlock a given package in stateless compute

2022-01-28 Thread Vinícius Ferrão via xCAT-user
Hello,

I would like to know if there’s a way to versionlock a given package in a 
stateless compute environment.

Specifically, I would like to pin the redhat-release package version. I tried 
adding the pinned versions to a pkglist file and running genimage, but that was 
a no go. It just ignored the older packages and installed the newest ones.

[root@headnode compute]# lsdef -t osimage rhels8.5.0-x86_64-netboot-compute -i pkglist
Object name: rhels8.5.0-x86_64-netboot-compute

pkglist=/opt/xcat/share/xcat/netboot/rh/compute.rhels8.x86_64.pkglist,/install/custom/netboot/compute.pkglist

cat /install/custom/netboot/compute.pkglist
kernel-4.18.0-305.25.1.el8_4
kernel-core-4.18.0-305.25.1.el8_4
kernel-devel-4.18.0-305.25.1.el8_4
kernel-modules-4.18.0-305.25.1.el8_4
kernel-modules-extra-4.18.0-305.25.1.el8_4
kernel-headers-4.18.0-305.25.1.el8_4
redhat-release-8.4

During the genimage process it downloaded and installed the correct package, 
but it was updated later on:


 Package               Arch    Version                Repository            Size

Installing:
 kernel                x86_64  4.18.0-348.12.2.el8_5  rhels8.5.0-x86_64-2  7.0 M
 kernel-core           x86_64  4.18.0-348.12.2.el8_5  rhels8.5.0-x86_64-2   38 M
 kernel-devel          x86_64  4.18.0-348.12.2.el8_5  rhels8.5.0-x86_64-2   20 M
 kernel-modules        x86_64  4.18.0-348.12.2.el8_5  rhels8.5.0-x86_64-2   30 M
 kernel-modules-extra  x86_64  4.18.0-348.12.2.el8_5  rhels8.5.0-x86_64-2  7.7 M
Upgrading:
 kernel-headers        x86_64  4.18.0-348.12.2.el8_5  rhels8.5.0-x86_64-2  8.3 M
 redhat-release        x86_64  8.5-0.8.el8            rhels8.5.0-x86_64-1   44 k
Transaction Summary

Install  5 Packages
Upgrade  2 Packages
Total size: 110 M
Downloading Packages:
Running transaction check
Transaction check succeeded.
Running transaction test
Transaction test succeeded.
Running transaction
  Preparing        :                                                     1/1
  Running scriptlet: kernel-core-4.18.0-348.12.2.el8_5.x86_64            1/1
  Installing       : kernel-core-4.18.0-348.12.2.el8_5.x86_64            1/9
  Running scriptlet: kernel-core-4.18.0-348.12.2.el8_5.x86_64            1/9
  Installing       : kernel-modules-4.18.0-348.12.2.el8_5.x86_64         2/9
  Running scriptlet: kernel-modules-4.18.0-348.12.2.el8_5.x86_64         2/9
  Installing       : kernel-4.18.0-348.12.2.el8_5.x86_64                 3/9
  Installing       : kernel-modules-extra-4.18.0-348.12.2.el8_5.x86_64   4/9
  Running scriptlet: kernel-modules-extra-4.18.0-348.12.2.el8_5.x86_64   4/9
  Upgrading        : kernel-headers-4.18.0-348.12.2.el8_5.x86_64         5/9
  Upgrading        : redhat-release-8.5-0.8.el8.x86_64                   6/9
  Installing       : kernel-devel-4.18.0-348.12.2.el8_5.x86_64           7/9
  Running scriptlet: kernel-devel-4.18.0-348.12.2.el8_5.x86_64           7/9
  Cleanup          : kernel-headers-4.18.0-305.25.1.el8_4.x86_64         8/9
  Cleanup          : redhat-release-8.4-0.6.el8.x86_64                   9/9
  Running scriptlet: kernel-core-4.18.0-348.12.2.el8_5.x86_64            9/9
dracut: No '/dev/log' or 'logger' included for syslog logging
dracut: Turning off host-only mode: '/run' is not mounted!
dracut: Turning off host-only mode: '/dev' is not mounted!
  Running scriptlet: redhat-release-8.4-0.6.el8.x86_64                   9/9
  Verifying        : kernel-4.18.0-348.12.2.el8_5.x86_64                 1/9
  Verifying        : kernel-devel-4.18.0-348.12.2.el8_5.x86_64           2/9
  Verifying        : kernel-modules-4.18.0-348.12.2.el8_5.x86_64         3/9
  Verifying        : kernel-modules-extra-4.18.0-348.12.2.el8_5.x86_64   4/9
  Verifying        : kernel-core-4.18.0-348.12.2.el8_5.x86_64            5/9
  Verifying        : redhat-release-8.5-0.8.el8.x86_64                   6/9
  Verifying        : redhat-release-8.4-0.6.el8.x86_64                   7/9
  Verifying        : kernel-headers-4.18.0-348.12.2.el8_5.x86_64         8/9
  Verifying        : kernel-headers-4.18.0-305.25.1.el8_4.x86_64         9/9
Installed products updated.
Upgraded:
  kernel-headers-4.18.0-348.12.2.el8_5.x86_64 redhat-release-8.5-0.8.el8.x86_64
Installed:
  kernel-4.18.0-348.12.2.el8_5.x86_64   
  kernel-core-4.18.0-348.12.2.el8_5.x86_64  
  kernel-devel-4.18.0-348.12.2.el8_5.x86_64 
  kernel-modules-4.18.0-348.12.2.el8_5.x86_64   
  kernel-modules-extra-4.18.0-348.12.2.el8_5.x86_64 
Complete!

Any ideas?

Thanks.
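
Not a fix, but a way to confirm after each genimage run whether the pins held: compare the custom pkglist against the packages actually installed in the image. A minimal sketch, assuming the installed list was captured beforehand with something like `rpm --root <rootimg> -qa > installed.txt`; the function name and file names are illustrative only:

```shell
#!/bin/sh
# Hypothetical helper: print pinned entries that are NOT installed.
#   $1: pkglist-style file with one name-version[-release] pin per line
#   $2: file holding `rpm -qa` output for the image
# A pin counts as satisfied when some installed package starts with the pin
# followed by "." or "-", so "redhat-release-8.4" matches
# "redhat-release-8.4-0.6.el8.x86_64".
missing_pins() {
    pins=$1
    installed=$2
    status=0
    while IFS= read -r pin; do
        # Skip blank lines and comments.
        case $pin in ''|'#'*) continue ;; esac
        if ! awk -v p="$pin" '
                index($0, p ".") == 1 || index($0, p "-") == 1 { found = 1 }
                END { exit !found }' "$installed"; then
            echo "$pin"
            status=1
        fi
    done < "$pins"
    return $status
}
```

With the transaction shown above, `missing_pins /install/custom/netboot/compute.pkglist installed.txt` would be expected to print the 4.18.0-305.25.1.el8_4 kernel entries and redhat-release-8.4, since those were replaced or never installed.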




Re: [xcat-user] Announcement: xCAT Project End-Of-Life planned for December 1, 2023

2023-09-01 Thread Vinícius Ferrão via xCAT-user
Really sad news.

Sent from my iPhone

On 1 Sep 2023, at 13:52, Nathan A Besaw via xCAT-user wrote:



Mark Gurevich, Peter Wong, and I have been the primary xCAT maintainers for the 
past few years. This year, we have moved on to new roles unrelated to xCAT and 
can no longer continue to support the project. As a result, we plan to archive 
the project on December 1, 2023. xCAT 2.16.5, released on March 7, 2023, is our 
final planned release.

We would consider transitioning responsibility for the project to a new group 
of maintainers if members of the xCAT community can develop a viable proposal 
for future maintenance.

Thank you all for your support of the project over the past 20+ years.




Re: [xcat-user] Error during os provisioning

2022-07-25 Thread Vinícius Ferrão via xCAT-user
Usually those compat tables are BS.

They only list the hardware that was actually tested. Other hardware usually 
works anyway, so this is probably another issue.

But let me ask: are you using HW RAID? Or any HBA? Are those SAS2? If yes, this 
may be the issue, since support for SAS2 and older controllers has been removed 
from EL8.

Regards,

On 25 Jul 2022, at 16:51, SOPORTE MODEMAT <soporte.mode...@epn.edu.ec> wrote:

Hi Vinicius

Thank you for your reply.

After a lot of research, I discovered that the ProLiant BL460c G7 servers are 
compatible only up to CentOS 7; this is the source of the information: 
https://access.redhat.com/ecosystem/hardware/941513. I wonder if there is a 
workaround to make them work with CentOS 8 in some way.

Here is the post where I asked for help too, and the actions 
taken: https://www.linuxquestions.org/questions/showthread.php?p=6368728#post6368728

Kind regards.
Soporte


From: Vinícius Ferrão <fer...@versatushpc.com.br>
Sent: 25 July 2022 10:16
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Cc: SOPORTE MODEMAT <soporte.mode...@epn.edu.ec>
Subject: Re: [xcat-user] Error during os provisioning

I don’t think this is the real issue, because sda should not have a multipath. 
And the return was OK…

Is there something thrown on /var/log/xCAT that caught your attention?

Do you have console redirection enabled? I’ve seen issues like that before. 
What is the kernel line for the boot on the node?

Regards.

Sent from my iPhone


On 25 Jul 2022, at 12:10, SOPORTE MODEMAT via xCAT-user 
<xcat-user@lists.sourceforge.net> wrote:

Hi Guys.

Please help me try to solve this error that appears during OS provisioning of 
Centos 8.5 2011. The installation gets stuck with this error: “Started cancel 
waiting for multipath siblings of sda”, as shown in the image below:



The xcat version that I have is the 2.16.4, os: Centos 8.5 2011.

Thank you in advance for your help.

Kind regards

Soporte


Re: [xcat-user] Network boot Memtest with xCAT

2023-02-17 Thread Vinícius Ferrão via xCAT-user
There’s a reemerged version of Memtest that allows UEFI network booting and PXE 
chainloading. So I might try it.

It’s available on memtest.org (it’s no longer the same as memtest.com; they 
diverged).

I will take a look and report back. Let’s see…

Sent from my iPhone

On 17 Feb 2023, at 13:08, Nathan A Besaw wrote:


Hi Vinícius,

There is no built-in feature in xCAT to boot Memtest.

I tried to get it to boot by manually copying files into tftpboot and modifying 
the boot file for my test machine,
following roughly the steps described here:
https://sourceforge.net/p/xcat/mailman/message/34425131/
But I had the same results as described in that thread. Maybe something to do 
with the graphics mode? I'm not sure.

The next thing I would suggest to try is to roughly follow these steps 
described for iPXE:
https://ipxe.org/appnote/memtest

xNBA is based on iPXE, so the same process should work, but those instructions 
are from 2015, so they might not be complete in 2023.

If you get a chance to try it out, report back on how it goes.




[xcat-user] Network boot Memtest with xCAT

2023-02-16 Thread Vinícius Ferrão via xCAT-user
Hello, I was wondering if there's a way to easily boot Memtest 
(https://www.memtest.org/) from xCAT.

Has anyone already achieved this?

Is there any idea on how to do this? Theoretically I should use something 
like: netboot=memtest; but that's probably unspecified.

Thanks.



[xcat-user] The state of Fedora support

2023-03-29 Thread Vinícius Ferrão via xCAT-user
Hello guys, does anyone have any information on Fedora support? Is it 
discontinued and should no longer be used, or are there plans for Fedora 
support?

Thanks.



Re: [xcat-user] [External] BitTorrent distribution of stateless images with xCAT interesting to anyone?

2023-04-01 Thread Vinícius Ferrão via xCAT-user
Lenovo’s provisioner:

hpc.lenovo.com (Downloads)


Sent from my iPhone

On 1 Apr 2023, at 10:41, Tomer Shachaf wrote:


Can anybody explain to me what Confluent is?

Regards,

Tomer Shachaf | Integration and Infrastructure Engineer | Integration and 
Infrastructure Division | Matrix | Mobile 054-2686841 |
tomers...@matrix.co.il | 
www.matrix.co.il


On 29 Mar 2023, at 20:40, Jarrod Johnson wrote:


For reference, I did a couple of bittorrent style diskless as a project years 
ago.  Didn't ever mainstream it though.  In the end the performance uplift 
wasn't as noticeable as one might have guessed, for an environment where the 
boot servers had at least 10G.

Note that nowadays I've moved my development attention to confluent.  Also 
note, confluent never pushes private ssh keys (node to node ssh when enabled is 
facilitated through SSH certificate authority and helper to generate 
shosts.equiv).

On confluent diskless, there is an interesting benefit that becomes a challenge 
for bittorrent: a typical diskless node never downloads the whole diskless 
image.  This means less ram sucked up by the diskless image, and also that the 
diskless image can be large without pruning. Further, even the bits 
'downloaded' are permitted to be erased as needed by the kernel memory 
management, so the current expectation is that we don't expend resources on a 
diskless node to retain the image unless we absolutely need it. So a typical 
bittorrent flow would erode this benefit.

One could imagine a bittorrent scenario that would erode less of the value but 
would still come at a price.  If a similar trick were done to only torrent 
the parts as needed locally, then the critical portion for boot would be memory 
resident on each node.  We would still lose the ability for the kernel to free 
up that memory (either as needed or drop_cache), and much of the boot up 
contents do not need to be read again, so dropping their cache after boot can 
offer benefit.

Incidentally, another difference between xCAT and confluent diskless images is 
that in confluent the images are now encrypted.  This affords protection 
in case your diskless image contains some sensitive material.  The decryption 
key is available through the confluent API, and is generally authenticated by 
node TPM, so a diskless node persists trust through having the same TPM that 
had been previously authenticated. This fact allows the transport security to 
matter less, though our security policies are pretty insistent that https be 
used at all times.

I would be interested in developing a torrent-style boot design with confluent, 
with the lower-hanging fruit being 'untethered' mode, which is still available 
and does download the image (at the expense of RAM usage).  Interestingly, the 
logic is no longer inside the packed initramfs, but is loose in the profile.  
The link to the Red Hat 9 style diskless bootstrap is:
https://github.com/lenovo/confluent/blob/master/confluent_osdeploy/el9-diskless/profiles/default/scripts/imageboot.sh
Notably:

if [ "untethered" = "$(getarg confluent_imagemethod)" ]; then
    # untethered: copy the whole image into a RAM-backed tmpfs
    mount -t tmpfs untethered /mnt/remoteimg
    curl https://$confluent_whost/confluent-public/os/$confluent_profile/rootimg.sfs -o /mnt/remoteimg/rootimg.sfs
else
    # default: mount the image from its URL instead of downloading it up front
    confluent_urls="$confluent_urls https://$confluent_whost/confluent-public/os/$confluent_profile/rootimg.sfs"
    /opt/confluent/bin/urlmount $confluent_urls /mnt/remoteimg
fi

is the logic for getting the image.  One thing to note is that in a typical 
diskless image boot in confluent, the booted system does not see rootimg.sfs, 
so the torrent execution would have to stay in the 'initramfs' world (which 
does persist after boot, as a separate mount namespace).






From: Dr. Thomas Orgis 
Sent: Wednesday, March 29, 2023 11:37 AM
To: xCAT Users Mailing list 
Subject: [External] [xcat-user] BitTorrent distribution of stateless images 
with xCAT interesting to anyone?

Hi,

I first got into contact with xCAT through our HPC installed in 2015,
with xCAT version … hm …

# nodels --version
Version 2.9.1 (git commit 7f6043fffd62d482931b17b60f9488eb5754fdc1, built Thu Mar 19 03:25:35 EDT 2015)

2.9.1 seems to be it. The base 

Re: [xcat-user] rsync on https://xcat.org/files/xcat/repos

2023-06-09 Thread Vinícius Ferrão via xCAT-user
Any updates on this matter?

Can we get it guys?

Thanks.

On 24 May 2023, at 00:32, Vinícius Ferrão via xCAT-user 
 wrote:

As for https://xcat.org/files/xcat/repos/, which holds the actual repositories 
(.deb and .rpm), it’s only 35 GB.

# du -sh xcat
35G xcat

My plan is to also keep it public: http://mirror.versatushpc.com.br/xcat/

Regards.


Re: [xcat-user] rsync on https://xcat.org/files/xcat/repos

2023-05-23 Thread Vinícius Ferrão via xCAT-user
I’m planning to run the mirror command three times a day, at 02:00, 07:00 and 
21:00 (UTC-3).
The idea is to set it and forget it.

I’m building the repository right now. For the initial copy I used lftp 
(http://lftp.yar.ru), which is painfully slow and really bad with delta updates. 
It also fits my setup poorly: my idea is to run rsync directly on the storage 
(TrueNAS), but with lftp I had to mount the /var/repos directory on another 
machine just to run lftp.

Files are already available here:
http://mirror.versatushpc.com.br/xcat

rsync is definitely the best option for mirroring because of delta sync, as you 
said. Something like -aSH --delete --delete-delay --delay-updates would work 
well with rsync, assuming that every symbolic link on the xCAT side is 
relative, so that no symlink ends up broken after mirroring.
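
A minimal sketch of such a mirror job, assuming an rsync endpoint existed (at the time of this thread xcat.org did not offer one). The rsync URL, the destination path, and the script location in the crontab comment are hypothetical placeholders; only the rsync flags come from the message above. By default the script just prints the command it would run.

```shell
#!/bin/sh
# Sketch only: SRC is a hypothetical rsync endpoint, DEST a hypothetical path.
SRC="${SRC:-rsync://mirror.example.org/xcat/}"
DEST="${DEST:-/var/repos/xcat/}"
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, execute it otherwise.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# -a archive mode, -S handle sparse files, -H preserve hard links.
# --delete-delay and --delay-updates postpone deletions and file replacement
# until the end of the transfer, so readers do not see a half-updated tree.
mirror_xcat() {
    run rsync -aSH --delete --delete-delay --delay-updates "$1" "$2"
}

mirror_xcat "$SRC" "$DEST"

# Hypothetical crontab entry for 02:00, 07:00 and 21:00 local time:
# 0 2,7,21 * * * DRY_RUN=0 /usr/local/bin/mirror-xcat.sh
```

Running it with DRY_RUN=0 performs the actual transfer; the trailing slashes on both paths make rsync copy the contents of the source directory rather than the directory itself.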

Thanks.

On 23 May 2023, at 16:50, Nathan A Besaw via xCAT-user 
 wrote:

Hi Vinícius,

Currently rsync is not available from xcat.org, but we can consider enabling it.

How frequently do you plan to sync your local mirror from the xcat.org version?

For the initial sync, I think you can use wget or curl to recursively download 
everything from https://xcat.org/files/xcat/ (or whatever directories are 
relevant to you).
If you are going to refresh your mirror infrequently (after every release?), 
using wget or curl may be sufficient. If you want to resync more regularly, I 
would prefer to use a solution that includes incremental copying so you don't 
have to redownload every file every time.
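
The recursive download suggested above could be sketched with wget roughly as follows. The destination directory is a hypothetical placeholder, and the choice of flags is an assumption about what a reasonable invocation looks like, not something prescribed in this thread. By default the script only prints the command.

```shell
#!/bin/sh
# Sketch only: DEST is a hypothetical local path.
URL="${URL:-https://xcat.org/files/xcat/repos/}"
DEST="${DEST:-/var/repos/xcat}"
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, execute it otherwise.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# --mirror enables recursion plus timestamping, so files that are not newer
# than the local copy are skipped on later runs.
# --no-parent keeps wget below the start directory.
# --no-host-directories and --cut-dirs=2 drop "xcat.org/files/xcat" from the
# local path, leaving "repos/..." under DEST.
fetch_xcat() {
    run wget --mirror --no-parent --no-host-directories --cut-dirs=2 \
        --reject 'index.html*' --directory-prefix="$2" "$1"
}

fetch_xcat "$URL" "$DEST"
```

Unlike rsync with --delete, wget does not remove local files that have disappeared upstream, so a mirror kept this way only grows.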






Re: [xcat-user] rsync on https://xcat.org/files/xcat/repos

2023-05-23 Thread Vinícius Ferrão via xCAT-user
Hi Kilian,

reposync is great, mainly with Red Hat, since it’s the only way to mirror Red 
Hat distributions due to their “subscription manager” nature. I already use it 
for RHEL, but it’s not that great when you want to mirror multiple repositories 
and multiple versions. Also, with EL8 there’s the whole --download-metadata 
issue.

We do use it when it’s a stateful cluster with a single arch and a single 
distribution, but my objective here is to have a mirror, not a reposync one.

Also, as I said in the last message, the storage system has standard storage 
tools, like rsync. reposync is not available.
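
For comparison, the reposync approach discussed in this thread could look like the sketch below. It assumes a host with dnf and the dnf-plugins-core reposync plugin, and assumes the repositories are configured under the IDs xcat-core and xcat-dep; the download path is a hypothetical placeholder. By default the script only prints the commands.

```shell
#!/bin/sh
# Sketch only: DEST is a hypothetical path; repo IDs are assumed to match the
# [xcat-core] and [xcat-dep] sections of the local .repo files.
DEST="${DEST:-/var/repos/xcat}"
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, execute it otherwise.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Sync one configured repository into DEST/<repoid>, including its repodata.
sync_repo() {
    run dnf reposync --repoid="$1" --download-metadata --download-path="$2"
}

for repo in xcat-core xcat-dep; do
    sync_repo "$repo" "$DEST"
done
```

This needs one configured repository per distribution and version to be mirrored, which is the limitation described above.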

Regards. 

> On 23 May 2023, at 20:06, Kilian Cavalotti  
> wrote:
> 
> Hi all,
> 
> We routinely use reposync [1] to mirror the xcat-core and xcat-dep
> repositories, without rsync, and without having to re-download already
> downloaded packages.
> [2] has pointers on how to use it.
> 
> Hope this helps!
> 
> [1]: https://linux.die.net/man/1/reposync
> [2]: https://access.redhat.com/solutions/23016
> 
> Cheers,
> --
> Kilian
> 




Re: [xcat-user] rsync on https://xcat.org/files/xcat/repos

2023-05-23 Thread Vinícius Ferrão via xCAT-user
As for https://xcat.org/files/xcat/repos/, which holds the actual repositories 
(.deb and .rpm), it’s only 35 GB.

# du -sh xcat
35G xcat

My plan is to also keep it public: http://mirror.versatushpc.com.br/xcat/

Regards.

On 24 May 2023, at 00:26, Russell Jones  wrote:

How large is the xcat repository? If there's any concern about opening up rsync 
causing a bandwidth or availability issue, maybe the community could come 
together to host a few public mirrors with rsync access, and I'd be happy to be 
one.

I run "official" public mirrors for EPEL, Fedora, Rocky, and CentOS Stream. I 
have the infrastructure to donate :-)



[xcat-user] rsync on https://xcat.org/files/xcat/repos

2023-05-20 Thread Vinícius Ferrão via xCAT-user
Hello, I would like to know if rsync is available for the xCAT repository. I 
want to mirror it locally.

I know that I can download the entire tarball from a given version, but I would 
like to use rsync to keep it updated.

Thank you.





[xcat-user] Support for IBM Remote Supervisor Adapter II (RSA-II)

2024-01-09 Thread Vinícius Ferrão via xCAT-user

Hello,

This thread may be off-topic for this list, but I don’t have anywhere else to 
go where people may understand the question.

I bought this card thinking that it would provide IPMI so the machine could be 
controlled by Confluent (and maybe xCAT...), but I think I misunderstood what 
the device provides.

Does anyone know if this card is supported? Does it provide IPMI over LAN?


Long story:
There’s an old IBM System x3550 (the first one) that I use to test things, and 
I was trying to add it as a compute node in Confluent. Although it has an OOB 
Ethernet interface labeled as management, the port didn’t even link up when a 
network cable was plugged in.

So, after spending countless hours trying to figure it out, I discovered that 
the system needs an additional IBM RSA-2 Slimline card for this management port 
to work.

I think I incorrectly assumed that this card would provide a classic IPMI over 
LAN interface, since the server already has a BMC configuration section in the 
BIOS where I can even set LAN settings like the IP address.

So I sourced a card on the used market and spent 12 hours fighting with it due 
to wrong firmware, mismatches between the system BIOS and the card, broken 
download links on the IBM website, and that frustrating Fix Central webpage.

There is still a BMC update that I could not apply because the update package 
simply does not find the BMC on the server, probably because the package is for 
EL5 and I’m running EL7.

After fighting with this I was finally able to connect to the web interface 
that the RSA-2 provides. I can shut down and power on the server, see some 
information, and that’s it.

However, I cannot control the system remotely using ipmitool, and when using 
ipmitool in-band the LAN settings are different from those on the RSA-II card. 
So I think all this BMC configuration in the BIOS and the ipmitool lan commands 
are bogus on this system.
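
One way to make the comparison described above concrete is sketched below: print the LAN settings the BMC reports in-band, then probe an address out-of-band with both IPMI interfaces. The channel number 1, the host address, and the user name are assumptions for illustration only; `-E` makes ipmitool read the password from the IPMI_PASSWORD environment variable. By default the script only prints the commands.

```shell
#!/bin/sh
# Sketch only: address, user name and channel are hypothetical values.
BMC_HOST="${BMC_HOST:-192.0.2.10}"
BMC_USER="${BMC_USER:-admin}"
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, execute it otherwise.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# In-band: ask the local BMC for the LAN settings of the given channel.
inband_lan() {
    run ipmitool lan print "$1"
}

# Out-of-band: query the chassis power state over the network.
# "lanplus" is the IPMI 2.0 interface, "lan" the IPMI 1.5 one.
oob_status() {
    run ipmitool -I "$1" -H "$2" -U "$3" -E chassis status
}

inband_lan 1
oob_status lanplus "$BMC_HOST" "$BMC_USER"
oob_status lan "$BMC_HOST" "$BMC_USER"
```

If the address shown in-band differs from the RSA-II address and neither answers the out-of-band queries, that would match the observation above, although it would not by itself show whether the RSA-II speaks IPMI over LAN at all.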

Basically the card is pretty much useless and I just wasted time and a little 
money on this journey.

So is there any chance of making this work? Any workaround? Anyone who feels 
the pain or knows the hardware well enough to fill in the gaps in what I may be 
missing?

Thanks all.





Re: [xcat-user] [External] Re: RHEL9 support in xcat

2024-02-22 Thread Vinícius Ferrão via xCAT-user
A new update was committed yesterday that addresses installation on 9.3.

On 22 Feb 2024, at 21:35, Noah, Stuart via xCAT-user 
 wrote:

Hello Markus,

Are there any new updates on EL9 support since 1/11? I know that it hasn’t 
been much time, but I’m trying to put together a migration plan to upgrade our 
older cluster, including the OS.

Thank you

Sincerely,
Stuart

From: Imam Toufique <techie...@gmail.com>
Sent: Thursday, January 11, 2024 10:46 AM
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Subject: [External] Re: [xcat-user] RHEL9 support in xcat

Hi all,
thank you so much for your prompt responses. I wanted to try out rocky9, 
that's why I was wondering.

Markus, thanks for your valuable advice. Yeah, I wanted to get an idea about 
how 'ifcfg' would behave, you explained the issue exactly.
--IT

On Thu, Jan 11, 2024 at 10:37 AM Markus Hilger 
<markus.hil...@megware.com> wrote:
Hi,

yes, EL9 distros are working mostly fine.
Officially EL9 is in alpha state right now. See: 
https://github.com/xcat2/xcat-core/wiki/XCAT_2.16.5_Release_Notes
The next release will most probably move this to the next level, but it will 
take some more time. Please bear with us.

Some special configs like nics.nicextraparams etc. do not work (because ifcfg is 
deprecated and nicextraparams is not using nmcli yet).
But otherwise, management node installation and stateless/stateful node 
deployment are working fine.

Some example default osimage definitions with alma9:

# <xCAT data object stanza file>

alma9-x86_64-install-compute:
    objtype=osimage
    imagetype=linux
    osarch=x86_64
    osdistroname=rhels9-x86_64
    osname=linux
    osvers=rhels9
    otherpkgdir=/install/post/otherpkgs/alma9/x86_64
    pkgdir=/install/repos/almalinux/9/BaseOS/x86_64/os,/install/repos/almalinux/9/AppStream/x86_64/os,/install/repos/almalinux/9/CRB/x86_64/os,/install/repos/epel/9/Everything/x86_64
    pkglist=/opt/xcat/share/xcat/netboot/rh/compute.rhels9.x86_64.pkglist
    profile=compute
    provmethod=install
    synclists=/install/custom/install/alma/compute.alma9.synclist
    template=/opt/xcat/share/xcat/install/rh/compute.rhels9.tmpl


# <xCAT data object stanza file>

alma9-x86_64-netboot-compute:
    objtype=osimage
    exlist=/opt/xcat/share/xcat/netboot/rh/compute.rhels9.x86_64.exlist
    imagetype=linux
    osarch=x86_64
    osdistroname=rhels9-x86_64
    osname=linux
    osvers=rhels9
    otherpkgdir=/install/post/otherpkgs/alma9/x86_64
    permission=755
    pkgdir=/install/repos/almalinux/9/BaseOS/x86_64/os,/install/repos/almalinux/9/AppStream/x86_64/os,/install/repos/almalinux/9/CRB/x86_64/os,/install/repos/epel/9/Everything/x86_64
    pkglist=/opt/xcat/share/xcat/netboot/rh/compute.rhels9.x86_64.pkglist
    postinstall=/opt/xcat/share/xcat/netboot/rh/compute.rhels9.x86_64.postinstall
    profile=compute
    provmethod=netboot
    rootimgdir=/install/netboot/alma9/x86_64/compute
    synclists=/install/custom/netboot/alma/compute.alma9.synclist
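
Definitions like these are usually loaded from a stanza file. A minimal sketch of that workflow, assuming the stanzas were saved to a file on an xCAT management node; the file name and the node name are hypothetical, and the script only prints the commands unless DRY_RUN=0 is set.

```shell
#!/bin/sh
# Sketch only: STANZA and NODE are hypothetical names.
STANZA="${STANZA:-alma9-osimages.stanza}"
IMAGE="${IMAGE:-alma9-x86_64-netboot-compute}"
NODE="${NODE:-node01}"
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, execute it otherwise.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# mkdef -z reads object definitions in stanza format from standard input.
load_stanza() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "mkdef -z < $1"
    else
        mkdef -z < "$1"
    fi
}

# Generate the stateless root image, then pack it for network boot.
build_netboot_image() {
    run genimage "$1"
    run packimage "$1"
}

# Point a node at the osimage for its next boot.
assign_image() {
    run nodeset "$1" "osimage=$2"
}

load_stanza "$STANZA"
build_netboot_image "$IMAGE"
assign_image "$NODE" "$IMAGE"
```

The genimage and packimage steps apply only to the netboot definition; the install definition is assigned with nodeset directly.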


​Mit freundlichen Grüßen / Kind regards


Markus Hilger
HPC Engineer

MEGWARE Computer Vertrieb und Service GmbH
Nordstraße 19
09247 Chemnitz-Röhrsdorf, Germany
Tel: +49 3722 528-47
markus.hil...@megware.com
www.megware.com

Managing Directors: André Singer, Axel Auweter
District Court: Chemnitz HRB 584



From: Ryan Novosielski via xCAT-user <xcat-user@lists.sourceforge.net>
Sent: Thursday, 11 January 2024 17:40
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Cc: Ryan Novosielski <novos...@rutgers.edu>
Subject: Re: [xcat-user] RHEL9 support in xcat

I don’t know what-all happened at SC or whether a group has come together to 
continue it, but just remember that there’s a thread on this mailing list about 
the fact that xCAT is not going to be maintained going forward.

--
#BlackLivesMatter

|| \\UTGERS,     |---*O*---
||_// the State  | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
||  \\of NJ      | Office of Advanced Research Computing - MSB A555B, Newark
     `'


On Jan 10, 2024, at 11:54, Imam Toufique