Have you made any progress on this problem or are you still stuck?

Can you deploy the node with a diskless image based on the same OS and see
the disks from within the booted OS?

The newest version of Ubuntu 18 that xCAT officially supports is 18.04.2.
Several xCAT users have moved forward to 18.04.5, but
there appear to be some issues with xCAT and Ubuntu 18.04.5 that are not
well understood.

I don't have access to this combination of hardware and OS, perhaps other
members of the list have some experience with this combination.



From:   Calvin Dodge <caldo...@gmail.com>
To:     xcat-user@lists.sourceforge.net
Date:   03/12/2021 06:41 PM
Subject:        [EXTERNAL] [xcat-user] Problem deploying Dell C6420 from Ubuntu
            18.04.05



We are trying to deploy C6420 nodes with a diskfull image.  The
deployment hangs about about 9.5 seconds after the kernel recognizes
the network and USB devices.

Installment details:  Head node was deployed using the live server ISO
ubuntu-18.04.5-live-server-amd64.iso.
The xcat-go install process created the /install/OS folder using that
ISO.  But we could not nodeset a node with that osimage name until we
ran copycds manually with the regular server ISO
ubuntu-18.04.5-server-amd64.iso.

We've seen other strangenesses, like xCAT looking for a folder named
"ubuntu-" when we tried to run genimage to create a diskless image

Meanwhile, the nodes are using PERC controllers for storage, which
uses the megaraid_sas kernel module.  When we unpack the initrd used
in the initial PXE load, we don't find the kernel module there.  Could
that be the source of our deployment hang?  We see instructions on
adding modules, but they don't appear to be relevant, because (1) Dell
doesn't seem to provide a driver disk for the initial kernel
(5.4.0-22), and (2) the "use RPM" approach is unlikely to work with a
distribution that doesn't use RPMs.

We CAN install Ubuntu on the node using the server ISO, so the
necessary drivers are present there, at least.  So it seems to be an
xCAT issue.

Has anyone else encountered this issue?  If not, how can we diagnose
it, beyond adding (an as yet unfound) megaraid_sas kernel module to
the initial PXE initrd?

Calvin Dodge
[attachment "cat_node_install_hung[33146].PNG" deleted by Nathan A
Besaw/Poughkeepsie/IBM] _______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.sourceforge.net_lists_listinfo_xcat-2Duser&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=95pZYYPXXV-5mn9YO7FX6q2JdbPYeCR6fMnFTgqZ_M8&m=jOGxgBpk2RY0sFkf4czZC_mW22LlbQeGFkAqdSoeE-M&s=aq3MGS4v8yB6srQiaWjx2rj79VGt3k7500bcUFnxfXw&e=



_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user

Reply via email to