Re: [xcat-user] Problem deploying Dell C6420 from Ubuntu 18.04.05

2021-08-03 Thread Calvin Dodge
And ... when using the 18.04.2 default diskless image, the system hangs
about 5-6 seconds after boot.  It doesn't even provide the Alt-F2 screen
for using the shell.

Sigh ...

Calvin Dodge

On Tue, Aug 3, 2021 at 5:14 PM Calvin Dodge  wrote:

> We had to let this sit for a while due to higher priority items.
>
> After the deployment stops, we can log in to the shell on the console
> (Alt-F2).  At least one issue appears to be that the initrd image isn't set
> up correctly: megaraid_sas isn't loaded, and "modprobe megaraid_sas" fails
> with "module not found".  The modprobe DOES work after running "depmod -a",
> but I would think that should have been done before the image was created.
> I have no clue how to proceed once the disk is recognized.
>
> The same thing happens with Ubuntu 18.04.2.  At present I'm trying to
> create a diskless image for 18.04.2.  During this phase I've gotten many
> messages like "ldd: ./linux-vdso.so.1: No such file or directory", but
> genimage appears to be doing something.  I'll try packimage, then
> deploying.  At present, I think Ubuntu 18.04 is very poorly supported by
> xCAT.
>
> Sincerely,
>
> Calvin Dodge
___
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user


Re: [xcat-user] Problem deploying Dell C6420 from Ubuntu 18.04.05

2021-08-03 Thread Calvin Dodge
We had to let this sit for a while due to higher priority items.

After the deployment stops, we can log in to the shell on the console
(Alt-F2).  At least one issue appears to be that the initrd image isn't set
up correctly: megaraid_sas isn't loaded, and "modprobe megaraid_sas" fails
with "module not found".  The modprobe DOES work after running "depmod -a",
but I would think that should have been done before the image was created.
I have no clue how to proceed once the disk is recognized.
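
The symptom above is consistent with how modprobe works: it resolves module
names through the modules.dep index that depmod generates, so a module file
can be present in an image and still be reported as not found when that index
is missing or stale. A minimal sketch of a check for this, assuming the image
or initrd has been unpacked into a directory; the function name module_indexed
and its arguments are hypothetical and not part of xCAT:

```shell
#!/bin/sh
# Return 0 if modules.dep inside an unpacked image tree has an entry for
# the given module, non-zero otherwise.
# Usage: module_indexed <unpacked-tree> <kernel-version> <module-name>
module_indexed() {
    tree=$1
    kver=$2
    module=$3
    dep="$tree/lib/modules/$kver/modules.dep"
    # Each modules.dep line starts with "<path>/<module>.ko[.ext]:".
    [ -f "$dep" ] && grep -q "/${module}\.ko[^:]*:" "$dep"
}

# Example (hypothetical path and kernel version string):
#   module_indexed /tmp/initrd-unpacked 5.4.0-22-generic megaraid_sas \
#       || echo "megaraid_sas is not indexed; depmod has probably not been run"
```

If the check fails, running "depmod -b <unpacked-tree> <kernel-version>"
before repacking should regenerate the index. Whether genimage or packimage is
expected to do this is not established in this thread.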

The same thing happens with Ubuntu 18.04.2.  At present I'm trying to
create a diskless image for 18.04.2.  During this phase I've gotten many
messages like "ldd: ./linux-vdso.so.1: No such file or directory", but
genimage appears to be doing something.  I'll try packimage, then
deploying.  At present, I think Ubuntu 18.04 is very poorly supported by
xCAT.

Sincerely,

Calvin Dodge





Re: [xcat-user] Problem deploying Dell C6420 from Ubuntu 18.04.05

2021-03-17 Thread Calvin Dodge
Unfortunately, it appears xCAT breaks when trying to create a diskless
image.  That's probably another issue with 18.04.5.  I think we'll
just install 18.04.2 on the head node, and repeat all our
configuration steps.

Calvin Dodge

On Wed, Mar 17, 2021 at 8:30 AM Nathan A Besaw  wrote:
>
> Have you made any progress on this problem or are you still stuck?
>
> Can you deploy the node with a diskless image based on the same OS and see 
> the disks from within the booted OS?
>
> The newest version of Ubuntu 18 that xCAT officially supports is 18.04.2. 
> Several xCAT users have moved forward to 18.04.5, but
> there appear to be some issues with xCAT and Ubuntu 18.04.5 that are not well 
> understood.
>
> I don't have access to this combination of hardware and OS, perhaps other 
> members of the list have some experience with this combination.


Re: [xcat-user] Problem deploying Dell C6420 from Ubuntu 18.04.05

2021-03-17 Thread Nathan A Besaw

Have you made any progress on this problem or are you still stuck?

Can you deploy the node with a diskless image based on the same OS and see
the disks from within the booted OS?

The newest version of Ubuntu 18 that xCAT officially supports is 18.04.2.
Several xCAT users have moved forward to 18.04.5, but
there appear to be some issues with xCAT and Ubuntu 18.04.5 that are not
well understood.

I don't have access to this combination of hardware and OS; perhaps other
members of the list have some experience with it.



From: Calvin Dodge
To: xcat-user@lists.sourceforge.net
Date: 03/12/2021 06:41 PM
Subject: [EXTERNAL] [xcat-user] Problem deploying Dell C6420 from Ubuntu 18.04.05



We are trying to deploy C6420 nodes with a diskful image.  The
deployment hangs about 9.5 seconds after the kernel recognizes
the network and USB devices.

Installation details:  Head node was deployed using the live server ISO
ubuntu-18.04.5-live-server-amd64.iso.
The xcat-go install process created the /install/OS folder using that
ISO.  But we could not nodeset a node with that osimage name until we
ran copycds manually with the regular server ISO
ubuntu-18.04.5-server-amd64.iso.

We've seen other oddities, like xCAT looking for a folder named
"ubuntu-" when we tried to run genimage to create a diskless image.

Meanwhile, the nodes are using PERC controllers for storage, which
uses the megaraid_sas kernel module.  When we unpack the initrd used
in the initial PXE load, we don't find the kernel module there.  Could
that be the source of our deployment hang?  We see instructions on
adding modules, but they don't appear to be relevant, because (1) Dell
doesn't seem to provide a driver disk for the initial kernel
(5.4.0-22), and (2) the "use RPM" approach is unlikely to work with a
distribution that doesn't use RPMs.

We CAN install Ubuntu on the node using the server ISO, so the
necessary drivers are present there, at least.  So it seems to be an
xCAT issue.

Has anyone else encountered this issue?  If not, how can we diagnose
it, beyond adding (an as yet unfound) megaraid_sas kernel module to
the initial PXE initrd?
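
One way to tell a module that is really absent from one that is present but
unindexed is to search the unpacked initrd tree for the module file. A minimal
sketch, assuming the initrd has already been unpacked into a directory;
initrd_has_module is a hypothetical helper name, not an xCAT command:

```shell
#!/bin/sh
# Return 0 if a kernel module file exists anywhere in an unpacked initrd tree.
# Usage: initrd_has_module <unpacked-initrd-dir> <module-name>
initrd_has_module() {
    tree=$1
    module=$2
    # Module files may be plain (.ko) or compressed (.ko.xz, .ko.gz, ...).
    match=$(find "$tree" -type f -name "${module}.ko*" 2>/dev/null | head -n 1)
    [ -n "$match" ]
}

# Example (hypothetical path):
#   initrd_has_module /tmp/initrd-unpacked megaraid_sas \
#       && echo "module file present" || echo "module file absent"
```

A gzip-compressed cpio initrd can typically be unpacked in an empty directory
with "zcat initrd.gz | cpio -idm". Whether the initrd that xCAT serves over
PXE uses exactly that format is an assumption here.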

Calvin Dodge


Re: [xcat-user] Problem deploying Dell C6420 from Ubuntu 18.04.05

2021-03-12 Thread Vinícius Ferrão via xCAT-user
Are you using UEFI or Legacy BIOS?

netboot=pxe or netboot=xnba?
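
Both questions can usually be answered from the command line. A minimal
sketch of the firmware check, assuming a Linux shell on the compute node (for
example the Alt-F2 console mentioned elsewhere in this thread); boot_mode is a
hypothetical helper, and its optional argument exists only so the check can be
pointed at a different sysfs root:

```shell
#!/bin/sh
# Print "UEFI" if the running kernel was booted through UEFI firmware,
# otherwise "Legacy BIOS". The kernel exposes <sysfs>/firmware/efi only
# on UEFI boots.
# Usage: boot_mode [sysfs-root]   (defaults to /sys)
boot_mode() {
    sysroot=${1:-/sys}
    if [ -d "$sysroot/firmware/efi" ]; then
        echo "UEFI"
    else
        echo "Legacy BIOS"
    fi
}

# Example:
#   boot_mode
```

The netboot setting is stored in the node definition on the management node
and can be displayed there with "lsdef <node> -i netboot".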



> On 12 Mar 2021, at 20:42, Calvin Dodge  wrote:
> 
> We are trying to deploy C6420 nodes with a diskful image.  The
> deployment hangs about 9.5 seconds after the kernel recognizes
> the network and USB devices.
>
> Installation details:  Head node was deployed using the live server ISO
> ubuntu-18.04.5-live-server-amd64.iso.
> The xcat-go install process created the /install/OS folder using that
> ISO.  But we could not nodeset a node with that osimage name until we
> ran copycds manually with the regular server ISO
> ubuntu-18.04.5-server-amd64.iso.
> 
> We've seen other oddities, like xCAT looking for a folder named
> "ubuntu-" when we tried to run genimage to create a diskless image.
> 
> Meanwhile, the nodes are using PERC controllers for storage, which
> uses the megaraid_sas kernel module.  When we unpack the initrd used
> in the initial PXE load, we don't find the kernel module there.  Could
> that be the source of our deployment hang?  We see instructions on
> adding modules, but they don't appear to be relevant, because (1) Dell
> doesn't seem to provide a driver disk for the initial kernel
> (5.4.0-22), and (2) the "use RPM" approach is unlikely to work with a
> distribution that doesn't use RPMs.
> 
> We CAN install Ubuntu on the node using the server ISO, so the
> necessary drivers are present there, at least.  So it seems to be an
> xCAT issue.
> 
> Has anyone else encountered this issue?  If not, how can we diagnose
> it, beyond adding (an as yet unfound) megaraid_sas kernel module to
> the initial PXE initrd?
> 
> Calvin Dodge