Re: [lustre-discuss] 2.15 install failure

2023-08-04 Thread Laura Hild via lustre-discuss
I want to say I used to have a similar problem with...amdgpu-dkms, maybe?  
Where a reinstall or update operation wouldn't work right, seemingly because 
the cleanup for the previous version of the package would wipe out what the new 
one installed, so my procedure became to remove and install in two separate 
operations.  Whether it was the fault of DKMS itself or of amdgpu-dkms' 
packaging, I don't know, but even if it's the latter it suggests to me that 
DKMS doesn't make it easy to package robustly.
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


[lustre-discuss] 2.15 install failure

2023-08-04 Thread Thomas Roth via lustre-discuss

Hi all,

returning to my Lustre installations, the curious failures continue...

- Download of 2.15.3 for el8.8 from Whamcloud
- Installation of a server with rocky 8.8 (I mean, why not,  while it still 
exists...)
- Want an ldiskfs server, so
> dnf install lustre lustre-osd-ldiskfs-mount lustre-ldiskfs-dkms
 -->  Fails because the full ext4 source is not present.

I wonder whether I got the workaround from this mailing list, but it should 
really be in some official documentation or better not necessary at all:

- Rocky 8.8 installs with kernel 4.18.0-477.15.1, so download 
'kernel-4.18.0-477.15.1.el8_8.src.rpm'
> rpm -i ./kernel-4.18.0-477.15.1.el8_8.src.rpm
> tar xJf rpmbuild/SOURCES/linux-4.18.0-477.15.1.el8_8.tar.xz
> cp -a linux-4.18.0-477.15.1.el8_8/fs/ext4/* 
/usr/src/kernels/4.18.0-477.15.1.el8_8.x86_64/fs/ext4/


Of course, at this stage, 'lustre-ldiskfs-dkms' is already installed, so
> dnf reinstall lustre-ldiskfs-dkms

This plainly prints out that dkms is successfully installing / compiling all 
the modules, then prints
>  Running scriptlet: lustre-ldiskfs-dkms-2.15.3-1.el8.noarch 
  > 2/2

> Deprecated feature: REMAKE_INITRD 
(/var/lib/dkms/lustre-ldiskfs/2.15.3/source/dkms.conf)
> Deprecated feature: REMAKE_INITRD 
(/var/lib/dkms/lustre-ldiskfs/2.15.3/source/dkms.conf)
> Module lustre-ldiskfs-2.15.3 for kernel 4.18.0-477.15.1.el8_8.x86_64 (x86_64).
> Before uninstall, this module version was ACTIVE on this kernel.
> Removing any linked weak-modules

and the uninstalls all the modules 

Even the /var/lib/dkms/lustre-ldiskfs gets removed, so this machine is clean and pristine, just that dnf/rpm believe that lustre-ldiskfs-dkms is 
already installed. ;-)


(These messages printed between creation and destruction, they do not really 
indicate any kind of trouble, do they?)


Well. we all know we are dealing with computers and not with deterministic 
machines, so
> dnf remove lustre lustre-ldiskfs-dkms lustre-osd-ldiskfs-mount
and
> dnf install lustre-ldiskfs-dkms

(Drum roll...) Lustre modules get compiled, installed _and_ _not_ removed.


('modprobe lustre' works, 'dnf install lustre lustre-osd-ldiskfs-mount' does 
not create new havoc)

I'm flabbergasted and really have no idea how I misconfigured a simple, minimal 
el8.8 installation into this kind of behavior.


Cheers
Thomas


--
Thomas Roth   IT-HPC-Linux
Location: SB3 2.291   Phone: 1453

___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org