Re: [lustre-discuss] LNet nid down after some thing changed the NICs

2023-02-18 Thread CJ Yin via lustre-discuss
Hi Chris,

Thanks for your help. I have collected the relevant logs according to your
hints. But I need an account to open a ticket on Jira. I have sent an
email to the administrator at i...@whamcloud.com. I was wondering if this
is the correct way to apply for an account. I only found this email on the
site.

Regards,
Chuanjun

Horn, Chris  于2023年2月18日周六 00:52写道:

> If deleting and re-adding it restores the status to up then this sounds
> like a bug to me.
>
>
>
> Can you enable debug tracing, reproduce the issue, and add this
> information to a ticket?
>
> To enable/gather debug:
>
> # lctl set_param debug=+net
> 
> # lctl dk > /tmp/dk.log
>
> You can create a ticket at https://jira.whamcloud.com/
>
> Please provide the dk.log with the ticket.
>
>
>
> Thanks,
>
> Chris Horn
>
>
>
> *From: *lustre-discuss  on
> behalf of 腐朽银 via lustre-discuss 
> *Date: *Friday, February 17, 2023 at 2:53 AM
> *To: *lustre-discuss@lists.lustre.org 
> *Subject: *[lustre-discuss] LNet nid down after some thing changed the
> NICs
>
> Hi,
>
>
>
> I encountered a problem when using Lustre Client on k8s with kubenet. Very
> happy if you could help me.
>
>
>
> My LNet configuration is:
>
>
>
> net:
> - net type: lo
>   local NI(s):
> - nid: 0@lo
>   status: up
> - net type: tcp
>   local NI(s):
> - nid: 10.224.0.5@tcp
>   status: up
>   interfaces:
>   0: eth0
>
>
>
> It works. But after I deploy or delete a pod on the node. The nid goes
> down like:
>
>
>
> - nid: 10.224.0.5@tcp
>   status: down
>   interfaces:
>   0: eth0
>
>
>
> k8s uses veth pairs, so it will add or delete network interfaces when
> deploying or deleting pods. But it doesn't touch the eth0 NIC. I can fix it
> by deleting the tcp net by `lnetctl net del` and re-add it by `lnetctl net
> add`. But I need to do this every time after a pod is scheduled to this
> node.
>
>
>
> My node OS is Ubuntu 18.04 5.4.0-1101-azure. The Lustre Client is built by
> myself from 2.15.1. Is this an expected LNet behavior or I got something
> wrong? I re-build and tested it several times and got the same problem.
>
>
>
> Regards,
>
> Chuanjun
>
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


Re: [lustre-discuss] ZFS Support For Lustre

2023-02-18 Thread Hans Henrik Happe via lustre-discuss

Hi,

The repos, in general, only work for the kernel they were build for. 
This will be the supported kernel by the release. Look in the changelog. 
For 2.15.1:


https://wiki.lustre.org/Lustre_2.15.1_Changelog

To make newer kernels work you have to compile yourself and that might 
not even work without patching and hasn't gone through testing.


You are better of following the supported kernel on the servers. The 
client is usually more likely to compile/work on newer kernels, but 
patching might be needed.


Cheers,
Hans Henrik

On 14.02.2023 13.16, Nick dan via lustre-discuss wrote:

Hi

I am using Lustre Version 2.15.1 on RedHat 8.8
As mentioned in the link, 
https://wiki.whamcloud.com/display/PUB/Lustre+Support+Matrix , the ZFS 
Version required is 2.1.2.
However, when I am trying to install ZFS from 
https://downloads.whamcloud.com/public/lustre/lustre-2.15.1/el8.6/server/RPMS/x86_64/

I am getting the following error
[root@st01 user]# yum install 
https://downloads.whamcloud.com/public/lustre/lustre-2.15.1/el8.6/server/RPMS/x86_64/zfs-2.1.2-1.el8.x86_64.rpm

Updating Subscription Management repositories.
Last metadata expiration check: 2:01:30 ago on Tue 14 Feb 2023 
03:38:48 PM IST.

zfs-2.1.2-1.el8.x86_64.rpm                  248 kB/s | 649 kB     00:02
Error:
 Problem: conflicting requests
  - nothing provides zfs-kmod = 2.1.2 needed by zfs-2.1.2-1.el8.x86_64
(try to add '--skip-broken' to skip uninstallable packages or 
'--nobest' to use not only best candidate packages)


I have installed the other required packages like libzfs, libzpool, 
libnvpair, libutil.


I am not able to download kmod-zfs version 2.1.2, as the latest 
version getting downloaded is 2.1.9


Can you help with this or suggest another way to download all 
supported ZFS Packages?


Thanks,
Nick Dan

___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


Re: [lustre-discuss] Access times for file (file heat)

2023-02-18 Thread Andreas Dilger via lustre-discuss
Anna, there was a client-side file heat mechanism added a few years ago, but I 
don't know if it is fully functional today.

lctl get_param llite.*.*heat*
llite.myth-979380fc1800.file_heat=1
llite.myth-979380fc1800.heat_decay_percentage=80
llite.myth-979380fc1800.heat_period_second=60

And then "lfs heat_get " to dump the file heat,  it there haven't been 
any good tools developed yet to list top heat files. 

Cheers, Andreas

> On Feb 7, 2023, at 08:56, Anna Fuchs via lustre-discuss 
>  wrote:
> 
> Hello,
> 
> is there a way to see how many times a file has been accessed ever (like a 
> heat map)?
> 
> Thanks
> Anna
> 
> -- 
> Anna Fuchs
> Universität Hamburg
> https://wr.informatik.uni-hamburg.de
> 
> anna.fu...@informatik.uni-hamburg.de
> https://wr.informatik.uni-hamburg.de/people/anna_fuchs
> 
> 
> 
> ___
> lustre-discuss mailing list
> lustre-discuss@lists.lustre.org
> http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org