Re: [lustre-discuss] Could not read from remote repository

2024-04-11 Thread Jeff Johnson
Glad to help!

On Thu, Apr 11, 2024 at 12:03 PM Jannek Squar
 wrote:
>
> There must have been a problem with the connection, the same line is now
> working again. Thanks for your help.
>
> On 11/04/2024 20:54, Jeff Johnson wrote:
> > Works fine for me. Make sure you don't have a hyperlink embedded in
> > text or something.
> >
> > git clone git://git.whamcloud.com/fs/lustre-release.git
> >
> > [jeff@spinaltap ~/Devel] $ git clone
> > git://git.whamcloud.com/fs/lustre-release.git
> > Cloning into 'lustre-release'...
> > remote: Counting objects: 386278, done.
> > remote: Compressing objects: 100% (81507/81507), done.
> > remote: Total 386278 (delta 286760), reused 384310 (delta 284792)
> > Receiving objects: 100% (386278/386278), 162.22 MiB | 5.73 MiB/s, done.
> > Resolving deltas: 100% (286760/286760), done.
> >
> > On Tue, Apr 9, 2024 at 3:20 AM Jannek Squar via lustre-discuss
> >  wrote:
> >>
> >> Hey,
> >>
> >> I tried to clone the source code via `git clone
> >> git://git.whamcloud.com/fs/lustre-release.git` but got an error:
> >>
> >> """
> >> fatal: Could not read from remote repository.
> >>
> >> Please make sure you have the correct access rights
> >> and the repository exists.
> >> """
> >>
> >> Is there something going on with the repository or is the error probably
> >> on my side?
> >>
> >> Cheers
> >> Jannek
> >>
> >> --
> >> Jannek Squar
> >> Universität Hamburg
> >> Fakultät für Mathematik, Informatik und Naturwissenschaften
> >> Fachbereich Informatik
> >> Arbeitsbereich Wissenschaftliches Rechnen
> >>
> >> Bundesstraße 45a
> >> D-20146 Hamburg
> >>
> >> Tel: +49 40 460094-219
> >> ___
> >> lustre-discuss mailing list
> >> lustre-discuss@lists.lustre.org
> >> http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
> >
> >
> >
>
> --
> Jannek Squar
> Universität Hamburg
> Fakultät für Mathematik, Informatik und Naturwissenschaften
> Fachbereich Informatik
> Arbeitsbereich Wissenschaftliches Rechnen
>
> Bundesstraße 45a
> D-20146 Hamburg
>
> Tel: +49 40 460094-219



-- 
--
Jeff Johnson
Co-Founder
Aeon Computing

jeff.john...@aeoncomputing.com
www.aeoncomputing.com
t: 858-412-3810 x1001   f: 858-412-3845
m: 619-204-9061

4170 Morena Boulevard, Suite C - San Diego, CA 92117

High-Performance Computing / Lustre Filesystems / Scale-out Storage
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


Re: [lustre-discuss] Could not read from remote repository

2024-04-11 Thread Jannek Squar via lustre-discuss
There must have been a problem with the connection, the same line is now 
working again. Thanks for your help.


On 11/04/2024 20:54, Jeff Johnson wrote:

Works fine for me. Make sure you don't have a hyperlink embedded in
text or something.

git clone git://git.whamcloud.com/fs/lustre-release.git

[jeff@spinaltap ~/Devel] $ git clone
git://git.whamcloud.com/fs/lustre-release.git
Cloning into 'lustre-release'...
remote: Counting objects: 386278, done.
remote: Compressing objects: 100% (81507/81507), done.
remote: Total 386278 (delta 286760), reused 384310 (delta 284792)
Receiving objects: 100% (386278/386278), 162.22 MiB | 5.73 MiB/s, done.
Resolving deltas: 100% (286760/286760), done.

On Tue, Apr 9, 2024 at 3:20 AM Jannek Squar via lustre-discuss
 wrote:


Hey,

I tried to clone the source code via `git clone
git://git.whamcloud.com/fs/lustre-release.git` but got an error:

"""
fatal: Could not read from remote repository.

Please make sure you have the correct access rights
and the repository exists.
"""

Is there something going on with the repository or is the error probably
on my side?

Cheers
Jannek

--
Jannek Squar
Universität Hamburg
Fakultät für Mathematik, Informatik und Naturwissenschaften
Fachbereich Informatik
Arbeitsbereich Wissenschaftliches Rechnen

Bundesstraße 45a
D-20146 Hamburg

Tel: +49 40 460094-219
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org






--
Jannek Squar
Universität Hamburg
Fakultät für Mathematik, Informatik und Naturwissenschaften
Fachbereich Informatik
Arbeitsbereich Wissenschaftliches Rechnen

Bundesstraße 45a
D-20146 Hamburg

Tel: +49 40 460094-219
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


Re: [lustre-discuss] Could not read from remote repository

2024-04-11 Thread Jeff Johnson
Works fine for me. Make sure you don't have a hyperlink embedded in
text or something.

git clone git://git.whamcloud.com/fs/lustre-release.git

[jeff@spinaltap ~/Devel] $ git clone
git://git.whamcloud.com/fs/lustre-release.git
Cloning into 'lustre-release'...
remote: Counting objects: 386278, done.
remote: Compressing objects: 100% (81507/81507), done.
remote: Total 386278 (delta 286760), reused 384310 (delta 284792)
Receiving objects: 100% (386278/386278), 162.22 MiB | 5.73 MiB/s, done.
Resolving deltas: 100% (286760/286760), done.

On Tue, Apr 9, 2024 at 3:20 AM Jannek Squar via lustre-discuss
 wrote:
>
> Hey,
>
> I tried to clone the source code via `git clone
> git://git.whamcloud.com/fs/lustre-release.git` but got an error:
>
> """
> fatal: Could not read from remote repository.
>
> Please make sure you have the correct access rights
> and the repository exists.
> """
>
> Is there something going on with the repository or is the error probably
> on my side?
>
> Cheers
> Jannek
>
> --
> Jannek Squar
> Universität Hamburg
> Fakultät für Mathematik, Informatik und Naturwissenschaften
> Fachbereich Informatik
> Arbeitsbereich Wissenschaftliches Rechnen
>
> Bundesstraße 45a
> D-20146 Hamburg
>
> Tel: +49 40 460094-219
> ___
> lustre-discuss mailing list
> lustre-discuss@lists.lustre.org
> http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org



-- 
--
Jeff Johnson
Co-Founder
Aeon Computing

jeff.john...@aeoncomputing.com
www.aeoncomputing.com
t: 858-412-3810 x1001   f: 858-412-3845
m: 619-204-9061

4170 Morena Boulevard, Suite C - San Diego, CA 92117

High-Performance Computing / Lustre Filesystems / Scale-out Storage
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


Re: [lustre-discuss] ko2iblnd.conf

2024-04-11 Thread Daniel Szkola via lustre-discuss
On the server node(s):

options ko2iblnd-opa peer_credits=32 peer_credits_hiw=16 credits=1024 
concurrent_sends=64 ntx=2048 map_on_demand=256 fmr_pool_size=2048 
fmr_flush_trigger=512 fmr_cache=1 conns_per_peer=4

On clients:

options ko2iblnd peer_credits=128 peer_credits_hiw=64 credits=1024 
concurrent_sends=256 ntx=2048 map_on_demand=32 fmr_pool_size=2048 
fmr_flush_trigger=512 fmr_cache=1 conns_per_peer=4

My concern isn’t so much the mismatch because I know that’s an issue but rather 
what numbers we should settle on with a recent lustre build. I also see the 
ko2iblnd-opa in the server config, which means because the server is actually 
loading ko2iblnd that maybe defaults are used?

What made me look was we were seeing lots of:
LNetError: 2961324:0:(o2iblnd_cb.c:2612:kiblnd_passive_connect()) Can't accept 
conn from xxx.xxx.xxx.xxx@o2ib2, queue depth too large:  42 (<=32 wanted)

—
Dan Szkola
FNAL


> On Apr 11, 2024, at 12:36 PM, Andreas Dilger  wrote:
> 
> [EXTERNAL] – This message is from an external sender
> 
> 
> On Apr 11, 2024, at 09:56, Daniel Szkola via lustre-discuss 
>  wrote:
>> 
>> Hello all,
>> 
>> I recently discovered some mismatches in our /etc/modprobe.d/ko2iblnd.conf 
>> files between our clients and servers.
>> 
>> Is it now recommended to keep the defaults on this module and run without a 
>> config file or are there recommended numbers for lustre-2.15.X?
>> 
>> The only thing I’ve seen that provides any guidance is the Lustre wiki and 
>> an HP/Cray doc:
>> 
>> https://www.hpe.com/psnow/resources/ebooks/a00113867en_us_v2/Lustre_Server_Recommended_Tuning_Parameters_4.x.html
>> 
>> Anyone have any sage advice on what the ko2iblnd.conf (and possibly 
>> ko2iblnd-opa.conf and hfi1.conf as well) on modern systems?
> 
> It would be useful to know what specific settings are mismatched.  Definitely 
> some of them need to be consistent between peers, others depend on your 
> system.
> 
> Cheers, Andreas
> --
> Andreas Dilger
> Lustre Principal Architect
> Whamcloud
> 
> 
> 
> 
> 
> 
> 

___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


Re: [lustre-discuss] ko2iblnd.conf

2024-04-11 Thread Andreas Dilger via lustre-discuss
On Apr 11, 2024, at 09:56, Daniel Szkola via lustre-discuss 
mailto:lustre-discuss@lists.lustre.org>> wrote:

Hello all,

I recently discovered some mismatches in our /etc/modprobe.d/ko2iblnd.conf 
files between our clients and servers.

Is it now recommended to keep the defaults on this module and run without a 
config file or are there recommended numbers for lustre-2.15.X?

The only thing I’ve seen that provides any guidance is the Lustre wiki and an 
HP/Cray doc:

https://www.hpe.com/psnow/resources/ebooks/a00113867en_us_v2/Lustre_Server_Recommended_Tuning_Parameters_4.x.html

Anyone have any sage advice on what the ko2iblnd.conf (and possibly 
ko2iblnd-opa.conf and hfi1.conf as well) on modern systems?

It would be useful to know what specific settings are mismatched.  Definitely 
some of them need to be consistent between peers, others depend on your system.

Cheers, Andreas
--
Andreas Dilger
Lustre Principal Architect
Whamcloud







___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


[lustre-discuss] ko2iblnd.conf

2024-04-11 Thread Daniel Szkola via lustre-discuss
Hello all,

I recently discovered some mismatches in our /etc/modprobe.d/ko2iblnd.conf 
files between our clients and servers.

Is it now recommended to keep the defaults on this module and run without a 
config file or are there recommended numbers for lustre-2.15.X?

The only thing I’ve seen that provides any guidance is the Lustre wiki and an 
HP/Cray doc:

https://www.hpe.com/psnow/resources/ebooks/a00113867en_us_v2/Lustre_Server_Recommended_Tuning_Parameters_4.x.html

Anyone have any sage advice on what the ko2iblnd.conf (and possibly 
ko2iblnd-opa.conf and hfi1.conf as well) on modern systems?

—
Dan Szkola
FNAL
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org