Re: [OMPI users] Using MPI in Toro unikernel

2022-07-24 Thread Matias Vara via users
Hello Gilles,

Assuming you run one MPI task per unikernel, and two unikernels share
> nothing,
> it means that inter-node communication cannot be performed via shared
> memory or kernel feature
> (such as xpmem or knem). That also implies communication are likely using
> the loopback interface
> which is much slower.


Let me elaborate on it. I have one VM in which I have the unikernel. The VM
has many cores and each core has a single thread. Each thread runs a MPI
task. Each MPI task relies on shared-memory to communicate, i.e., VirtIO.
This is a shared-nothing unikernel because each core runs an instance of
the kernel that shares nothing with the other cores in terms of memory and
resources.

Regards, Matias.


Re: [OMPI users] Using MPI in Toro unikernel

2022-07-24 Thread Gilles Gouaillardet via users
Matias,

Assuming you run one MPI task per unikernel, and two unikernels share
nothing,
it means that inter-node communication cannot be performed via shared
memory or kernel feature
(such as xpmem or knem). That also implies communication are likely using
the loopback interface
which is much slower.

Cheers,

Gilles

On Sun, Jul 24, 2022 at 8:11 PM Matias Ezequiel Vara Larsen via users <
users@lists.open-mpi.org> wrote:

> Hello everyone,
>
> I have started to play with MPI and unikernels and I have recently
> implemented a minimal set of MPI APIs on top of Toro Unikernel
> (
> https://github.com/torokernel/torokernel/blob/example-mpi/examples/MPI/MpiReduce.pas
> ).
> I was wondering if someone may be interested in the use of unikernels to
> deploy MPI applications. Toro is a shared-nothing unikernel in which
> each core runs independently from one another. Also, memory is per-core
> to leverage NUMA. I was thinking that those features may improve the
> execution of MPI applications but I have not measured that yet. For the
> moment, I am running a simple MPI reduction with the MPI_SUM operation
> and watching how this behaves when the number of cores increases. Do you
> know any benchmark that I can run so try to test that?
>
> Matias
>


[OMPI users] Using MPI in Toro unikernel

2022-07-24 Thread Matias Ezequiel Vara Larsen via users
Hello everyone,

I have started to play with MPI and unikernels and I have recently
implemented a minimal set of MPI APIs on top of Toro Unikernel
(https://github.com/torokernel/torokernel/blob/example-mpi/examples/MPI/MpiReduce.pas).
I was wondering if someone may be interested in the use of unikernels to
deploy MPI applications. Toro is a shared-nothing unikernel in which
each core runs independently from one another. Also, memory is per-core
to leverage NUMA. I was thinking that those features may improve the
execution of MPI applications but I have not measured that yet. For the
moment, I am running a simple MPI reduction with the MPI_SUM operation
and watching how this behaves when the number of cores increases. Do you
know any benchmark that I can run so try to test that?

Matias