On Wed, 26 Nov, 2014 at 8:32 AM, Johan Hake <[email protected]> wrote:
On Wed, Nov 26, 2014 at 9:22 AM, Garth N. Wells <[email protected]> wrote:
On Wed, 26 Nov, 2014 at 7:50 AM, Johan Hake <[email protected]> wrote:
On Wed, Nov 26, 2014 at 8:34 AM, Garth N. Wells <[email protected]> wrote:
On Tue, 25 Nov, 2014 at 9:48 PM, Johan Hake <[email protected]> wrote:
Hello!
I just pushed some fixes to the JIT interface of DOLFIN. Now one
can JIT on different MPI groups.
Nice.
Previously, JITing was only done on rank 1 of mpi_comm_world.
Now it is done on rank 1 of any passed group communicator.
Do you mean rank 0?
Yes, of course.
There is no demo showing this at the moment, but a test has been added:
test/unit/python/jit/test_jit_with_mpi_groups.py
Here an expression, a subdomain, and a form are constructed on
different ranks using groups. It is somewhat tedious, as one needs
to initialize PETSc with the same group; otherwise PETSc will
deadlock during initialization (the moment a PETSc linear algebra
object is constructed).
This is ok. It's arguably a design flaw that we don't make the
user handle MPI initialisation manually.
Sure, it is just somewhat tedious. You cannot start your typical
script by importing dolfin.
The procedure in Python for this is (a sketch follows the list):
1) Construct MPI groups using mpi4py
2) Initialize petsc4py using the groups
3) Wrap the groups as petsc4py communicators (dolfin only supports
petsc4py, not mpi4py)
4) Import dolfin
5) Do group-specific stuff:
a) Functions and forms: no change needed, as the communicator
is passed via the mesh
b) domain = CompiledSubDomain("...", mpi_comm=group_comm)
c) e = Expression("...", mpi_comm=group_comm)
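In code, something like the following (an untested sketch; the even/odd
split, the mesh constructor call, and the comm keyword to petsc4py.init
are only illustrative and may need adjusting for your petsc4py/DOLFIN
versions):

from mpi4py import MPI

# 1) Construct the group communicator: split the world into even-rank
#    and odd-rank groups (illustrative choice).
world = MPI.COMM_WORLD
group = world.Split(world.rank % 2)

# 2) Initialize petsc4py with the group communicator *before* dolfin
#    is imported (the comm keyword is assumed here; check your
#    petsc4py version).
import petsc4py
petsc4py.init(comm=group)

# 3) Wrap the mpi4py group communicator as a petsc4py communicator,
#    since dolfin only accepts petsc4py communicators.
from petsc4py import PETSc
group_comm = PETSc.Comm(group)

# 4) Only now import dolfin.
from dolfin import *

# 5) Group-specific objects: the mesh carries the communicator for
#    functions and forms; CompiledSubDomain and Expression take it
#    explicitly.
mesh = UnitSquareMesh(group_comm, 8, 8)
domain = CompiledSubDomain("near(x[0], 0.0)", mpi_comm=group_comm)
e = Expression("sin(x[0])", mpi_comm=group_comm)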
It's not so clear whether passing the communicator means that the
Expression is only defined/available on group_comm, or if
group_comm is simply to control who does the JIT. Could you
clarify this?
My knowledge of MPI is not that good. I have only tried to access
(and construct) the Expression on ranks included in that group.
Also, when I tried to construct one using a group communicator on a
rank that is not included in the group, I got an error when calling
MPI_size on it. There is probably a perfectly reasonable
explanation for this.
Could you clarify what goes on behind the scenes with the
communicator? Is it only used in a call to get the process rank?
What do the ranks other than zero do?
I'm not sure what you want to know. Instead of using mpi_comm_world to
construct meshes, you use the group communicator. This communicator
has its own local group of ranks. JITing is still done on rank 0
of the local group, which might be, and most often is, different from
the rank 0 process of mpi_comm_world.
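For example (a small illustration, assuming four world ranks split
into two groups of two):

from mpi4py import MPI

world = MPI.COMM_WORLD
# Split four world ranks into two groups: {0, 1} and {2, 3}.
group_comm = world.Split(world.rank // 2)

# The process with world rank 2 has rank 0 in its group, so it (and
# not world rank 0) does the JIT for objects built on that group.
print("world rank %d -> group rank %d" % (world.rank, group_comm.rank))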
I just want to be clear (and have in the docstring) that
e = Expression("...", mpi_comm=group_comm)
is valid only on group_comm (if this is the case), or make clear that
the communicator only determines the process that does the JIT.
If we required all Expressions to have a domain/mesh, as Martin
advocates, things would be clearer.
The group communicator works exactly like the world communicator, but
now on just a subset of the processes. There were some sharp edges,
with deadlocks as a consequence, when barriers were taken on the
world communicator. This happens by default when dolfin is imported
and PETSc gets initialized with the world communicator. So we need to
initialize PETSc using the group communicator. Other than that there
are no real differences.
That doesn't sound right. PETSc initialisation does not take a
communicator. It is collective on MPI_COMM_WORLD, but each PETSc object
takes a communicator at construction, which can be something other than
MPI_COMM_WORLD or MPI_COMM_SELF.
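For example, with petsc4py (a rough sketch; the split is just for
illustration):

from mpi4py import MPI
import petsc4py
petsc4py.init()                # collective on MPI_COMM_WORLD
from petsc4py import PETSc

# Each PETSc object takes its own communicator at construction.
sub_comm = MPI.COMM_WORLD.Split(MPI.COMM_WORLD.rank % 2)
x = PETSc.Vec().create(comm=sub_comm)         # Vec on a sub-communicator
y = PETSc.Vec().create(comm=PETSc.COMM_SELF)  # Vec local to this process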
Garth
Johan
Garth
Please try it out and report any sharp edges. A demo would also
be fun to include :)
We could run tests on different communicators to speed them up on
machines with high core counts!
True!
Johan
Garth
Johan