Hi all,
I'm trying to do repeated matrix-vector multiplications of large sparse
matrices in Python using petsc4py. Even the simplest method of
parallelization, dividing the calculation up to run independently on
multiple processes, does not seem to give a significant speed-up
for large matrices. I constructed a minimal working example, which I
run using
mpiexec -n N python parallel_example.py,
where N is the number of processes. Instead of taking approximately
the same time irrespective of the number of processes used, the
calculation is much slower when starting more MPI processes. This
translates to little to no speed-up when splitting a fixed number
of calculations over N processes. As an example, running with N=1
takes 9s, while running with N=4 takes 34s. With smaller matrices
the problem is not as severe (only a factor of 1.5 slower when
setting MATSIZE=1e+5 instead of MATSIZE=1e+6). I get the same
problems when just starting the script four times manually without
using MPI.
I attached both the script and the log file for running the script
with N=4. Any help would be greatly appreciated. Calculations were done
on my laptop, running Arch Linux (kernel 6.6.8) and PETSc 3.20.2.
Kind Regards
Steffen
import sys
import time

import petsc4py
petsc4py.init(sys.argv)
from petsc4py import PETSc

MATSIZE = int(1e+6)  # matrix sizes must be integers, not floats

# Each process builds its own sequential matrix (COMM_SELF), so the
# runs are completely independent -- no communication between ranks.
mat = PETSc.Mat().createAIJ((MATSIZE, MATSIZE), comm=PETSc.COMM_SELF)
mat.setPreallocationNNZ(50)  # preallocate 50 nonzeros per row
mat.setRandom()
mat.assemble()

x, b = mat.createVecs()
x.setRandom()

time_start = time.time()
for _ in range(100):
    mat.mult(x, b)  # b = mat @ x
if PETSc.COMM_WORLD.rank == 0:
    print(f"{time.time() - time_start:.2f}s")
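For completeness, the kind of static splitting I mean (each process
handling its own share of the independent multiplications) can be
sketched like this; tasks_for_rank is a hypothetical helper, and in the
real script rank and size would come from PETSc.COMM_WORLD:

```python
def tasks_for_rank(total_tasks, rank, size):
    # Contiguous block partition: every rank gets roughly
    # total_tasks / size tasks; the first (total_tasks % size)
    # ranks each get one extra task.
    base, extra = divmod(total_tasks, size)
    start = rank * base + min(rank, extra)
    count = base + (1 if rank < extra else 0)
    return range(start, start + count)

# Example: 10 independent mat-vec runs split over 4 ranks;
# each rank would loop over its own task indices only.
for rank in range(4):
    print(rank, list(tasks_for_rank(10, rank, 4)))
```

Since the ranks never communicate, I would expect the wall time for a
fixed total task count to shrink roughly with N, which is what I am not
seeing.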