I'm not sure it is best to say that "the standard way to handle this" is
to partition the elements. Minimization of communication calls for
partitioning the nodes (at the expense of performing extra element
computations).
On 2/26/17 8:15 PM, Barry Smith wrote:
The standard way to handle this for finite elements is to partition the
elements among the processes and then partition the nodes (rows of the degrees
of freedom) subservient to the partitioning of the elements. Otherwise most of
the matrix (or vector) entries must be communicated and this is not scalable.
So how are you partitioning the elements (for matrix stiffness
computations) and the nodes between processes?