[petsc-dev] Non-scalable matrix operations

Jed Brown Fri, 23 Dec 2011 12:55:22 -0600

On Fri, Dec 23, 2011 at 12:27, Mark F. Adams <mark.adams at columbia.edu> wrote:


> A more interesting thing is to partition down to the thread level and keep
> about 100 vertices per thread (this might be too big for a GPU...)


It's fine to have more partitions than threads.


> and then use locks of some sort for the shared memory synchronization


It can be lock-free: your thread just waits until a buffer has been marked
as updated. Since the reader/writer relationships are predefined, it's not
actually a lock. (More general methods can be done lock-free too.)
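
A minimal sketch of that kind of handoff, assuming C11 atomics and POSIX
threads: one buffer with exactly one writer and one reader, each waiting on
a counter instead of taking a mutex. The names (Handoff, GHOST_LEN,
NUM_STEPS, run_handoff_demo) are hypothetical and only for illustration;
this is not PETSc code, and a real implementation would likely back off or
do other work rather than spin.

```c
#include <assert.h>
#include <pthread.h>
#include <sched.h>
#include <stdatomic.h>

#define GHOST_LEN 4   /* hypothetical number of shared boundary values */
#define NUM_STEPS 100 /* hypothetical number of update steps */

/* One buffer whose single writer and single reader are fixed in advance. */
typedef struct {
  double     values[GHOST_LEN];
  atomic_int produced; /* last step the writer has marked as updated */
  atomic_int consumed; /* last step the reader has finished with */
  double     checksum; /* touched only by the reader; demo output */
} Handoff;

static void *writer(void *arg) {
  Handoff *h = arg;
  for (int step = 1; step <= NUM_STEPS; step++) {
    /* Wait until the reader is done with the previous contents. */
    while (atomic_load_explicit(&h->consumed, memory_order_acquire) != step - 1)
      sched_yield();
    for (int i = 0; i < GHOST_LEN; i++) h->values[i] = (double)step;
    /* Release store: the data written above is visible before the mark. */
    atomic_store_explicit(&h->produced, step, memory_order_release);
  }
  return NULL;
}

static void *reader(void *arg) {
  Handoff *h = arg;
  for (int step = 1; step <= NUM_STEPS; step++) {
    /* Wait until the buffer has been marked as updated for this step. */
    while (atomic_load_explicit(&h->produced, memory_order_acquire) != step)
      sched_yield();
    assert(h->values[0] == (double)step); /* we see this step's data */
    for (int i = 0; i < GHOST_LEN; i++) h->checksum += h->values[i];
    /* Tell the writer the buffer may be overwritten. */
    atomic_store_explicit(&h->consumed, step, memory_order_release);
  }
  return NULL;
}

/* Runs one writer/reader pair; returns the reader's checksum, or -1.0
   if a thread could not be created. */
double run_handoff_demo(void) {
  Handoff   h;
  pthread_t w, r;
  atomic_init(&h.produced, 0);
  atomic_init(&h.consumed, 0);
  h.checksum = 0.0;
  if (pthread_create(&w, NULL, writer, &h) != 0) return -1.0;
  if (pthread_create(&r, NULL, reader, &h) != 0) {
    /* No reader exists, so unblock the writer's waits before joining. */
    atomic_store_explicit(&h.consumed, NUM_STEPS, memory_order_release);
    pthread_join(w, NULL);
    return -1.0;
  }
  pthread_join(w, NULL);
  pthread_join(r, NULL);
  return h.checksum;
}
```

Calling run_handoff_demo() from a main() built with -pthread should return
the sum of GHOST_LEN * step over all steps. The waits here are spins on a
flag whose only writer is known ahead of time, which is the sense in which
there is no lock to contend for.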