Are you using DArrays? You should be able to move data with indexing. For the Cannon algorithm, you should be able to organize your communication so that each processor moves the data it needs - IIRC.
-viral On Saturday, June 21, 2014 11:08:06 PM UTC+5:30, Pietro Benedusi wrote: > > Hello, > > I need to write a distributed Cannon algorithm for matrix multiplication. > In every iteration I have to shift all the blocks of the involved matrices > or equivalently to move blocks between remote procs. How can I move blocks > from a remote proc to an other? > > Thnaks >
