https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87622
--- Comment #4 from Andreas Klein <klein at cage dot ugent.be> --- On Wed, 17 Oct 2018, burnus at gcc dot gnu.org wrote: > > If "b" (and "a") are allocated in some slower memory part, it matters how one > sums over the variables in the matmul loop. I know that there are difference in speed. But a factor 2 is realy large and should not happen. I created the example as a minimal example. I observerd similar problems in almost every coarray program I have tried. The result is always that the use coarrays of brings not the desired speed up and is just a wast of resouces.