Re: Mir vs. Numpy: Reworked!

9il via Digitalmars-d-announce Sat, 05 Dec 2020 00:10:57 -0800

On Friday, 4 December 2020 at 20:26:17 UTC, data pulverizer wrote:

On Friday, 4 December 2020 at 14:48:32 UTC, jmh530 wrote:
It looks like all the `sweep_XXX` functions are only defined for contiguous slices, as that would be the default if you define a Slice!(T, N).
How the functions access the data is a big difference. If you compare the `sweep_field` version with the `sweep_naive` version, the `sweep_field` function is able to access through one index, whereas the `sweep_naive` function has to use two in the 2d version and 3 in the 3d version.
Also, the main difference in the NDSlice version is that it uses *built-in* MIR functionality, like how `sweep_ndslice` uses the `each` function from MIR, whereas `sweep_field` uses a for loop. I think this is partially to show that the built-in MIR functionality is as fast as if you tried to do it with a for loop yourself.
I see, looking at some of the code, the field case is literally doing the indexing calculation right there. I guess ndslice is doing the same thing, just with "Mir magic" in the background?
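
To make the one-index versus two-index distinction concrete, here is a minimal sketch in plain Python. It is not the benchmark's code: the four-neighbour averaging stencil, the square `n` x `n` grid, and the function names `sweep_two_indices` and `sweep_flat_index` are assumptions made only for illustration. It shows the access pattern, not the performance.

```python
def sweep_two_indices(u, n):
    """In-place sweep over an n x n grid stored as a list of rows.

    Every access goes through two indices, u[i][j].
    """
    for i in range(1, n - 1):
        for j in range(1, n - 1):
            u[i][j] = 0.25 * (u[i - 1][j] + u[i + 1][j] + u[i][j - 1] + u[i][j + 1])


def sweep_flat_index(flat, n):
    """Same sweep over a row-major flat list of length n * n.

    The position is computed once as k = i * n + j; the neighbours are
    then reached by fixed offsets from that single index.
    """
    for i in range(1, n - 1):
        for j in range(1, n - 1):
            k = i * n + j  # the index calculation, written out by hand
            flat[k] = 0.25 * (flat[k - n] + flat[k + n] + flat[k - 1] + flat[k + 1])
```

Both functions visit the interior points in the same order and leave the boundary untouched, so they produce the same values; only the way each element is addressed differs.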

sweep_ndslice uses (2*N - 1) arrays to index U; this allows LDC to unroll the loop.


More details here
https://forum.dlang.org/post/[email protected]

I'm still not sure why slice is so slow. Doesn't that completely rely on the opSlice implementations? The choice of indexing method and underlying data structure?

sweep_slice is slower because it iterates over the data in several loops rather than in a single one. For small matrices this makes the JMP/FLOP ratio higher; for large matrices that can't fit into the CPU cache, it is less memory efficient.
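
As a rough sketch of the "several loops versus one loop" point, the plain Python below computes the same scaled sum of four arrays in two ways. This is not the `sweep_slice` implementation; the function names and the out-of-place formulation are assumptions chosen only to show the loop structure.

```python
def scaled_sum_multi_pass(a, b, c, d):
    """Four separate loops over the data, with a temporary in between."""
    tmp = [x + y for x, y in zip(a, b)]  # pass 1: a + b
    for k in range(len(tmp)):            # pass 2: add c
        tmp[k] += c[k]
    for k in range(len(tmp)):            # pass 3: add d
        tmp[k] += d[k]
    for k in range(len(tmp)):            # pass 4: scale
        tmp[k] *= 0.25
    return tmp


def scaled_sum_single_pass(a, b, c, d):
    """One fused loop: each element is read once and finished in one go."""
    return [0.25 * (a[k] + b[k] + c[k] + d[k]) for k in range(len(a))]
```

The multi-pass version executes one loop branch per element per pass for the same number of additions, and it writes and re-reads `tmp` on every pass, which is the extra memory traffic that hurts once the data no longer fits in cache.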
